29 Deltaclass Technology Solutions Jobs
6-9 years
Site Reliability Engineer - Incident Management (6-9 yrs)
Deltaclass Technology Solutions
posted 17hr ago
Key skills for the job
Role : SRE Senior Engineer
Experience : 6-8 years
Responsibilities :
- Define, track, and report on SLOs and SLIs for critical service
- Setup Monitoring and observability for the system
- Take lead on complex incidents and provide deep technical expertise to resolve issues quickly.
- Perform RCA in-depth for incident management and suggest permanent fix
- Participate in design reviews focusing on reliability and scalability
- Design and implement automation for high-availability systems and fault-tolerant architectures
Skills :
- Expertise in Site Reliability Engineering (SRE) processes and skills.
- Solid understanding of Service Level Agreements (SLA), Service Level Indicators (SLI), and Service Level Objectives (SLO).
- Extensive experience with infrastructure monitoring, Application Performance Management (APM), and observability tools.
- Proficient in Incident and Problem Management.
- Proven track record in contributing to toil reduction.
- Strong troubleshooting skills, including Root Cause Analysis (RCA) and postmortem processes.
Functional Areas: Software/Testing/Networking
Read full job descriptionPrepare for Site Reliability Engineer roles with real interview advice
6-9 Yrs