Site Reliability Engineer - Incident Management (18-24 yrs)

Experience: 18-24 years

Coders Brain

posted 3d ago

Job Description

Key Responsibilities:

Leadership & Strategy:

- Provide technical and people leadership to SRE, DevOps, Monitoring, and Database Operations teams.

- Collaborate with leadership on budgeting, planning, hiring, and managing third-party contracts.

- Oversee project status, assemble project teams, and define assignments with schedules and milestones.

Platform Reliability & Performance:

- Drive continuous improvement of reliability, stability, and performance of digital platforms.

- Oversee implementation of automated telemetry, observability, and applied intelligence systems.

- Lead efforts to develop automated alerting, self-healing mechanisms, and intelligent response systems.

Incident & Escalation Management:

- Ensure 24/7 uptime of sites and services, with minimal unplanned downtime.

- Serve as Escalation Manager/Critical Incident Manager during major incidents, leading teams in rapid service restoration.

- Provide on-call escalation support on a 24/7/365 schedule.

- Communicate timely updates and incident reports to senior leadership.

Collaboration & Integration:

- Partner with administrators, platform engineers, and other stakeholders to achieve highly reliable infrastructure, systems, and integrations.

- Collaborate with product, application development, QA, and technology teams to enhance service reliability and performance.

Incident Management & Automation:

- Provide advanced Incident and Problem Management support to effectively diagnose, remediate, and resolve platform issues.

- Automate critical workflows across the platform to minimize manual errors and reduce human intervention.

- Implement ITIL processes such as Incident, Problem, and Change Management.

Monitoring & Scalability:

- Design and implement effective monitoring systems with proper alerting and escalation mechanisms for critical events.

- Ensure timely capacity planning and infrastructure upgrades for optimal reliability.

- Develop and refine processes to minimize Mean Time to Recover (MTTR) and extend Mean Time to Failure (MTTF).
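
As a rough illustration of the two metrics named in the last item, MTTR is commonly taken as the average outage duration per incident, and MTTF as the average uptime preceding each failure. The sketch below assumes a hypothetical `Incident` record with failure and restoration timestamps; the type, the field names, and the sample data are illustrative and do not come from the posting.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class Incident:
    """Hypothetical incident record (an assumption, not from the posting)."""
    failed_at: datetime     # when the service went down
    restored_at: datetime   # when the service was restored


def mean_time_to_recover(incidents: list[Incident]) -> timedelta:
    """Average outage duration across all incidents."""
    if not incidents:
        raise ValueError("need at least one incident")
    total = sum((i.restored_at - i.failed_at for i in incidents), timedelta())
    return total / len(incidents)


def mean_time_to_failure(incidents: list[Incident], observed_from: datetime) -> timedelta:
    """Average uptime before each failure, counting from observed_from."""
    if not incidents:
        raise ValueError("need at least one incident")
    uptime = timedelta()
    up_since = observed_from
    for inc in sorted(incidents, key=lambda i: i.failed_at):
        uptime += inc.failed_at - up_since   # time the service stayed up
        up_since = inc.restored_at           # uptime clock restarts after recovery
    return uptime / len(incidents)


# Illustrative sample data only.
start = datetime(2024, 1, 1, 0, 0)
incidents = [
    Incident(datetime(2024, 1, 1, 10, 0), datetime(2024, 1, 1, 10, 30)),
    Incident(datetime(2024, 1, 2, 10, 30), datetime(2024, 1, 2, 12, 0)),
]
mttr = mean_time_to_recover(incidents)
mttf = mean_time_to_failure(incidents, start)
```

With this sample, the outages last 30 and 90 minutes and the uptime stretches are 10 and 24 hours, so a process change that shortens outages lowers `mttr`, while one that spaces failures further apart raises `mttf`.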

Documentation & Compliance:

- Create and maintain detailed documentation, including runbooks, incident response guides, post-mortem reports, root cause analyses (RCAs), and mitigation plans.

- Ensure all changes adhere to established procedures and documentation standards.

Business Alignment:

- Understand business workflows and map technology solutions to address problems effectively.

- Lead conversations and provide technical support to both internal and external customers.


Functional Areas: Software/Testing/Networking

Coders Brain Benefits:

- Work From Home
- Soft Skill Training
- Job Training
- Education Assistance
- Cafeteria
- Team Outings (and 6 more)
