i
Tech Mahindra
508 Tech Mahindra Jobs
7-12 years
₹ 10 - 20L/yr
Hyderabad / Secunderabad, Pune
1 vacancy
Senior Site Reliability Engineer
Tech Mahindra
posted 7hr ago
Role & responsibilities
- Implement and manage observability solutions using tools like Splunk and AppDynamics to monitor and analyze system performance, focusing on proactive issue detection.
- Collaborate with application, database, network, storage, firewall, and security teams to ensure optimal system reliability and performance.
- Analyze traffic patterns, errors, and exceptions from logs to suggest and implement improvement ideas.
- Drive stability improvements to achieve zero downtime for web services and databases.
- Conduct capacity planning and implement resiliency measures, such as failover mechanisms for databases and ecosystem data sources.
- Participate in on-call rotations and lead incident response efforts, including blameless postmortems.
Preferred candidate profile
Extensive experience with database systems, including mainframe, DB2, RDBMS, and web services (REST and SOAP).
- Proficiency in application monitoring tools such as Splunk, Extra hop, AppDynamics, Prometheus, and Grafana.
- Strong understanding of JVM and database metrics, including CPU, memory, disk space utilization, threads, and connection counts.
- Expertise in Java web services, particularly REST services.
- Excellent communication and interpersonal skills, with the ability to articulate complex technical concepts to senior management and various stakeholders.
- Proven track record of working independently to drive stability improvements and ensure high availability
Perks and benefits
- Experience with cloud platforms and containerization technologies.
- Knowledge of automation and infrastructure-as-code practices.
- Familiarity with DevOps methodologies and continuous integration/continuous delivery (CI/CD) pipelines.
The successful candidate will play a crucial role in enhancing our observability capabilities, transitioning from reactive to proactive monitoring, and effectively managing stakeholder relationships across various teams. They will be responsible for identifying patterns in alerts, addressing capacity challenges, and implementing robust solutions to prevent recurring issues.
Employment Type: Full Time, Permanent
Read full job descriptionPrepare for Senior Site Reliability Engineer roles with real interview advice
7-12 Yrs
₹ 10 - 20L/yr
Hyderabad / Secunderabad, Pune
5-7 Yrs
Hyderabad / Secunderabad, Bangalore / Bengaluru
5-10 Yrs
Hyderabad / Secunderabad, Bangalore / Bengaluru
8-13 Yrs
Hyderabad / Secunderabad, Pune, Bangalore / Bengaluru
10-17 Yrs
Bangalore Rural, Bangalore / Bengaluru
5-10 Yrs
Hyderabad / Secunderabad
0-5 Yrs
₹ 2 - 2.25L/yr
Noida