Home
Communities
Companies
- Companies
  
  Discover best places to work
- Compare Companies
  
  Compare & find best workplace
- Add Office Photos
  
  Bring your workplace to life
- Add Company Benefits
  
  Highlight your company's perks
Reviews
- Company reviews
  
  Read reviews for 6L+ companies
- Write a review
  
  Rate your former or current company
Salaries
- Browse salaries
  
  Discover salaries for 6L+ companies
- Salary calculator
  
  Calculate your take home salary
- Are you paid fairly?
  
  Check your market value
- Share your salary
  
  Help other jobseekers
Interviews
- Company interviews
  
  Read interviews for 40K+ companies
- Campus placements
  
  Interviews questions for 1K+ colleges
- Share interview questions
  
  Contribute your interview questions
Jobs
Awards

RATE NOW!
- ABECA 2025
  
  RATE NOW!
  
  AmbitionBox Employee Choice Awards - 4th Edition
- ABECA 2024
  
  AmbitionBox Employee Choice Awards - 3rd Edition
- AmbitionBox Best Places to Work 2022
  
  2nd Edition
- AmbitionBox Best Places to Work 2021
  
  1st Edition

Add office photos

Employer? Claim Account for FREE

InOpTra Digital

Compare

4.6

based on 26 Reviews

11 InOpTra Digital Jobs

InOpTra Digital - Senior Reliability Engineer - Prometheus (10-18 yrs)

InOpTra Digital

4.6

based on 26 Reviews

10-18 years

InOpTra Digital

posted 2mon ago

Job Role Insights

Flexible timing

Key skills for the job

DevOps Python Kubernetes Incident Management Site Reliability Engineering Grafana

+ 1 more

Job Description

Job Title : Senior Site Reliability Engineer (SRE) - Prometheus

Experience Level : 10+ Years of experience

Location : Seattle/Remote

Employment Type : Full-time

Note : We Are Looking For Only Native Usa Candidates

Job Description :

We are looking for a skilled Senior Site Reliability Engineer (SRE) with deep expertise in Prometheus, Grafana, and Kubernetes to join our remote team. In this role, you will manage and optimize the infrastructure supporting a large-scale hardware monitoring project, ensuring high availability, reliability, and scalability for thousands of server hardware.

Key Responsibilities :

- Monitoring and Observability : Design, implement, and maintain comprehensive monitoring systems using Prometheus and Grafana to track and visualize metrics from thousands of hardware servers.

- Kubernetes Orchestration : Deploy, manage, and optimize applications on Kubernetes clusters, ensuring optimal performance and scalability.

- Automation and Scripting : Develop and implement automation for routine tasks, including alerting, system monitoring, and response mechanisms.

- Incident Management : Troubleshoot, diagnose, and resolve infrastructure incidents, ensuring the uptime and reliability of services.

- Performance Tuning : Optimize system performance, ensuring efficient data storage, querying, and alerting in Prometheus and Grafana environments.

- CI/CD Integration : Collaborate with development teams to integrate monitoring into the CI/CD pipeline and ensure smooth deployments.

- Capacity Planning : Perform capacity analysis and ensure that systems are appropriately scaled to handle increasing load.

- Post Deployment support : Support for monitoring solution once monitoring solution is implemented, troubleshooting incidents.

Required Skills :

- Prometheus : Advanced experience in configuring, tuning, and managing Prometheus for large-scale environments.

- Grafana : Proficiency in setting up Grafana dashboards for real-time monitoring and alerting.

- Kubernetes : Strong hands-on experience with managing Kubernetes clusters, deployments, and container orchestration.

- Scripting : Proficiency in scripting languages such as Python or Bash to automate tasks.

- Alerting & Incident Management : Experience setting up advanced alerting and incident management processes.

- Infrastructure as Code (IaC) : Experience with tools like Helm.

- CI/CD Pipelines : Knowledge of CI/CD tools and automation frameworks for seamless deployment.

Preferred Skills :

- Familiarity with external storage for prometheus (ex. Mimir) for high-scale storage backends.

- Experience with any Cloud Platforms (ex. AWS, GCP, Azure) for deploying infrastructure.

- Knowledge of microservices architecture and REST APIs.

- Knowledge of Redfish APIs.

Qualifications :

- 6+ years of hands-on experience as an SRE , DevOps Engineer, or similar role in managing complex infrastructure systems.

- 2+ years of hands-on experience in implementing and configuring prometheus monitoring.

- Strong understanding of DevOps practices and infrastructure automation.

- Proven experience in large-scale monitoring systems and high-availability environments.

- Excellent troubleshooting, analytical, and problem-solving skills.

Functional Areas: Manufacturing

Read full job description

Prepare for Senior Reliability Engineer roles with real interview advice

People are getting interviews at InOpTra Digital through

(based on 3 InOpTra Digital interviews)

Job Portal

100%

Moderate Confidence

What people at InOpTra Digital are saying

What InOpTra Digital employees are saying about work life

based on 26 employees

86%

95%

63%

100%

Flexible timing

Monday to Friday

No travel

Day Shift

View more insights

Compare InOpTra Digital with

TCS

3.7

Compare

Infosys

3.7

Compare

Wipro

3.7

Compare

HCLTech

3.6

Compare

Tech Mahindra

3.6

Compare

LTIMindtree

3.6

Compare

Mphasis

3.4

Compare

Hexaware Technologies

3.6

Compare

Cyient

3.7

Compare

Exl India

3.5

Compare

Primus Global Technologies

4.0

Compare

TriGeo Technologies

3.2

Compare

GrapplTech

4.8

Compare

Plada Infotech Services

3.5

Compare

Hummingwave Technologies

4.8

Compare

Fusion

3.2

Compare

Anlage Infotech

3.7

Compare

Infocus Technologies

3.9

Compare

Riddhi Corporate Services

3.8

Compare

CGS

3.5

Compare

Similar Jobs for you

Shift Engineer at Innova ESI

Mumbai, Bangalore / Bengaluru + 1

6-9 Yrs

₹ 15-20 LPA

Reliability Engineer at Selsoft

Hyderabad / Secunderabad

4-9 Yrs

₹ 17-36 LPA

Plant Operations Manager at Boolean Staffing Solution

Haridwar

10-18 Yrs

₹ 20-30 LPA

Manufacturing Manager at Catenon Executive Search Firm

Chhattisgarh, Raipur

10-18 Yrs

₹ 40-50 LPA

Head Engineering at Rize People Konnect Pvt. Ltd.

Gurgaon / Gurugram

10-18 Yrs

₹ 30-60 LPA

Maintenance Manager at Yulu Bikes

8-10 Yrs

₹ 20-22 LPA

Reliability Engineer at Pivotal

9-15 Yrs

₹ 30-50 LPA

Manager Learning & Development at WNS Global Services Private Limited

Pune

12-18 Yrs

₹ 23-30 LPA

Senior Manager - Quality Assurance at Mancer Consulting

Bangalore / Bengaluru

12-18 Yrs

₹ 40-55 LPA

Senior Service Manager at ConnectPro Management Consultants Pvt Ltd

14-18 Yrs

₹ 35-50 LPA