Upload Button Icon Add office photos
filter salaries All Filters

19 InfoService Jobs

Site Reliability Engineer - ELK Stack (5-7 yrs)

5-7 years

Site Reliability Engineer - ELK Stack (5-7 yrs)

InfoService

posted 15hr ago

Job Description

Role : Site Reliability Engineer (SRE) - Observability and Telemetry.

Job Summary :

We are seeking a highly skilled Site Reliability Engineer (SRE) - Observability and Telemetry to join our dynamic and innovative team.

The ideal candidate will have a deep understanding of observability principles, infrastructure monitoring, and performance optimization in virtualized and containerized environments.

This role will focus on designing, building, and maintaining observability platforms to ensure the reliability, scalability, and performance of our systems.

Key Responsibilities :

- Design and Implement Observability Solutions : Develop and maintain scalable observability systems, ensuring robust telemetry, logging, and monitoring across cloud-native and hybrid infrastructures.

- Monitoring and Alerting : Create effective monitoring strategies using tools such as Prometheus, Grafana, and ELK Stack to detect anomalies and ensure system health.

- Performance Optimization : Develop and implement performance dashboards and reports to track system metrics, resource utilization, and application behavior.

- Telemetry Integration : Drive adoption and implementation of OpenTelemetry to enhance distributed tracing, logging, and metrics collection across microservices and containerized applications.

- Infrastructure Management : Collaborate with infrastructure teams to improve observability for virtualized environments (VMware) and container orchestration platforms (Kubernetes).

- Automation : Develop and enhance automated solutions for incident response, alert management, and system health reporting to reduce manual intervention and improve reliability.

- Capacity Planning and Reliability : Proactively analyze performance trends and system logs to forecast capacity needs and ensure system reliability.

-Collaboration and Documentation : Work closely with development, operations, and infrastructure teams to promote best practices in observability and provide clear documentation and training on tools and processes.

Required Skills and Experience :

Proven Expertise in Observability Tools :

- Hands-on experience with Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), and OpenTelemetry for monitoring, logging, and tracing.

- Strong Knowledge of Virtualized and Containerized Environments:.

- Experience working with VMware and Kubernetes platforms for managing and monitoring system resources.

Dashboards and Visualization : .

- Proven ability to design, build, and optimize management dashboards that visualize critical performance and reliability metrics.

Scripting and Automation :

- Proficiency in scripting languages such as Python, Bash, or Go to automate observability workflows.

Infrastructure as Code :

- Familiarity with tools like Terraform, Ansible, or Helm for automated infrastructure deployment and configuration management.

Strong Analytical and Problem-Solving Skills :

- Ability to analyze complex system behaviors, troubleshoot performance bottlenecks, and implement data-driven optimizations.

Collaboration and Communication :

- Excellent interpersonal skills to work effectively with cross-functional teams and communicate complex technical concepts to diverse stakeholders.

Preferred Qualifications :

- Experience with service mesh architectures and tools like Istio or Linkerd for observability in microservices environments.

- Knowledge of cloud platforms (AWS, Azure, GCP) and their native monitoring solutions.

- Familiarity with security and compliance monitoring frameworks and tools.


Functional Areas: Software/Testing/Networking

Read full job description

Prepare for Site Reliability Engineer roles with real interview advice

People are getting interviews at InfoService through

(based on 32 InfoService interviews)
Job Portal
Referral
Campus Placement
Company Website
Walkin
Recruitment Consultant
32%
25%
22%
9%
6%
6%
High Confidence
?
High Confidence means the data is based on a large number of responses received from the candidates.

What people at InfoService are saying

What InfoService employees are saying about work life

based on 264 employees
63%
52%
55%
93%
Flexible timing
Monday to Friday
No travel
Day Shift
View more insights

InfoService Benefits

Work From Home
Job Training
Soft Skill Training
Health Insurance
Cafeteria
Team Outings +6 more
View more benefits

Compare InfoService with

Cognizant

3.8
Compare

NTT Data Information Processing Services

4.0
Compare

Sutherland Global Services

3.7
Compare

Hexaware Technologies

3.6
Compare

Virtusa Consulting Services

3.8
Compare

CGI Group

4.0
Compare

GlobalLogic

3.7
Compare

UST

3.8
Compare

Nagarro

4.0
Compare

Hewlett Packard Enterprise

4.2
Compare

ITC Infotech

3.8
Compare

Publicis Sapient

3.5
Compare

Synechron

3.6
Compare

IGT Solutions

3.3
Compare

CMS IT Services

3.1
Compare

Capita

3.6
Compare

Societe Generale Global Solution Centre

3.9
Compare

Quest Global

3.6
Compare

KocharTech

4.0
Compare

Fujitsu

3.8
Compare

Similar Jobs for you

Site Reliability Engineer at Travash Software Solutions Private Limited

6-8 Yrs

₹ 15-24 LPA

Senior Site Reliability Engineer at SwiftWin Technologies LLP

5-8 Yrs

₹ 15-24 LPA

Site Reliability Engineer at FatakPay Digital Pvt. Ltd.

Mumbai

3-5 Yrs

₹ 18-30 LPA

Site Reliability Engineer at Kiash Solutions LLp

7-9 Yrs

₹ 12-18 LPA

Developer at Export Genius

5-7 Yrs

₹ 20-25 LPA

Site Reliability Engineer at WS Audiology

5-8 Yrs

₹ 24-25 LPA

Site Reliability Engineer at Truelancer.com

8-10 Yrs

₹ 24-30 LPA

Site Reliability Engineer at Dotflick Solutions

5-13 Yrs

₹ 19-72 LPA

Site Reliability Engineer at Coders Brain Technology Private Limited

5-10 Yrs

₹ 15-20 LPA

Site Reliability Engineer at Virtusa Consulting Services Private Limited

8-15 Yrs

₹ 18-32 LPA

Site Reliability Engineer - ELK Stack (5-7 yrs)

5-7 Yrs

2d ago·via hirist.com

Full Stack Engineer - Java/React.js (5-6 yrs)

5-6 Yrs

2d ago·via hirist.com

Senior Storage Engineer - SAN/NAS (1-2 yrs)

1-2 Yrs

2d ago·via hirist.com

Senior Observability Engineer - Splunk (5-6 yrs)

5-6 Yrs

2d ago·via hirist.com

Senior Azure Data Engineer - ETL (5-7 yrs)

5-7 Yrs

2d ago·via hirist.com

Network Service Engineer - Datacenter (4-7 yrs)

4-7 Yrs

2d ago·via hirist.com

Senior Data Engineer - Apache Airflow (4-6 yrs)

4-6 Yrs

2d ago·via hirist.com
write
Share an Interview