Upload Button Icon Add office photos
filter salaries All Filters

19 InfoService Jobs

Site Reliability Engineer - ELK Stack (5-7 yrs)

5-7 years

Site Reliability Engineer - ELK Stack (5-7 yrs)

InfoService

posted 1mon ago

Job Description

Role : Site Reliability Engineer (SRE) - Observability and Telemetry.

Job Summary :

We are seeking a highly skilled Site Reliability Engineer (SRE) - Observability and Telemetry to join our dynamic and innovative team.

The ideal candidate will have a deep understanding of observability principles, infrastructure monitoring, and performance optimization in virtualized and containerized environments.

This role will focus on designing, building, and maintaining observability platforms to ensure the reliability, scalability, and performance of our systems.

Key Responsibilities :

- Design and Implement Observability Solutions : Develop and maintain scalable observability systems, ensuring robust telemetry, logging, and monitoring across cloud-native and hybrid infrastructures.

- Monitoring and Alerting : Create effective monitoring strategies using tools such as Prometheus, Grafana, and ELK Stack to detect anomalies and ensure system health.

- Performance Optimization : Develop and implement performance dashboards and reports to track system metrics, resource utilization, and application behavior.

- Telemetry Integration : Drive adoption and implementation of OpenTelemetry to enhance distributed tracing, logging, and metrics collection across microservices and containerized applications.

- Infrastructure Management : Collaborate with infrastructure teams to improve observability for virtualized environments (VMware) and container orchestration platforms (Kubernetes).

- Automation : Develop and enhance automated solutions for incident response, alert management, and system health reporting to reduce manual intervention and improve reliability.

- Capacity Planning and Reliability : Proactively analyze performance trends and system logs to forecast capacity needs and ensure system reliability.

-Collaboration and Documentation : Work closely with development, operations, and infrastructure teams to promote best practices in observability and provide clear documentation and training on tools and processes.

Required Skills and Experience :

Proven Expertise in Observability Tools :

- Hands-on experience with Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), and OpenTelemetry for monitoring, logging, and tracing.

- Strong Knowledge of Virtualized and Containerized Environments:.

- Experience working with VMware and Kubernetes platforms for managing and monitoring system resources.

Dashboards and Visualization : .

- Proven ability to design, build, and optimize management dashboards that visualize critical performance and reliability metrics.

Scripting and Automation :

- Proficiency in scripting languages such as Python, Bash, or Go to automate observability workflows.

Infrastructure as Code :

- Familiarity with tools like Terraform, Ansible, or Helm for automated infrastructure deployment and configuration management.

Strong Analytical and Problem-Solving Skills :

- Ability to analyze complex system behaviors, troubleshoot performance bottlenecks, and implement data-driven optimizations.

Collaboration and Communication :

- Excellent interpersonal skills to work effectively with cross-functional teams and communicate complex technical concepts to diverse stakeholders.

Preferred Qualifications :

- Experience with service mesh architectures and tools like Istio or Linkerd for observability in microservices environments.

- Knowledge of cloud platforms (AWS, Azure, GCP) and their native monitoring solutions.

- Familiarity with security and compliance monitoring frameworks and tools.


Functional Areas: Software/Testing/Networking

Read full job description

Prepare for Site Reliability Engineer roles with real interview advice

Top InfoService Site Reliability Engineer Interview Questions

Q1. Yaml file how did u configured and how mule will encript the details
Q2. what is head,name me 7 types of tags, How to add image in html,what is tag,some questions regarding css and some basic questions on sql.Last ... read more
Q3. How can we use code in software programs
View all 21 questions

What people at InfoService are saying

What InfoService employees are saying about work life

based on 266 employees
63%
53%
55%
93%
Flexible timing
Monday to Friday
No travel
Day Shift
View more insights

InfoService Benefits

Work From Home
Job Training
Soft Skill Training
Health Insurance
Cafeteria
Team Outings +6 more
View more benefits

Compare InfoService with

Cognizant

3.7
Compare

Sutherland Global Services

3.6
Compare

Optum Global Solutions

4.0
Compare

Hexaware Technologies

3.5
Compare

FIS

3.9
Compare

Virtusa Consulting Services

3.8
Compare

CGI Group

4.0
Compare

GlobalLogic

3.6
Compare

Bosch Global Software Technologies

3.9
Compare

UST

3.8
Compare

Nagarro

4.0
Compare

Hewlett Packard Enterprise

4.2
Compare

ITC Infotech

3.6
Compare

Publicis Sapient

3.5
Compare

Synechron

3.5
Compare

NTT DATA, Inc.

4.0
Compare

IGT Solutions

3.3
Compare

CMS IT Services

3.1
Compare

Societe Generale Global Solution Centre

3.8
Compare

Capita

3.6
Compare

Similar Jobs for you

Site Reliability Engineer at Signzy Technologies

3-5 Yrs

₹ 12-18 LPA

Site Reliability Engineer at Travash Software Solutions Private Limited

6-8 Yrs

₹ 15-24 LPA

Site Reliability Engineer at Natobotics Technologies Pvt Limited

4-8 Yrs

₹ 12-24 LPA

Site Reliability Engineer at Kiash Solutions LLp

7-9 Yrs

₹ 12-18 LPA

Site Reliability Engineer at Ascendion

6-9 Yrs

₹ 15-30 LPA

Site Reliability Engineer at GTECH

5-7 Yrs

₹ 15-20 LPA

Devops Engineer at TecQubes Technologies

4-6 Yrs

₹ 12-18 LPA

Site Reliability Engineer at Agivant Technologies

7-12 Yrs

₹ 20-30 LPA

Site Reliability Engineer at Dotflick Solutions

5-13 Yrs

₹ 19-72 LPA

Site Reliability Engineer at Truelancer.com

8-10 Yrs

₹ 24-30 LPA

Site Reliability Engineer - ELK Stack (5-7 yrs)

5-7 Yrs

1mon ago·via hirist.com

AWS Data Engineer - ETL/Python/SQL (3-8 yrs)

3-8 Yrs

29d ago·via hirist.com

Data Scientist - Python/R/SQL (5-7 yrs)

5-7 Yrs

29d ago·via hirist.com

Senior Next.Js Developer - Javascript (5-9 yrs)

5-9 Yrs

29d ago·via hirist.com

Senior Consultant - MS Dynamics (5-10 yrs)

5-10 Yrs

29d ago·via hirist.com

Salesforce Integration Engineer - Coveo (5-7 yrs)

5-7 Yrs

29d ago·via hirist.com

Business Analyst - ServiceNow (5-8 yrs)

5-8 Yrs

29d ago·via hirist.com

Senior Observability Engineer - Splunk (5-6 yrs)

5-6 Yrs

1mon ago·via hirist.com

Recently Viewed

JOBS

Browse jobs

Discover jobs you love

JOBS

AlchemyJob

No Jobs

JOBS

BiteSpeed

No Jobs

JOBS

Hiring Eye

No Jobs

JOBS

Closeloop

No Jobs

JOBS

TalentXO

No Jobs

JOBS

DigiHelic

No Jobs

JOBS

Info Edge

No Jobs

JOBS

Ascendion

No Jobs

JOBS

Tietoevry

No Jobs

write
Share an Interview
How was your last interview experience?
Rate your experience using AmbitionBox
Terrible
Terrible
Poor
Poor
Average
Average
Good
Good
Excellent
Excellent