
Xebia - Site Reliability Engineer - DevOps (5-14 yrs)


Xebia

posted 12hr ago

Job Role Insights

Flexible timing

Job Description

Hybrid

Shift: 3 PM - 12 AM

As an SRE, you will work at the intersection of software engineering and system operations to ensure our cloud infrastructure is reliable, efficient, and scalable.


You will be responsible for monitoring, troubleshooting, and automating our systems, with a special focus on AI/ML services deployed on Google Cloud Platform (GCP).


This role is perfect for individuals who are passionate about improving service reliability through automation, observability, and a data-driven approach.

Key Responsibilities:

- Ensure the availability, performance, and scalability of services running on Google Cloud Platform (GCP), particularly for AI/ML services.

- Monitor and optimize cloud-based systems, reducing downtime through proactive monitoring and automation.

- Develop and implement automation scripts for infrastructure provisioning, configuration, and deployment.

- Design and manage monitoring and alerting systems using tools such as Prometheus, Grafana, and Stackdriver to track key performance indicators and reliability targets (SLIs, SLOs, SLAs).

- Collaborate with engineering teams to ensure applications are built for high reliability and scalability in a cloud environment, particularly those leveraging AI/ML services on GCP.

- Troubleshoot complex production issues and drive improvements to the infrastructure and application design to enhance reliability and performance.

- Work on incident response and root cause analysis (RCA), identifying areas for improvement and implementing solutions to prevent recurrence.

- Participate in on-call rotations, providing support for production systems and resolving critical incidents quickly and effectively.

- Implement infrastructure-as-code (IaC) practices using tools like Terraform, Ansible, or Google Cloud Deployment Manager.

- Optimize cost and resource usage in GCP, ensuring services are running efficiently and within budget.
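For candidates unfamiliar with the SLI/SLO vocabulary used in the responsibilities above, the following is a minimal illustrative sketch of how an availability SLI and its error budget might be computed. It is not Xebia's tooling; the names `SloReport` and `evaluate_slo`, and the sample numbers, are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SloReport:
    sli: float               # measured fraction of successful requests
    allowed_failures: float  # failures the SLO tolerates in this window
    budget_remaining: float  # fraction of error budget left; negative means overspent


def evaluate_slo(total_requests: int, failed_requests: int, slo_target: float) -> SloReport:
    """Compare a measured availability SLI against an SLO target.

    Hypothetical helper: the request counts would normally come from a
    monitoring system such as Prometheus; here they are passed in directly.
    """
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    if not 0 <= failed_requests <= total_requests:
        raise ValueError("failed_requests must be between 0 and total_requests")
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")

    # SLI: the share of requests that succeeded in the window.
    sli = (total_requests - failed_requests) / total_requests
    # Error budget: the number of failures the SLO permits in the window.
    allowed_failures = (1.0 - slo_target) * total_requests
    # Share of that budget still unspent.
    budget_remaining = 1.0 - failed_requests / allowed_failures
    return SloReport(sli, allowed_failures, budget_remaining)


if __name__ == "__main__":
    # Assumed sample window: 1,000,000 requests, 400 failures, 99.9% SLO.
    report = evaluate_slo(1_000_000, 400, 0.999)
    print(f"SLI: {report.sli:.4%}")
    print(f"Error budget remaining: {report.budget_remaining:.1%}")
```

A negative `budget_remaining` would indicate the SLO was missed for the window, which is typically the signal to prioritize reliability work over new features.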

Required Skills and Qualifications:

- Experience working as a Site Reliability Engineer (SRE), DevOps engineer, or in a similar role with a focus on cloud infrastructure.

- Strong hands-on experience with Google Cloud Platform (GCP) services (e.g., Compute Engine, Kubernetes Engine, BigQuery, Cloud Storage).

- Familiarity with AI/ML services and frameworks on GCP, such as AI Platform, TensorFlow, or BigQuery ML.

- Proficient in scripting and automation using languages such as Python, Go, or Bash.

- Solid understanding of monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, Google Stackdriver, or similar).

- Experience with containerization technologies like Docker and Kubernetes.

- Strong knowledge of systems architecture, networking, and distributed systems.

- Experience with infrastructure as code (IaC) tools like Terraform, CloudFormation, or Ansible.

- Ability to troubleshoot complex systems and effectively manage incident response.

- Strong analytical and problem-solving skills with a focus on continuous improvement.

- Excellent communication skills and the ability to collaborate with cross-functional teams.

Preferred Qualifications:

- Google Cloud Platform (GCP) certifications, such as Professional Cloud Architect or Professional Cloud DevOps Engineer.

- Experience with CI/CD pipelines and tools like Jenkins, GitLab CI, or CircleCI.

- Familiarity with AI/ML frameworks and cloud services for machine learning.

- Knowledge of security best practices for cloud environments.

- Experience working in an agile or DevOps-driven environment.


Functional Areas: Software/Testing/Networking


Xebia Benefits

Submitted by Company:

- Competitive Salaries & Bonuses
- Elevating and Empowering
- Voluntary Leaves for Self-Development
- Company Sponsored Skilling Programs
- Health and Wellness
- Flexible Work Hours & Remote Options

Submitted by Employees:

- Work From Home
- Soft Skill Training
- Health Insurance
- Cafeteria
- Job Training
- Team Outings (plus 6 more not shown)

Xebia Gurgaon / Gurugram Office Location

Gurgaon Office: Capital Cyberscape, 4th Floor, Sector-59, Golf Course Extension Road, Gurgaon
