Upload Button Icon Add office photos
Engaged Employer

i

This company page is being actively managed by CloudifyOps Team. If you also belong to the team, you can get access from here

CloudifyOps Verified Tick

Compare button icon Compare button icon Compare
3.8

based on 27 Reviews

filter salaries All Filters

8 CloudifyOps Jobs

Site Reliability Engineer 1

6-7 years

Kolkata, Mumbai, New Delhi + 4 more

1 vacancy

Site Reliability Engineer 1

CloudifyOps

posted 1d ago

Job Description

Infrastructure Operations: Manage, administer, and optimize our Linux-based infrastructure, ensuring high availability, scalability, and performance for mission-critical applications.
Incident Management : Lead and manage incident response, troubleshoot complex issues across production systems, and ensure timely resolution based on SLA, SLO, and SLI.
Linux Administration: Perform in-depth system administration tasks, including user management, system tuning, performance optimization, patching, and log analysis for Linux servers.
Automation Configuration Management: Automate repetitive tasks, manage Docker containers, and implement configuration management and provisioning using Terraform and other automation tools.
Monitoring Performance Tuning: Utilize Grafana, Prometheus, and other monitoring tools to track system health and performance, ensuring proactive measures to maintain reliability and reduce downtime.
Runbook Documentation: Develop and maintain runbooks, detailed troubleshooting guides, and operational documentation for incident response and system recovery.
Debugging Troubleshooting: Perform advanced debugging and root-cause analysis to diagnose complex system issues, with a focus on minimizing system downtime and improving operational stability.
Collaboration: Work closely with cross-functional teams, including development, QA, and operations, to ensure reliability and performance standards for new features and releases.
Capacity Planning Optimization: Plan for future infrastructure needs, scale systems
as required, and optimize resource utilization to meet growing demands.

Skills and Qualifications:
5+ years of experience with Strong Linux Administration with Networking, including deep expertise in system configuration, user management, kernel tuning, log analysis, and performance troubleshooting.
Hands-on experience with Docker and container orchestration (e.g., Kubernetes).
Solid knowledge of Terraform for infrastructure automation, provisioning, and management.
Proficiency with monitoring tools like Grafana, Prometheus, and Opsgenie to track performance and uptime.
Experience in incident management, ensuring that issues are resolved promptly while maintaining SLA, SLO, and SLI metrics.
Expertise in debugging and resolving complex technical issues in distributed systems, with a focus on minimizing downtime.
Proven ability to write and maintain runbooks and operational procedures for troubleshooting and system recovery.
Experience in data center management and ensuring 24/7 availability of production infrastructure.
Strong understanding of automation tools (e.g., Terraform, Ansible) and continuous improvement practices.
Excellent communication and teamwork skills, with the ability to collaborate effectively across departments.
Linux networking is primary, Ubuntu based infra. Strong linux skills required. For eg: Hardening
Ansible for configuration management, jenkins, Gitlabs for deploying infra.
Automation of tasks, Shell scripting and Python.
ELK for logging, Prometheus Grafana
Databricks Tableau are also being used so SQL skills are required.
Docker containers understanding is a must. Preferred Qualifications:
Experience with cloud platforms (AWS, Azure, GCP).
Familiarity with CI/CD pipelines and tools like Jenkins, Git, or similar.
Exposure to Kubernetes and container orchestration technologies.
Certifications in Linux administration, SRE, DevOps, or cloud technologies.

Employment Type: Full Time, Permanent

Read full job description

Prepare for Site Reliability Engineer roles with real interview advice

What people at CloudifyOps are saying

Site Reliability Engineer salary at CloudifyOps

reported by 1 employee with 6 years exp.
₹10.8 L/yr - ₹13.8 L/yr
13% less than the average Site Reliability Engineer Salary in India
View more details

What CloudifyOps employees are saying about work life

based on 27 employees
90%
79%
60%
100%
Flexible timing
Monday to Friday
No travel
Day Shift
View more insights

CloudifyOps Benefits

Soft Skill Training
Work From Home
Job Training
Team Outings
Health Insurance
Cafeteria +6 more
View more benefits

Compare CloudifyOps with

HCL Infosystems

3.9
Compare

Accel Frontline

3.9
Compare

Northcorp Software

4.4
Compare

Diverse Lynx

3.8
Compare

Elentec Power India (EPI) Pvt. Ltd.

3.7
Compare

HyScaler

4.5
Compare

Appsierra

4.3
Compare

Emblix Solutions

4.9
Compare

Solartis Technology Services

3.7
Compare

Trawex Technologies

4.7
Compare

Yashi Consulting Services

3.9
Compare

VHS Consulting

3.7
Compare

IVTL Infoview Technologies

3.6
Compare

Apex CoVantage

3.3
Compare

Knoldus Inc

4.1
Compare

DynPro

3.8
Compare

Apmosys Technologies

3.5
Compare

Avontix

4.0
Compare

AvenData GmbH

3.2
Compare

Dahua Technology India Pvt.Ltd.

3.6
Compare

Similar Jobs for you

Site Reliability Engineer at Intelex Technologies ULC

Kolkata, Mumbai + 5

5-7 Yrs

₹ 7-10 LPA

Site Reliability Engineer at Morningstar India (P) Ltd.

Mumbai

3-7 Yrs

₹ 5-9 LPA

Site Reliability Engineer at Oracle India Pvt. Ltd.

Bangalore / Bengaluru

8-13 Yrs

₹ 9-19 LPA

Site Reliability Engineer at Sirion Pte Ltd

Gurgaon / Gurugram

4-7 Yrs

₹ 6-9 LPA

Site Reliability Engineer at DST Systems, Inc.

Hyderabad / Secunderabad

7-8 Yrs

₹ 9-10 LPA

Site Reliability Engineer at Skyhigh Networks

Bangalore / Bengaluru

8-13 Yrs

₹ 11-16 LPA

Site Reliability Engineer at Netskope

Kolkata, Mumbai + 5

7-9 Yrs

₹ 9-11 LPA

Site Reliability Engineer at Catchpoint Systems, Inc

Kolkata, Mumbai + 5

2-7 Yrs

₹ 4-9 LPA

Site Reliability Engineer at Aerospike, Inc

Bangalore / Bengaluru

5-6 Yrs

₹ 8-12 LPA

Site Reliability Engineer at Turing

Remote

3-7 Yrs

₹ 5-9 LPA

CloudifyOps Chennai Office Location

View all
Chennai Office
Indiqube Vantage, 3rd Phase, No.1, OMR Service Road, Santhosh Nagar, Kandhanchavadi, Perungudi, Chennai, Tamil Nadu 600096 Chennai
600096

Site Reliability Engineer 1

6-7 Yrs

Kolkata, Mumbai, New Delhi +4 more

3d ago·via naukri.com

Site Reliability Engineer

2-5 Yrs

Remote

18d ago·via naukri.com

Site Reliability Engineer

2-5 Yrs

Chennai, Bangalore / Bengaluru

19d ago·via naukri.com

Site Reliability Engineer

5-7 Yrs

Bangalore / Bengaluru

20d ago·via naukri.com

Devops Engineer

4-5 Yrs

Bangalore / Bengaluru

1mon ago·via naukri.com

Business Development Manager

6-11 Yrs

Bangalore / Bengaluru

1mon ago·via naukri.com
write
Share an Interview