Cloudologic

4.3

based on 32 Reviews

Senior Site Reliability Engineer - DevOps (5-7 yrs)

5-7 years

posted 11d ago

Job Role Insights

Flexible timing

Key skills for the job

DevOps, AWS, Cloud Computing, Kubernetes, Azure DevOps, Incident Management

Job Description

Company Description :

Cloudologic is a prominent cloud consulting and IT service provider based in Singapore and rooted in India, focusing on cloud operations, cyber security, and managed services. With a decade of expertise, our dedication to delivering high-quality services has earned the trust of clients worldwide, making us a valued partner in the tech industry.

Role Description :

This is a full-time role for a Senior Site Reliability Engineer at Cloudologic. The engineer will be responsible for troubleshooting, software development, system administration, and infrastructure maintenance. The role is based onsite in Gurgaon, though remote work is acceptable.

System Reliability & Performance :

- Ensure high availability, reliability, and scalability of services.

- Implement SLOs (Service Level Objectives) and SLIs (Service Level Indicators).

- Monitor system performance and proactively address bottlenecks.

Incident Management & Troubleshooting :

- Respond to incidents, conduct root cause analysis (RCA), and implement fixes.

- Develop and improve monitoring, alerting, and diagnostic tools.

- Conduct blameless postmortems to improve system resilience.

Automation & Infrastructure as Code (IaC) :

- Automate deployments, scaling, and recovery processes.

- Manage infrastructure using tools like Terraform, Ansible, or Kubernetes.

- Implement CI/CD pipelines for seamless software releases.

Observability & Monitoring :

- Use monitoring tools (e.g., Prometheus, Grafana, Datadog, Splunk, ELK) to track system health.

- Define and maintain dashboards and alerts for proactive system monitoring.

Security & Compliance :

- Implement security best practices, vulnerability scanning, and patch management.

- Ensure compliance with regulatory requirements (GDPR, ISO 27001, etc.).

- Conduct security audits and risk assessments.

Capacity Planning & Cost Optimization :

- Forecast system demands and scale infrastructure accordingly.

- Optimize cloud costs by managing resource utilization efficiently.

- Work with development teams to build cost-effective solutions.

Collaboration & Documentation :

- Work closely with developers, DevOps, and IT teams to improve system reliability.

- Document processes, best practices, and incident response playbooks.

- Participate in on-call rotations and knowledge-sharing sessions.

Qualifications :

- Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience).

- 5+ years of experience in a Site Reliability Engineering, DevOps, or similar role.

- Strong understanding of system reliability, performance, and scalability principles.

- Proficiency in scripting languages (e.g., Python, Bash) and automation tools.

- Experience with Infrastructure as Code (IaC) tools (e.g., Terraform, Ansible, Kubernetes).

- Expertise in monitoring and logging tools (e.g., Prometheus, Grafana, Datadog, Splunk, ELK).

- Solid understanding of cloud platforms (AWS, Azure, GCP).

- Experience with CI/CD pipelines and software release management.

- Strong problem-solving and troubleshooting skills.

- Excellent communication and collaboration skills.

- Knowledge of security best practices and compliance requirements.

Preferred Qualifications :

- Experience with containerization and orchestration technologies (Docker, Kubernetes).

- Experience with database administration and optimization.

- Relevant certifications (e.g., AWS Certified DevOps Engineer, Google Cloud Certified Professional Cloud DevOps Engineer).

Functional Areas: Software/Testing/Networking
