Upload Button Icon Add office photos
Engaged Employer

i

This company page is being actively managed by Cloudologic Team. If you also belong to the team, you can get access from here

Cloudologic Verified Tick

Compare button icon Compare button icon Compare
filter salaries All Filters

6 Cloudologic Jobs

Azure Team Lead - Site Reliability (10-13 yrs)

10-13 years

Azure Team Lead - Site Reliability (10-13 yrs)

Cloudologic

posted 7d ago

Job Description

Company Description :


Cloudologic is a prominent cloud consulting and IT service provider based in Singapore with roots in India. Specializing in cloud operations, cyber security, and managed services, Cloudologic has built a reputation for high-quality services that clients worldwide trust and value.

Role Description :


This is a full-time Hybrid role located in Hyderabad for an Azure SRE Team Lead at Cloudologic. The Senior Manager will be responsible for day-to-day tasks associated with reliability engineering, troubleshooting, and engineering management in Azure Site Reliability Engineering.

As the Azure SRE Team Lead, you will be responsible for leading and mentoring a team of Site Reliability Engineers, driving operational excellence, and ensuring the reliability and scalability of our Azure-based infrastructure.

You will collaborate with cross-functional teams to design, implement, and maintain highly available systems, while advocating for best practices in automation, monitoring, and incident management. The ideal candidate will have strong leadership skills, deep technical expertise in Azure cloud services, and a passion for improving system reliability and performance.

Key Responsibilities :


Leadership & Mentorship :


- Lead, mentor, and grow a team of Site Reliability Engineers, fostering a culture of collaboration, innovation, and continuous improvement.

- Drive operational excellence by setting clear goals, priorities, and performance metrics for the team.

- Encourage professional development and knowledge sharing within the team.

Reliability & Scalability :


- Own the availability, performance, and scalability of critical services in Azure.

- Define and implement Site Reliability Engineering (SRE) best practices, including SLAs, SLOs, and error budgets.

- Proactively identify potential risks and performance bottlenecks and implement strategies to mitigate them.

Automation & Infrastructure Management :


- Oversee the automation of operational tasks, including provisioning, deployment, monitoring, and incident response.

- Lead efforts to implement Infrastructure as Code (IaC) using tools such as Ansible, Terraform, or Azure DevOps Pipelines.

Incident Management & Resolution :


- Manage and lead incident response for Azure-based infrastructure, ensuring quick resolution and root cause analysis.

- Define and continuously improve incident management processes, ensuring minimal downtime and impact on users.

Collaboration & Stakeholder Communication :


- Work closely with engineering, DevOps, and security teams to design and deploy solutions that meet both reliability and security requirements.

- Communicate effectively with stakeholders across the organization, providing visibility into SRE efforts and service health metrics.

Monitoring & Observability :


- Ensure robust monitoring, logging, and alerting are in place to proactively identify issues before they impact customers.

- Lead the adoption and continuous improvement of observability frameworks (i.e., Prometheus, Grafana, Azure Monitor).

Continuous Improvement :


- Drive continuous improvement initiatives, including post-incident reviews, blameless retrospectives, and process optimizations.

- Stay up to date with the latest Azure technologies and industry best practices, integrating new solutions to improve reliability.

Qualifications :


Experience :


- 10+ year in IT/Infrastructure operations with 5+ years of experience in a Site Reliability Engineering (SRE) or DevOps role with significant exposure to Azure cloud environments.

- 5+ years of experience in a leadership or management role, ideally leading an SRE or infrastructure team.

- Proven experience in building and maintaining high-availability, distributed systems on Azure.

- Hands-on experience with Azure services such as Application Gateways, Azure Networking, NSG, Kubernetes (AKS), App Services, and Azure Functions.

Technical Skills :


- Deep knowledge of Azure architecture, services, and infrastructure.

- Expertise in automation tools such as Terraform, Ansible, Azure DevOps, ARM templates, or similar.

- Proficient in scripting languages (i.e., Python, Bash, PowerShell) for automation and orchestration.

- Strong experience with containerization and orchestration tools, particularly Azure Kubernetes Service (AKS).

- Familiarity with monitoring tools such as Azure Monitor, Prometheus, Grafana, or ELK stack.

- In-depth knowledge of CI/CD pipelines and deployment strategies.


Functional Areas: Other

Read full job description

Prepare for Team Lead roles with real interview advice

What people at Cloudologic are saying

What Cloudologic employees are saying about work life

based on 32 employees
76%
100%
50%
100%
Flexible timing
Monday to Friday
No travel
Day Shift
View more insights

Cloudologic Benefits

Free Transport
Child care
Gymnasium
Cafeteria
Work From Home
Free Food +6 more
View more benefits

Compare Cloudologic with

TCS

3.7
Compare

Infosys

3.6
Compare

Wipro

3.7
Compare

HCLTech

3.5
Compare

Tech Mahindra

3.5
Compare

Accenture

3.8
Compare

Capgemini

3.7
Compare

Cognizant

3.7
Compare

IBM

4.0
Compare

LTIMindtree

3.8
Compare

Maxgen Technologies

4.5
Compare

Cyfuture

3.0
Compare

Magic Edtech

3.0
Compare

VDart

4.0
Compare

ANR Software Private Limited

4.4
Compare

Glorious Insight

4.6
Compare

Value Point Systems

3.7
Compare

JoulestoWatts Business Solutions

2.9
Compare

F1 Info Solutions and Services

3.8
Compare

ARMSOFTECH.AIR

3.3
Compare

Similar Jobs for you

Site Reliability Engineer Lead at New Age Consulting

8-15 Yrs

₹ 10-35 LPA

Site Reliability Engineer Lead at McAfee Software (India) Pvt. Ltd

7-13 Yrs

₹ 32-39 LPA

Site Reliability Engineer Lead at Factset

7-10 Yrs

₹ 25-30 LPA

Architect at Infogain

12-14 Yrs

₹ 35-45 LPA

Senior DevOps Software Engineer at Galaxy Web Links Limited

5-8 Yrs

₹ 15-24 LPA

Reliability Lead at Spruce IT Pvt. Ltd.

8-10 Yrs

₹ 20-30 LPA

Azure DevOps Engineer at SRS Solutions

6-13 Yrs

₹ 9-26 LPA

Presales Solution Architect at Microland Ltd

5-10 Yrs

₹ 15-30 LPA

Principal Site Reliability Engineer at Pylon Management Consulting

9-14 Yrs

₹ 27-42 LPA

Lead DevOps Engineer at CDW

8-10 Yrs

₹ 20-25 LPA

Azure Team Lead - Site Reliability (10-13 yrs)

10-13 Yrs

7d ago·via hirist.com

Cloudologic - Manager - Sales (3-6 yrs)

3-6 Yrs

22d ago·via iimjobs.com
write
Share an Interview