
Groww - Lead Site Reliability Engineer - Grafana/Prometheus (5-8 yrs)

5-8 years

Groww

posted 19d ago

Job Description

About Groww :

We are a passionate group of people focused on making financial services accessible to every Indian through a multi-product platform.

Each day, we help millions of customers take charge of their financial journey.

Customer obsession is in our DNA :

Every product, every design, every algorithm down to the tiniest detail is executed keeping the customers' needs and convenience in mind.

Our people are our greatest strength :

Everyone at Groww is driven by ownership, customer-centricity, integrity and the passion to constantly challenge the status quo.

Are you as passionate about defying conventions and creating something extraordinary as we are? Let's chat.

Our Vision :

Every individual deserves the knowledge, tools, and confidence to make informed financial decisions.

At Groww, we are making sure every Indian feels empowered to do so through a cutting-edge multi-product platform offering a variety of financial services.

Our long-term vision is to become the trusted financial partner for millions of Indians.

Our Values :

Our culture enables us to be what we are - India's fastest-growing financial services company.

It fosters an environment where collaboration, transparency, and open communication take center-stage and hierarchies fade away.

There is space for every individual to be themselves and feel motivated to bring their best to the table, as well as craft a promising career for themselves.

The values that form our foundation are :

- Radical customer centricity.

- Ownership-driven culture.

- Keeping everything simple.

- Long-term thinking.

- Complete transparency.

Expertise and Qualifications :

We are seeking a highly motivated and experienced Senior Site Reliability Engineer to join our engineering team.

As an SRE, you will be responsible for ensuring the reliability, availability, scalability, and performance of our applications and infrastructure.

You will collaborate closely with software developers, platform engineers, and other team members to design, provision, build, and maintain systems that are scalable, secure, and highly available.

Responsibilities :

- Monitor and troubleshoot issues related to system performance, reliability, and security.

- Define and implement Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Error Budgets to measure and improve service reliability.

- Analyze and report on metrics and trace data using Grafana and Prometheus.

- Participate in an on-call rotation to provide 24/7 support for critical production systems.

- Evaluate and automate manual and repetitive tasks to reduce toil and improve system efficiency.

- Design and manage infrastructure using tools like Terraform or Crossplane, including Crossplane Composite Resource Definitions (XRDs).

- Implement and manage security measures to protect infrastructure and data.

- Coordinate between developers and operations to ensure smooth software releases and timely resolution of production issues.

- Conduct thorough root cause analysis (RCA) of production incidents and implement preventive measures.

- Review and optimize system performance, identify bottlenecks, and implement capacity planning and recovery strategies.

- Maintain comprehensive documentation of systems, processes, and incident responses.

- Continuously seek and implement improvements to infrastructure, processes, and tools to enhance system reliability and performance.

Requirements :

- 5+ years of relevant work experience.

- Bachelor's or Master's degree in Computer Science or a related field.

- Strong understanding of Linux/Unix systems administration and networking, with troubleshooting skills.

- Must have experience with Kubernetes, Docker, and other containerization technologies.

- Experience with cloud platforms such as GCP, AWS, or Azure is required.

- Strong programming skills in one or more languages such as Go, Python, or Java.

- Experience with monitoring and alerting tools such as Grafana, Prometheus, PagerDuty, or similar technologies is desirable.

- Must have experience with infrastructure provisioning tools such as Terraform, Pulumi, CloudFormation, or similar technologies.

- Strong interpersonal and team collaboration skills.


Functional Areas: Other


People are getting interviews at Groww through (based on 32 Groww interviews):

- Referral: 38%
- Job Portal: 34%
- Campus Placement: 9%
- Company Website: 6%
- Other sources: 13%

High Confidence: the data is based on a large number of responses received from the candidates.

What people at Groww are saying

What Groww employees are saying about work life (based on 205 employees):

- Flexible timing: 65%
- Monday to Friday: 51%
- No travel: 72%
- Day Shift: 100%

Groww Benefits

Health Insurance
Work From Home
Team Outings
Job Training
Cafeteria
Soft Skill Training +6 more

Compare Groww with

- Zerodha: 4.2
- Sharekhan: 3.9
- Upstox: 3.7
- Paytm Money: 3.4
- ET Money: 3.9
- Kuvera: 3.5
- Scripbox: 3.8
- Fisdom: 3.7
- Sqrrl Fintech: 4.9
- Angel One: 3.9
- SBI Cards & Payment Services: 3.7
- Axis Direct: 3.9
- Kotak Securities: 3.7
- FactSet: 4.0
- Aadhar Housing Finance: 4.1
- Bajaj Capital: 3.8
- ICICI Home Finance: 3.7
- Ocwen Financial Solutions: 4.0
- Synchrony: 4.4
- Edelweiss: 3.9

Similar Jobs for you

- Reliability Engineering Manager at Traceable | Bangalore / Bengaluru | 9-13 Yrs | ₹ 20-35 LPA
- Site Reliability Engineer at LanceSoft, Inc | Bangalore / Bengaluru | 5-10 Yrs | ₹ 12-26 LPA
- Cloud Deployment Specialist at Vintronics Consulting | Chennai | 5-8 Yrs | ₹ 15-25 LPA
- Monitor at Consultancy | 5-10 Yrs | ₹ 16-38 LPA
- Lead DevOps Engineer at Softility | 6-12 Yrs | ₹ 15-35 LPA
- Site Reliability Engineer 2 at Zinnia | 5-8 Yrs | ₹ 22-32 LPA
- Reliability Engineering Manager at Zscaler | Bangalore / Bengaluru | 8-10 Yrs | ₹ 24-30 LPA
- Cloud Infrastructure Engineer at Web Spiders India Pvt. Ltd. | Remote | 5-10 Yrs | ₹ 15-24 LPA
- Reliability Engineering Manager at Gateway Search | Bangalore / Bengaluru, Pune | 8-10 Yrs | ₹ 15-20 LPA
- Site Reliability Engineer 2 at Bright Money | 5-8 Yrs | ₹ 15-28 LPA

Groww Bangalore / Bengaluru Office Location

Bengaluru Office
Headquarter
2rd Floor, Padmavathi Complex, 80 Feet Rd, Koramangala Bengaluru