Upload Button Icon Add office photos

Okta

Compare button icon Compare button icon Compare
filter salaries All Filters

75 Okta Jobs

Okta - Staff Site Reliability Engineer - AWS Infrastructure (8-10 yrs)

8-10 years

Okta - Staff Site Reliability Engineer - AWS Infrastructure (8-10 yrs)

Okta

posted 1d ago

Job Description

Position Overview :

The Staff Site Reliability Engineer (SRE) will play a key role in building and managing Kubernetes platforms that support cloud-native applications and services.

This position focuses on architecting and managing reliable, scalable, and secure Kubernetes-based platforms on AWS, ensuring high availability and performance while optimizing costs and automation.

The ideal candidate will have hands-on experience with AWS infrastructure, Kubernetes platform creation, Helm charts, Karpenter scaling, and Istio service mesh.

Key Responsibilities :

Kubernetes Platform Creation :


- Design, implement, and maintain highly available, scalable, and fault-tolerant Kubernetes platforms.

- Ensure clusters are optimized for production workloads, providing high resilience and operational efficiency.

AWS Infrastructure Management :


- Build, manage, and optimize AWS cloud infrastructure, including EKS,ECS, S3, VPCs, RDS, IAM, and more.

- Implement best practices for cost management, scaling, and security within AWS.

Helm Management :


- Utilize Helm to automate and streamline the deployment of applications and services to Kubernetes clusters.

- Create, maintain, and manage Helm charts for production-ready deployments.

Karpenter Implementation :


- Implement and manage Karpenter to dynamically scale Kubernetes clusters in response to workload demands.

Istio Service Mesh Management :


- Configure and manage Istio to provide service-to-service communication, security, and observability within the Kubernetes clusters.

- Enable fine-grained traffic management, service discovery, and policy enforcement.

Platform Automation & Scaling :


- Automate the deployment, scaling, and management of infrastructure and applications.

- Work with CI/CD pipelines to ensure a seamless flow from development to production with minimal downtime.

Incident Management & Troubleshooting :


- Respond to incidents, troubleshoot, and resolve system issues related to performance, availability, and security in a timely and effective manner.

Security & Compliance :


- Design and implement secure cloud infrastructure with appropriate access controls, network security, and compliance frameworks.

Required Qualifications :

- 5+ years of experience with Kubernetes/ K8s, Helm,Karpenter,Istio;

- 8+ years of Experience with infrastructure-as-code tools like Terraform, Chef or Ansible

- 8+ years of Experience with serverless computing (AWS Lambda, API Gateway) and microservices architecture.

- Proven experience with AWS (EKS, ECS, RDS, S3, CloudFormation, IAM, etc.) and solid understanding of cloud-native architectures.

- Strong expertise in Kubernetes platform creation, management, and optimization (e., setting up highly available clusters, networking, and storage).

- Hands-on experience with Helm for Kubernetes application deployment and management.

- Practical experience with Karpenter for dynamic scaling of Kubernetes clusters and optimizing resource usage.

- Expertise in managing and securing Istio for service mesh, including traffic management, security, and observability features.

- Proficiency in CI/CD pipelines and automation tools (e., Jenkins, GitLab, CircleCI, Terraform, Spinnaker, Ansible).

- Strong scripting and automation skills in Python or Go for infrastructure management and platform automation.

- Experience with monitoring, logging, and alerting tools such as Prometheus, Grafana, CloudWatch, and ELK Stack.

Preferred Qualifications :

- Experience with multi-region cloud environments.

- Understanding of security best practices for cloud platforms and Kubernetes (e., role-based access control (RBAC), encryption, and compliance frameworks).

- Familiarity with Docker and containerization principles.

- Bachelor's degree in Computer Science, Engineering, or related field (or equivalent professional experience).

Certifications (Preferred) : CKA (Certified Kubernetes Administrator), CKAD (Certified Kubernetes Application Developer), or AWS Certified DevOps Engineer are highly desirable


Functional Areas: Software/Testing/Networking

Read full job description

Okta Interview Questions & Tips

Prepare for Okta Site Reliability Engineer roles with real interview advice

Top Okta Site Reliability Engineer Interview Questions

Q1. Difference b/w freetyle and normal pipeline . How to check if build is successful.
Q2. How to manage terraform state file in common place so if one making changes other get modified file.
Q3. Given 1000 servers and continue running steam check the stream is strictly increasing order. Divide those streams in 100 servers and process ... read more
View all 12 questions

What people at Okta are saying

What Okta employees are saying about work life

based on 12 employees
56%
87%
89%
60%
Strict timing
Monday to Friday
No travel
Day Shift
View more insights

Okta Benefits

Free Transport
Child care
Gymnasium
Cafeteria
Work From Home
Free Food +6 more
View more benefits

Compare Okta with

Google

4.4
Compare

Zscaler Softech

3.6
Compare

Palo Alto Networks

3.9
Compare

Gen

4.0
Compare

CyberArk

3.8
Compare

Proofpoint

4.1
Compare

CrowdStrike

4.1
Compare

FireEye

4.3
Compare

Splunk

4.4
Compare

Trend Micro

4.3
Compare

Qualys

3.8
Compare

MagicPin

3.0
Compare

HealthKart

4.0
Compare

Awign Enterprises

4.0
Compare

Nestaway

3.9
Compare

Shaadi.com

3.3
Compare

Way.com

4.6
Compare

Flyhomes

4.3
Compare

Simplimadly

Compare

Ketto

3.9
Compare

Similar Jobs for you

Site Reliability Engineer at Grizmo Labs Private Limited

5-7 Yrs

₹ 15-20 LPA

Senior Site Reliability Engineer at NetAnalytiks Technologies

6-10 Yrs

₹ 20-30 LPA

Site Reliability Engineer at Flipped.ai

5-8 Yrs

₹ 14-25 LPA

Site Reliability Engineer at Whitefield Careers

7-10 Yrs

₹ 20-25 LPA

Site Reliability Engineer at IT Firm

5-8 Yrs

₹ 28-45 LPA

Site Reliability Engineer at Fixity Technologies

6-9 Yrs

₹ 18-20 LPA

Site Reliability Engineer at Shadow Placements

6-10 Yrs

₹ 15-32 LPA

Site Reliability Engineer at MRCC

7-15 Yrs

₹ 15-28 LPA

Site Reliability Engineer at Burgeon It Services Pvt Ltd

8-10 Yrs

₹ 15-25 LPA

Site Reliability Engineer at Coders Brain Technology Private Limited

10-15 Yrs

₹ 15-30 LPA

Enterprise Regional Sales Manager, Auth0

5-12 Yrs

New Delhi

1d ago·via naukri.com

Application Analyst

3-5 Yrs

Bangalore / Bengaluru

2d ago·via naukri.com

Senior Site Reliability Engineer

3-6 Yrs

Bangalore / Bengaluru

2d ago·via naukri.com

Senior Application Analyst

3-7 Yrs

Bangalore / Bengaluru

2d ago·via naukri.com

Data Quality Analyst II - Salesforce, ETL

3-7 Yrs

Bangalore / Bengaluru

2d ago·via naukri.com

Senior RPA Developer - Automation Anywhere

5-8 Yrs

Bangalore / Bengaluru

2d ago·via naukri.com

Senior Revenue Operations Specialist

5-9 Yrs

Bangalore / Bengaluru

2d ago·via naukri.com

Senior DevSecOps - AWS - SRE

5-10 Yrs

Bangalore / Bengaluru

2d ago·via naukri.com

Senior Lab Infrastructure Specialist - DevOps, Cloud

5-10 Yrs

Bangalore / Bengaluru

2d ago·via naukri.com
write
Share an Interview