Grizmo Labs
Site Reliability Engineer - AWS Infrastructure (5-7 yrs)
posted 4d ago
Fixed timing
Key Responsibilities:
- Design, build, and maintain highly available, scalable, and resilient AWS infrastructure.
- Automate infrastructure provisioning and management using tools like Terraform, CloudFormation, and Ansible.
- Implement and maintain robust monitoring and alerting systems using tools like Prometheus, Grafana, CloudWatch, and Datadog.
- Respond to and resolve production incidents effectively and efficiently.
- Conduct root cause analysis of incidents and implement preventative measures.
- Develop and implement automation scripts for various infrastructure tasks.
- Participate in capacity planning and performance tuning.
- Collaborate with development teams to improve the reliability and performance of applications.
- Stay up to date with the latest AWS technologies and best practices.
- Contribute to the development and improvement of SRE best practices and processes.
Required Skills:
- 5+ years of experience in managing and maintaining AWS infrastructure.
- Strong experience with core AWS services such as EC2, VPC, ECS/EKS, S3, RDS, Lambda, and IAM.
- Proficiency in scripting languages like Python or Go.
- Experience with configuration management tools like Ansible, Puppet, or Chef.
- Experience with containerization technologies like Docker and Kubernetes.
- Experience with monitoring and alerting tools like Prometheus, Grafana, CloudWatch, and Datadog.
- Strong understanding of networking concepts (TCP/IP, routing, subnetting).
- Excellent problem-solving, analytical, and troubleshooting skills.
- Strong communication and collaboration skills.
Desired Skills:
- Experience with DevOps practices and tools (CI/CD pipelines).
- Experience with security best practices and tools.
- Experience with serverless computing.
- Experience with container orchestration platforms like Kubernetes.
- AWS certifications (AWS Certified Solutions Architect, AWS Certified DevOps Engineer).
Functional Areas: Software/Testing/Networking
15-20 Yrs
Bangalore / Bengaluru