i
RARR Technologies
354 RARR Technologies Jobs
5-8 years
Kolkata, Mumbai, New Delhi + 4 more
1 vacancy
Sr. Site Reliability Engineer (AWS)
RARR Technologies
posted 2hr ago
Flexible timing
Key skills for the job
As a Senior Site Reliability Engineer (SRE), you will play a critical role in ensuring the reliability, scalability, and performance of our cloud infrastructure on AWS. You will collaborate with cross-functional teams to design, implement, and manage systems and processes that enable continuous availability and seamless operation of our applications and services. The ideal candidate will have extensive experience in AWS cloud technologies, strong problem-solving skills, and a passion for building resilient and efficient systems.
Responsibilities:
- Design, implement, and maintain highly available and scalable cloud infrastructure on AWS platform.
- Develop and implement automated monitoring, alerting, and incident response mechanisms to ensure proactive identification and resolution of system issues.
- Collaborate with software engineering teams to establish Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to measure system reliability and performance.
- Integrate security practices into the DevOps pipeline, ensuring the implementation of security controls at every stage of the software development lifecycle.
- Architect, deploy, and manage cloud infrastructure at scale, with a focus on security best practices and compliance requirements.
- Monitor security alerts and incidents and respond promptly to security breaches and incidents.
- Conduct regular performance analysis, capacity planning to anticipate and address scaling requirements.
- Implement and maintain disaster recovery and failover strategies to mitigate service disruptions and ensure business continuity.
- Lead incident response and post-mortem analysis to identify root causes and implement preventive measures.
- Continuously improve system reliability through automation, optimization, and implementation of best practices.
- Stay updated with the latest AWS services and technologies and evaluate their applicability to enhance our infrastructure and operations.
- Mentor junior team members and foster a culture of collaboration, learning, and continuous improvement
Qualifications:
- Bachelor s degree in computer science, Engineering, or related field. Master s degree preferred.
- AWS Certified Solutions Architect - Professional or AWS Certified DevOps Engineer - Professional certification is required.
- 6 - 7 years of experience in Site Reliability Engineering, DevOps, or related roles, with a focus on AWS cloud technologies.
- Strong understanding of cloud architecture principles and experience with AWS services such as EC2, S3, RDS, Lambda, DynamoDB, etc.
- Proficiency in scripting and automation using languages such as Python, Bash, or PowerShell.
- Experience with infrastructure as code (IaC) tools such as Terraform or CloudFormation for provisioning and configuration management.
- Hands-on experience with monitoring, logging, and observability tools such as CloudWatch, Prometheus, Grafana, ELK stack, etc.
- Solid understanding of CI/CD principles and experience with related tools like Jenkins, GitLab CI/CD, or AWS Code Pipeline.
- Excellent problem-solving skills and the ability to troubleshoot complex issues in distributed systems.
- Strong communication and collaboration skills, with the ability to work effectively in cross-functional teams and influence stakeholders at all levels.
Service Level Indicators (Slis), Aws, Service Level Objectives (Slo), Site Reliability Engineer
Employment Type: Full Time, Permanent
Read full job description5-8 Yrs
Kolkata, Mumbai, New Delhi +4 more
15-18 Yrs
₹ 30 - 40L/yr
Hyderabad / Secunderabad, Pune, Chennai