Coders Brain
Site Reliability Engineer - IT Infrastructure (7-10 yrs)
posted 11hr ago
Flexible timing
Job Description:
- Experience with cloud platforms (AWS, Azure, GCP).
- Familiarity with CI/CD pipelines and tools like Jenkins, Git, or similar.
- Exposure to Kubernetes and container orchestration technologies.
- Certifications in Linux administration, SRE, DevOps, or cloud technologies.
- Strong Linux Administration experience, including deep expertise in system configuration, user management, kernel tuning, log analysis, and performance troubleshooting.
- Hands-on experience with Docker and container orchestration (e.g., Kubernetes).
- Solid knowledge of Terraform for infrastructure automation, provisioning, and management.
- Proficiency with monitoring and alerting tools like Grafana, Prometheus, and Opsgenie to track performance and uptime.
- Experience in incident management, resolving issues promptly while meeting SLAs and SLOs, as measured by SLIs.
- Expertise in debugging and resolving complex technical issues in distributed systems, with a focus on minimizing downtime.
- Proven ability to write and maintain runbooks and operational procedures for troubleshooting and system recovery.
- Experience in data center management and ensuring 24/7 availability of production infrastructure.
- Strong understanding of automation tools (e.g., Terraform, Ansible) and continuous improvement practices.
Functional Areas: Software/Testing/Networking