Job Title - Site Reliability Engineer + Specialist + Global Song
Management Level: 9, Specialist
Location: Kochi
Must have skills: Python, Go, or Java
Good to have skills: Expertise with cloud platforms (AWS, Azure, GCP) and tools.
Job Summary: As a Site Reliability Engineer (SRE), you'll bring together your software engineering expertise and systems knowledge to ensure our systems are scalable, reliable, and efficient. You'll be instrumental in automating operations, solving complex infrastructure challenges, and driving continuous improvement to deliver seamless and resilient services.
Your responsibilities will include:
Design, build, and maintain scalable infrastructure and systems.
Automate operational tasks to improve efficiency and reliability.
Implement application monitoring and continuously improve application performance and stability.
Develop and implement disaster recovery and incident management strategies.
Collaborate with developers to improve application architecture and deployment.
Optimize system availability, latency, and performance metrics.
Manage CI/CD pipelines for seamless software delivery.
Perform root cause analysis and lead detailed post-mortems.
Consult with software development teams to implement reliability best practices.
Write and maintain infrastructure and operational documentation.
Take operational responsibility for a number of distributed applications, including on-call shifts.
Roles & Responsibilities:
Strong experience in software engineering and systems architecture.
Multiple years of experience programming in languages such as Python, Go, or Java.
Expertise with cloud platforms (AWS, Azure, GCP) and tools.
Hands-on experience with infrastructure as code (Terraform, Ansible, etc.).
Familiarity with Linux/Unix systems and networking fundamentals.
Familiarity with containerization and orchestration tools like Docker and Kubernetes.
Proven ability to monitor, debug, and optimize distributed systems.
Experience managing CI/CD pipelines and automation frameworks.
Strong problem-solving skills and attention to detail.
Excellent communication and collaboration skills for cross-functional teamwork.
Ability to analyze and improve complex systems for reliability and scalability.
Self-motivated with a passion for continuous learning and improvement.
Professional & Technical Skills:
Additional Information:
Qualifications
Experience: Minimum 7-10 years of experience is required
Educational Qualification: Any graduation / BE / B Tech