Lead Site Reliability Engineer - Docker/Kubernetes (8-15 yrs)
New Age Management Consulting
posted 18hr ago
Key skills :
- SRE (AWS) - AWS Cloud, Kubernetes (expert level), Docker, Terraform, Ansible, Azure
- SRE (GCP) - Docker, GCP Cloud, strong cloud focus, Azure, DevOps engineering
Job Responsibilities(JR) :
- Help build a Site Reliability Engineering culture by sharing the best practices, approaches, documentation, and code with other engineering teams
- Apply automation and software to any tasks or parts of the system that are performed manually
- Troubleshoot complex cross-platform issues spanning OS, networking, and databases in a cloud-based SaaS environment, and handle live production incidents
- Monitor application performance, take steps to improve overall application performance and stability, and follow through with implementation
- Conduct system analysis and configuration management, and develop improvements for system software performance, availability, and reliability
- Design, write, ship, and motivate the creation of software and systems to increase observability, product reliability and organizational efficiency
- Maintain and monitor the deployment and orchestration of servers, Docker containers, databases, and general backend infrastructure
- Develop runbooks and standard operating procedures for recurring production issues, while also working on a permanent fix
- Perform incident analysis on a regular basis to prevent incidents and find long-term fixes for them
Educational Qualifications (examples listed below) :
Key Skills(examples listed below) :
Total years of experience : 8-10
Education :
- B Tech in Computer Science or related
Skills :
- Experience in monitoring and analyzing infrastructure performance using standard performance monitoring tools
- Demonstrable experience in containerization (Docker) and orchestration (Kubernetes) preferred
- Experience with Infrastructure as Code (Terraform, CloudFormation, Ansible)
- Knowledge of and proven hands-on experience with large-scale databases and distributed technologies, such as Kafka and Confluent Platform
- Basic programming and scripting skills
- Major Stakeholders (intra-team and cross-functional stakeholders the role must interact with to discharge its duties) (examples listed below)
Functional Areas: Other