i
Coders Brain
442 Coders Brain Jobs
8-13 years
Bangalore / Bengaluru
Lead Platform Engineer - Azure Kubernetes Service (8-13 yrs)
Coders Brain
posted 1mon ago
Flexible timing
Key skills for the job
Lead Platform Engineer (Azure Kubernetes Services)
- Seeking a seasoned Lead Platform Engineer to drive the maintenance and expansion of our Azure Kubernetes Services (AKS) environment.
- This critical role is central to our IT services and cloud platform strategy, overseeing the architecture, design, development, implementation, and on-call production support of our 18-component Kubernetes ecosystem and all associated CI/CD pipeline services.
- In this position, you will collaborate closely with digital product, software development, infrastructure, and operations teams to enhance the Developer Experience.
- You'll lead the utilization of CI/CD tools, including GitHub Actions and Flux CD, and leverage monitoring tools such as Grafana to ensure optimal performance of our applications and API services.
- As a thought leader in Kubernetes, you will play a key role in shaping and executing our Kubernetes platform strategy, managing technical debt efficiently, and ensuring a robust, scalable, and secure platform.
- Additionally, as a member of the Enterprise Architecture team, you will leverage your deep expertise in Application and Cloud Platform Engineering to help drive Camping World's growth and long-term success.
What You'll Do :
- Architect, design, and implement Kubernetes clusters on Azure Kubernetes Service (AKS), ensuring high availability, scalability, and reliability.
- Develop, manage, and support Infrastructure as Code (IaC) components, leveraging Terraform to deploy and maintain primary and supporting infrastructures.
- Design, implement, and maintain CI/CD pipelines for Kubernetes deployments, utilizing GitHub Actions and Flux CD.
- Collaborate with development teams by offering guidance throughout the development and deployment phases, reviewing and modifying code within GitHub repositories to ensure smooth integration and fully automated deployment processes.
- Provide on-call production support, troubleshoot, and resolve complex issues related to AKS and container orchestration, ensuring quick resolution and minimal downtime.
- Optimize cluster performance, scalability, and security to meet evolving requirements and resolve technical challenges.
- Monitor and manage Kubernetes resources using observability tools (Grafana, SolarWinds, Dynatrace, Datadog, New Relic, etc.) to proactively identify and resolve issues.
- Troubleshoot and address malfunctioning or underperforming applications, ensuring root causes are identified and long-term solutions are implemented.
- Serve as a thought leader in Kubernetes, driving the platform strategy, advocating for best practices, and fostering continuous improvement and innovation.
What You'll Need to Have for the Role:
- 5+ years of hands-on experience in designing, managing, and supporting complex, enterprise-grade Microsoft AKS environments.
- Extensive experience with Azure cloud services, including Azure SQL Database, Storage Accounts, and Azure Container Registry.
- Strong understanding and hands-on experience with Terraform for automating infrastructure deployment and management.
- Deep knowledge of containerization technologies (Docker) and orchestration (Kubernetes), including Helm for managing Kubernetes applications.
- Proven experience in designing, implementing, and managing CI/CD pipelines using GitHub Actions and Flux CD.
- Proficient in reading, understanding, and modifying code in GitHub, supporting development teams, and ensuring smooth integration with Kubernetes platforms.
- Expertise in security best practices within Kubernetes environments, ensuring secure and compliant deployments.
- Hands-on experience with monitoring and observability tools, including the Grafana stack (Grafana, Loki, Mimir, Tempo), for creating dashboards and alerts.
- Practical experience with Kuma/Kong Mesh service mesh technologies.
- Hands-on experience managing Kong API gateways.
- Exceptional problem-solving skills and strong communication abilities, capable of leading troubleshooting sessions and guiding cross-functional teams.
- Experience in platform architecture (IaaS, PaaS), site reliability engineering (SRE), quality assurance (QA), system design, integrations, and end-to-end implementation.
- Experience working with Enterprise Architecture (EA) teams, participating in EA processes, and engaging with Architecture Review Boards (ARB), Change Advisory Boards (CAB), and other governance bodies (GRC).
Functional Areas: Other
Read full job descriptionPrepare for Platform Engineer Lead roles with real interview advice
8-13 Yrs
Bangalore / Bengaluru
4-8 Yrs