i
Coders Brain
353 Coders Brain Jobs
6-10 years
Bangalore / Bengaluru
Lead Platform Engineer - Azure Kubernetes Service (6-10 yrs)
Coders Brain
posted 2mon ago
Flexible timing
Key skills for the job
Role : Lead Platform Engineer (Azure Kubernetes Services)
Seeking a seasoned Lead Platform Engineer to drive the maintenance and expansion of our Azure Kubernetes Services (AKS) environment. This critical role is central to our IT services and cloud platform strategy, overseeing the architecture, design, development, implementation, and on-call production support of our 18-component Kubernetes ecosystem and all associated CI/CD pipeline services.
In this position, you will collaborate closely with digital product, software development, infrastructure, and operations teams to enhance the Developer Experience. You'll lead the utilization of CI/CD tools, including GitHub Actions and Flux CD, and leverage monitoring tools such as Grafana to ensure optimal performance of our applications and API services.
As a thought leader in Kubernetes, you will play a key role in shaping and executing our Kubernetes platform strategy, managing technical debt efficiently, and ensuring a robust, scalable, and secure platform. Additionally, as a member of the Enterprise Architecture team, you will leverage your deep expertise in Application and Cloud Platform Engineering to help drive Camping World's growth and long-term success.
What You'll Do :
- Architect, design, and implement Kubernetes clusters on Azure Kubernetes Service (AKS), ensuring high availability, scalability, and reliability.
- Develop, manage, and support Infrastructure as Code (IaC) components, leveraging Terraform to deploy and maintain primary and supporting infrastructures.
- Design, implement, and maintain CI/CD pipelines for Kubernetes deployments, utilizing GitHub Actions and Flux CD.
- Collaborate with development teams by offering guidance throughout the development and deployment phases, reviewing and modifying code within GitHub repositories to ensure smooth integration and fully automated deployment processes.
- Provide on-call production support, troubleshoot, and resolve complex issues related to AKS and container orchestration, ensuring quick resolution and minimal downtime.
- Optimize cluster performance, scalability, and security to meet evolving requirements and resolve technical challenges.
- Monitor and manage Kubernetes resources using observability tools (Grafana, SolarWinds, Dynatrace, Datadog, New Relic, etc.) to proactively identify and resolve issues.
- Troubleshoot and address malfunctioning or underperforming applications, ensuring root causes are identified and long-term solutions are implemented.
- Serve as a thought leader in Kubernetes, driving the platform strategy, advocating for best practices, and fostering continuous improvement and innovation.
What You'll Need to Have for the Role :
- 5+ years of hands-on experience in designing, managing, and supporting complex, enterprise-grade Microsoft AKS environments.
- Extensive experience with Azure cloud services, including Azure SQL Database, Storage Accounts, and Azure Container Registry.
- Strong understanding and hands-on experience with Terraform for automating infrastructure deployment and management.
- Deep knowledge of containerization technologies (Docker) and orchestration (Kubernetes), including Helm for managing Kubernetes applications.
- Proven experience in designing, implementing, and managing CI/CD pipelines using GitHub Actions and Flux CD.
- Proficient in reading, understanding, and modifying code in GitHub, supporting development teams, and ensuring smooth integration with Kubernetes platforms.
- Expertise in security best practices within Kubernetes environments, ensuring secure and compliant deployments.
- Hands-on experience with monitoring and observability tools, including the Grafana stack (Grafana, Loki, Mimir, Tempo), for creating dashboards and alerts.
- Practical experience with Kuma/Kong Mesh service mesh technologies.
- Hands-on experience managing Kong API gateways.
- Exceptional problem-solving skills and strong communication abilities, capable of leading troubleshooting sessions and guiding cross-functional teams.
- Experience in platform architecture (IaaS, PaaS), site reliability engineering (SRE), quality assurance (QA), system design, integrations, and end-to-end implementation.
- Experience working with Enterprise Architecture (EA) teams, participating in EA processes, and engaging with Architecture Review Boards (ARB), Change Advisory Boards (CAB), and other governance bodies (GRC).
Functional Areas: Other
Read full job descriptionPrepare for Platform Engineer Lead roles with real interview advice
6-10 Yrs
Bangalore / Bengaluru