18 Patch Infotech Jobs
4-9 years
Kubernetes Operator - Site Reliability Engineering (4-9 yrs)
Patch Infotech
posted 11hr ago
Flexible timing
Key skills for the job
Job Description :
We are looking for an experienced Kubernetes Operator to join our team. The ideal candidate will have expertise in Kubernetes operator-based applications, deep knowledge of CRD-based deployments, and the ability to optimize and troubleshoot complex cloud-native environments.
The SRE will be responsible for ensuring high availability, performance, and scalability of ouri nfrastructure while working closely with development and operations teams.
Roles and Responsibilities :
- Kubernetes Operator Expertise - Deploy, manage, and maintain Kubernetes operator-based applications in cloud and on-prem environments.
- CRD-Based Deployments - Implement and troubleshoot Custom Resource Definition (CRD)-based deployments to enhance automation and operational efficiency.
- Region Awareness & Pod Topology Spread Constraints - Configure Kubernetes workloads with pod topology spread constraintsto achieve high availability and fault tolerance across multiple regions.
- Node Affinity & Scheduling Policies - Apply node selector and affinity rules to optimize pod scheduling and resource allocation across nodes.
- Cluster Deployment & Upgrades - Troubleshoot and optimize cluster deployments, operator installations, and rolling updatesto ensure smooth and reliable system upgrades.
- Incident Management & Troubleshooting - Diagnose and resolve infrastructure and application issues by analyzing logs, metrics, and alerts.
- Customer Support & Ticket Handling - Work on customer tickets, provide effective solutions, and collaborate with development teams to resolve issues efficiently.
- Application Monitoring & Optimization -Utilize monitoring tools to analyse application performance and implement improvements.
- Documentation & Knowledge Sharing - Create and maintain technical documentation, troubleshooting guides, and best practicesfor internal teams and customers.
- Automation & CI/CD Integration - Improve deployment efficiency by implementing automation, Infrastructure as Code (IaC), and CI/CD pipelines using tools.
Requirements :
Must Have Skills :
Education : B.Tech in computer engineering, Information Technology, or related field.
- Experience : 5+ years of Experience with Kubernetes Operator Expertise.
- Having in depth knowledge on deploy, manage, maintain and pod topology.
- CRD-Based Deployments : 3+ Years of in-depth experience to implement and trouble shoot CRD.
- Application Monitoring & Optimization : 3+ Years of experience in using tools such as Grafana, Prometheus
- Terraform or Helm : 2+ years of experience in using terraform or Helm for infrastructure Automation & CI/CD Integration.
- Bash, Python, or Golang : 2+ years of experience and in depth understanding of scripting tools
Functional Areas: Software/Testing/Networking
Read full job descriptionPrepare for Site Reliability Engineer roles with real interview advice