29 Vinz Global Jobs
11-13 years
Site Reliability Engineer - Google Cloud Platform (11-13 yrs)
Vinz Global
posted 6d ago
Flexible timing
Key skills for the job
We are seeking a highly experienced Senior Cloud Engineer or Site Reliability Engineer (SRE) with over 11 years of IT expertise.
This role requires advanced knowledge in Kubernetes, cloud platforms, CI/CD, and automation tools.
You will play a critical role in designing, deploying, and maintaining cloud-based infrastructure and applications to ensure reliability, scalability, and performance.
Roles and Responsibilities :
- Design and maintain Kubernetes (GKE, OpenShift) clusters, including upgrades, migrations, and microservices deployment.
- Develop and maintain Infrastructure as Code (IaC) using Terraform, Helm Charts, and Google Deployment Manager.
- Automate integrations and processes across environments using Ansible, Jenkins pipelines, and Groovy scripts.
- Build and containerize applications using Docker for Java Spring Boot and Node.js microservices.
- Implement service mesh solutions like Istio and configure Ingress controllers for Kubernetes.
- Collaborate on cloud infrastructure design and manage GCP components, including IAM, Pub/Sub, BigQuery, VPC, and Cloud DNS.
- Integrate HashiCorp Vault with Kubernetes/OpenShift for secure secret management.
- Create and manage CI/CD pipelines with Jenkins and GitLab, leveraging shared libraries for efficient automation.
- Conduct performance tuning, autoscaling, and disaster recovery planning for cloud-based applications.
- Develop custom images using Packer for tools like Jenkins, MySQL, and BI Connector.
- Ensure robust monitoring and logging using tools like Kibana, Grafana, Splunk, and AppDynamics.
- Perform troubleshooting, debugging, and root cause analysis for infrastructure and application issues.
- Migrate legacy on-prem applications to modern cloud-native architectures on GKE and OpenShift.
- Guide and mentor team members on best practices in DevOps, cloud engineering, and automation.
Skills and Qualifications Required :
- 11+ years of IT experience, including roles in SRE, Cloud Engineering, System Administration, and Software Development.
- Certified Kubernetes Administrator (CKA) with expertise in Kubernetes and OpenShift.
- Proficiency in IaC tools such as Terraform, Helm Charts, and Google Deployment Manager.
- Advanced scripting skills in Python, Shell, and Groovy for automation and maintenance tasks.
- Experience in service mesh technologies like Istio and secure integration with HashiCorp Vault.
- Strong knowledge of GCP services, including GCE, GKE, Pub/Sub, BigQuery, and VPC configurations.
- Hands-on experience with CI/CD tools like Jenkins and GitLab, and configuration management tools like Ansible, Puppet, and Chef.
- Familiarity with logging and monitoring tools such as Kibana, Grafana, Prometheus, Splunk, and AppDynamics.
- Proficient in database technologies like MongoDB, MySQL, ProxySQL, and BI Connector.
- Strong problem-solving skills and expertise in performance tuning and disaster recovery planning.
- Knowledge of agile and waterfall methodologies, with a focus on collaboration and team guidance.
- Excellent communication and organizational skills, with the ability to manage multiple complex tasks.
Technical Skills :
- Cloud Platforms : Google Cloud Platform (GCP)
- Containerization and CD : Docker, Kubernetes (GKE), OpenShift
- IaC Tools : Terraform, Google Deployment Manager
- Service Mesh : Istio
- SCM and Artifact Tools : Git, Nexus
- Logging & Monitoring : Kibana, Grafana, Splunk, Prometheus, Datadog
- Languages : C, Java, Groovy, Python, Shell scripting
- Configuration Management : Puppet, Ansible, Chef
Functional Areas: Software/Testing/Networking
Read full job descriptionPrepare for Site Reliability Engineer roles with real interview advice