Experience with cloud platforms, containerization, and CI/CD tools. Strong skills in automation and monitoring tools (Prometheus, Grafana).
Build and manage scalable cloud platforms (AWS, Azure, or GCP). Develop CI/CD pipelines and use Infrastructure as Code (Terraform, Ansible). Work with Docker/Kubernetes to deploy and manage microservices. Monitor performance, troubleshoot, and improve platform efficiency.
5+ years in platform engineering or DevOps : Candidates should demonstrate hands-on experience managing cloud infrastructure, optimizing deployments, and ensuring platform reliability. A proven track record in automating processes, scaling environments, and improving platform performance is key.
Cloud Platforms : Expertise in one or more cloud environments (AWS, Azure, or GCP). Candidates should be comfortable deploying, maintaining, and scaling cloud infrastructure.
Containerization and CI/CD : Experience with Docker and Kubernetes for containerization and orchestration of microservices. Additionally, candidates should have experience in building and maintaining Continuous Integration and Continuous Deployment (CI/CD) pipelines to ensure automated, streamlined software delivery.
Automation and Monitoring Tools : Strong skills in automation using tools such as Terraform and Ansible for Infrastructure as Code (IaC). Experience in setting up and managing monitoring solutions using tools like Prometheus and Grafana to track system health and performance metrics and to enable proactive troubleshooting.
CI/CD Pipelines and Infrastructure as Code (IaC) : Proficiency in setting up CI/CD pipelines and utilizing tools like Terraform or Ansible for IaC. The ability to automate deployments and infrastructure provisioning across environments is essential.
Docker/Kubernetes : Experience in deploying and managing applications using Docker for containerization and Kubernetes for orchestration, ensuring scalable and highly available services.
Performance Monitoring and Troubleshooting : Skills in monitoring platform performance, identifying bottlenecks, and improving efficiency through tools and techniques for proactive troubleshooting and resource optimization.