Cloud Observability: Implement and manage advanced cloud observability tools to monitor and optimize system performance, ensuring high availability and reliability for our service offerings.
Container Orchestration: Deploy and maintain Kubernetes (K8s) clusters, specifically using Amazon EKS, to ensure robust container orchestration and streamlined deployments.
Cloud Expertise: Architect, manage, and optimize scalable cloud infrastructure on at least one of the major cloud platforms (AWS, Azure, or Google Cloud).
Operations & Incident Response: Lead the incident response process using tools such as PagerDuty or an equivalent to ensure critical issues are identified, managed, and resolved quickly, maintaining service continuity.
FinOps: Analyze and manage cloud costs using FinOps principles, employing cost-reporting tools and aggregators to ensure cost efficiency and resource optimization.
CI/CD: Design, implement, and maintain continuous integration and continuous deployment (CI/CD) pipelines to improve development workflows and shorten delivery cycles.
Developer Tools: Advance the use of modern developer tools, including AI-powered assistants like GitHub Copilot, to streamline coding practices and enhance overall productivity.
Repository and Source Control Architecture: Architect and manage repository structures and source control processes to support development teams in effectively collaborating and maintaining code quality.
Developer Build Environments: Create and optimize developer build environments to facilitate smooth and efficient development, testing, and deployment processes.
Data Storage Efficiency: Design strategies and systems for efficient data storage, ensuring optimal performance, cost management, and scalability of storage solutions.
Requirements:
Minimum of 5 years of relevant experience in a DevOps role, with a focus on cloud observability, container orchestration, and cloud operations.
Cloud Expertise: In-depth knowledge of and hands-on experience with at least one of the major cloud platforms (AWS, Azure, or Google Cloud).
Container Orchestration: Proven experience with Kubernetes (K8s) and Amazon Elastic Kubernetes Service (EKS).
Incident Response: Familiarity with incident management tools like PagerDuty or equivalent, with a track record of handling and resolving critical incidents.
FinOps: Strong understanding of Financial Operations (FinOps) principles and experience with tools and techniques for managing cloud costs.
CI/CD Pipelines: Demonstrable experience in designing and maintaining CI/CD pipelines using tools such as Jenkins, GitLab CI, or similar.
Developer Tools: Proficiency with modern developer tools, including AI-powered assistants like GitHub Copilot, and a solid understanding of their impact on the development lifecycle.
Source Control: Expertise in repository management and source control systems, particularly with Git, GitHub, GitLab, or similar platforms.
Build Environments: Experience in creating and optimizing developer build environments, ensuring they are efficient and conducive to high-quality code production.
Data Storage: Proficiency in designing and managing data storage solutions that balance performance with cost-efficiency, considering scalability requirements.
Problem-Solving Skills: Strong analytical and troubleshooting skills to resolve complex technical issues and optimize system performance.
Collaboration: Excellent communication and collaboration skills to work effectively with cross-functional teams, fostering a culture of continuous improvement.
Nice to Have:
Certification in a relevant cloud platform (e.g., AWS Certified Solutions Architect, Google Cloud Professional Cloud Architect, or Microsoft Certified: Azure Solutions Architect Expert).
Experience implementing and managing infrastructure with Infrastructure as Code (IaC) tools such as Terraform or CloudFormation.
Familiarity with security best practices and compliance standards in cloud environments.