Implement and maintain CI/CD pipelines to automate build, test, and deployment processes for our ride-hailing platforms microservices architecture.
Develop and maintain infrastructure as code (IaC) and manage cloud resources on AWS/GCP/Azure.
Monitor system performance and reliability, proactively identify and resolve issues, and implement enhancements to optimise resource utilisation and scalability.
Develop in-house solutions to standardise and ease the management of cloud resources by the product engineering team
Collaborate with cross-functional teams to design and implement robust, scalable, and secure cloud infrastructure solutions that meet the requirements of our ride-hailing platform.
Implement and manage container orchestration using Kubernetes/Google Cloud Run or similar tools to deploy and manage containerized applications at scale.
Setup and manage observability stack (monitoring, alerting) for monitoring infrastructure and applications including health, performance, bottlenecks, etc.
Automate routine operational tasks and administrative processes to improve efficiency and reduce manual intervention.
Implement security best practices and compliance standards to safeguard sensitive data and ensure regulatory compliance.
Participate in on-call rotation and incident response, troubleshoot and resolve production issues promptly to minimise service disruptions.
Stay up-to-date with industry trends, emerging technologies, and best practices in DevOps and cloud computing, and share knowledge with the team.
Qualifications
Bachelors degree in Computer Science, Engineering, or related field.
3-5 years of experience in a DevOps or Site Reliability Engineering (SRE) role, preferably in a cloud-based environment.
Strong proficiency in scripting languages such as Python, Bash, or PowerShell for automation and infrastructure management tasks.
Hands-on experience with cloud platforms such as AWS, GCP, or Azure, including compute, storage, networking, and security services.
Experience with containerization and orchestration technologies such as Docker and Kubernetes.
Familiarity with configuration management tools like Ansible, Chef, or Puppet.
Knowledge of CI/CD tools such as Jenkins, GitLab CI/CD, or CircleCI.
Proficient in version control systems such as Git.
Solid understanding of networking concepts, TCP/IP stack, DNS, and load balancing.
Excellent problem-solving skills, with a strong focus on automation, scalability, and reliability.
Effective communication skills and ability to collaborate with cross-functional teams in a fast-paced environment.
Preferred Qualifications
Certification in cloud platforms (e.g., AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer, Microsoft Certified: Azure DevOps Engineer Expert).
Experience with monitoring and observability tools such as Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), or Datadog.
Knowledge of infrastructure security best practices, including identity and access management (IAM), encryption, and network security controls.
Familiarity with agile development methodologies and practices.
Previous experience in the transportation or mobility industry is a plus.