Your role is crucial in pushing our technology forward to optimize functionality, performance, reliability, and scalability. You will gain an in-depth understanding of our products and services to drive next-generation cloud platform development. This involves estimating engineering efforts, prioritizing projects, planning implementations, and triaging production issues.
Key Responsibilities
Platform Development with Best-in-Class Technologies: You will build and refine our cloud platform using cutting-edge tools like Kubernetes for container orchestration, Ansible and Terraform for infrastructure management, and distributed systems that ensure scalability and resilience. With a focus on Container-as-a-Service (CaaS), you will help empower IT-secured, developer-friendly environments that streamline application building, deployment, and management, both on-premises and in cloud settings.
Technical Leadership in Design and Decision-Making: Your role involves analyzing diverse use cases and guiding the team in making informed design and technical choices that align with business objectives. You'll work closely with teams, ensuring all implementations support functionality, scalability, and security.
Design and deploy scalable, highly available, and fault-tolerant systems on cloud platforms (e.g., AWS, Azure, Google Cloud).
Implement and manage cloud services, including storage, compute, and security services.
Monitor and optimize cloud infrastructure performance.
Collaborate with development teams to integrate cloud solutions into existing workflows.
Ensure compliance with security policies and best practices.
Troubleshoot and resolve issues related to cloud infrastructure and services.
Your day-to-day
Cluster Migration & Automation: Migrate legacy systems from Mesos to Kubernetes, onboarding multiple availability zones in both public and private clouds. Develop automated reporting and monitoring solutions that enhance cluster hygiene and reduce maintenance overhead.
Improved System Reliability: Develop and implement SLAs for high-availability environments, meeting "dark green day" goals through consistent monitoring and proactive issue resolution.
Optimized Infrastructure: Build scalable and cost-effective cloud environments that meet business needs, reducing platform-related incidents and optimizing resource utilization through IaC and performance tuning.
Collaboration & Communication: Collaborate closely with cross-functional teams, including developers, QA, and product owners, to align infrastructure requirements with business objectives and accelerate delivery timelines.
What you need to bring
Proficiency in cloud services (AWS, Azure, Google Cloud).
Experience with infrastructure as code tools (e.g., Terraform, CloudFormation).
Strong scripting skills (e.g., Python, Bash).
Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes).
Understanding of networking concepts and security best practices.
Excellent problem-solving and analytical skills.
Hands-on experience with Ansible and Puppet.
Bachelor's degree in Computer Science, Information Technology, or a related field.
Relevant certifications (e.g., AWS Certified Solutions Architect, Azure Administrator Associate) are a plus.
4+ years of experience in cloud engineering or a related field.