4 IOWeb3 Jobs
Senior Site Reliability Engineer - Cloud Infrastructure (6-8 yrs)
IOWeb3
posted 10hr ago
Key skills for the job
Job Summary :
We are seeking a highly experienced Senior Site Reliability Engineer (SRE) to join our team in Bengaluru.
In this role, you will be responsible for designing, developing, and deploying scalable solutions using cloud, containerization, and microservices technologies.
You will collaborate with cross-functional teams to ensure the reliability, security, and performance of our infrastructure and applications.
The ideal candidate will have 6+ years of experience in SRE, a strong understanding of Go or Python, and expertise in cloud and container technologies.
Key Responsibilities :
Scalable Solution Design and Development :
- Design and develop scalable, modular solutions for diverse product suites.
- Establish and implement best practices in modern software architecture, including Microservices, Serverless, and API-first approaches.
Cloud Infrastructure Management :
- Manage and optimize cloud infrastructure on AWS, Azure, or GCP.
- Ensure robust, secure, and compliant cloud environments.
Containerization and Orchestration :
- Drive containerization strategies using Docker and Kubernetes.
- Implement and manage Kubernetes clusters for application deployment and scaling.
CI/CD Pipeline Management :
- Design, implement, and maintain CI/CD pipelines using tools such as GitHub Actions, ArgoCD, Harness.io, and GitLab CI.
- Automate deployment processes to ensure fast and reliable releases.
Reliability and Performance Monitoring :
- Implement monitoring and alerting systems to ensure high availability and performance of applications.
- Troubleshoot and resolve complex technical issues.
Collaboration and Communication :
- Collaborate with development, QA, and operations teams to integrate user feedback and improve processes.
- Communicate effectively with stakeholders and provide technical guidance.
- Proactively find areas of improvement.
Must Have Skills and Experience :
- 6+ years of experience in Site Reliability Engineering (SRE).
- Strong understanding of Go or Python programming languages.
- Expertise in cloud services (AWS, Azure, GCP).
- Proficiency in container technologies (Docker, Kubernetes).
- Experience with CI/CD tools (GitHub Actions, ArgoCD, Harness.io, GitLab CI).
- Strong problem-solving and organizational skills.
- Excellent communication and collaboration abilities.
Nice to Have Skills and Experience :
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
Certifications : CKAD, CKS, and/or CKA (highly preferred).
Technical Skills Required :
- Site Reliability Engineering : 6+ years experience
- Programming Languages : Go, Python
- Cloud Platforms : AWS, Azure, GCP
- Container Technologies : Docker, Kubernetes
- CI/CD Tools : GitHub Actions, ArgoCD, Harness.io, GitLab CI
- Certifications (Preferred) : CKAD, CKS, CKA
Benefits :
- Opportunity to work with cutting-edge technologies.
- Hybrid work environment.
- Competitive contract rate.
- Extendable contract
Functional Areas: Software/Testing/Networking
Read full job description