Phonon Communications
Phonon Communications - Site Reliability Engineering Manager - Cloud Infrastructure (8-10 yrs)
posted 17d ago
Fixed timing
Job Title : Site Reliability Manager (AWS Cloud).
Company : Phonon Communications Private Limited.
Location : Vadodara, Gujarat, India.
About Us :
Phonon is a dynamic technology company specializing in communication and customer interaction solutions.
We are seeking a Site Reliability Manager (AWS Cloud) to lead our efforts in optimizing cloud infrastructure, ensuring system reliability, and implementing best practices for performance and security.
This leadership role offers the opportunity to shape our cloud infrastructure strategy while working closely with development, operations, and other cross-functional teams to ensure high availability, performance, and security of cloud services.
The role requires a strong background in cloud technologies, Linux systems, and automation to help the company meet its operational goals.
Key Responsibilities :
Cloud Infrastructure Design and Management :
- Design, implement, and maintain highly available and scalable cloud infrastructure using AWS best practices.
- Manage AWS services such as EC2, S3, RDS, Lambda, VPC, CloudFront, CloudTrail, ELB, Application Gateway, SNS, Route 53, and IAM for optimal performance and cost-efficiency.
- Ensure scalable and reliable cloud infrastructure to meet business needs.
Automation and Infrastructure as Code (IaC) :
- Develop and maintain automation scripts and tools using Terraform, CloudFormation, or AWS CLI.
- Automate deployment, monitoring, and management of cloud resources to streamline operations.
Collaboration and Application Resilience :
- Partner with development teams to design and deploy resilient, fault-tolerant applications on AWS.
- Implement and maintain CI/CD pipelines for seamless application deployment and testing.
Database and Application Management :
- Manage and maintain MongoDB, Metabase, Apache, and ActiveMQ for optimal performance and reliability.
- Integrate and monitor applications within the infrastructure for seamless operations.
System Performance and Optimization :
- Conduct in-depth system performance analysis and implement tuning strategies.
- Leverage Terraform for infrastructure provisioning and automation.
Log Management and Monitoring :
- Implement and oversee log management systems using tools like Grafana Loki.
- Analyze logs and performance metrics to identify and resolve issues proactively.
- Use monitoring tools such as Nagios to maintain system health and uptime.
Pipeline Management and Containerization :
- Manage and optimize pipelines for Kubernetes, containers, and Docker to ensure effective container orchestration and scalability.
Security and Compliance :
- Implement and enforce security best practices for AWS cloud environments, including IAM roles, encryption, and security groups.
- Collaborate with cross-functional teams to maintain the highest level of data security and compliance.
- Ensure compliance with standards such as ISO/IEC 27001.
Continuous Improvement and Innovation :
- Evaluate and adopt new technologies to improve cloud infrastructure reliability and efficiency.
- Document system designs, configurations, and procedures for knowledge sharing and troubleshooting.
Candidate Profile :
Education and Certifications :
- Bachelor's degree in Computer Science, Information Technology, or a related field.
Preferred certifications :
- RHCE (Red Hat Certified Engineer) or RHCSA (Red Hat Certified System Administrator).
- AWS Certified Solutions Architect.
Experience :
- 8-10 years of experience in AWS cloud infrastructure and Site Reliability roles.
Technical Skills :
- Expertise in AWS services such as EC2, S3, RDS, Lambda, VPC, CloudFront, and IAM.
- Strong knowledge of Linux/Unix system administration and scripting.
- Proficiency with MongoDB and Metabase, the Apache web server, and the ActiveMQ message broker.
- Hands-on experience with Terraform, Kubernetes, Docker, and Git.
- Advanced skills in system performance analysis and optimization.
Cloud Security Knowledge :
- Deep understanding of cloud security principles and best practices to safeguard infrastructure and data.
Functional Areas: Other