i
Uplers
103 Uplers Jobs
Senior Site Reliability Engineer
Uplers
posted 20hr ago
Flexible timing
Key skills for the job
Senior Site Reliability Engineer
Experience: 4+ years
Salary : Competitive
Preferred Notice Period: Within 15 Days
Shift: 3:00AM to 12:00PM IST
Opportunity Type: Remote
Placement Type: Contractual
Contract Duration: Full-Time, Indefinite Period
(*Note: This is a requirement for one of Uplers' Partners)
What do you need for this opportunity
Must have skills required :
Python, SQL, Apache Spark, BigQuery
Good to have skills :
NoSql, Avro, Cloud Server (Google / AWS), Parquet
Our Hiring Partner is Looking for:
Senior Site Reliability Engineer who is passionate about their work, eager to learn and grow, and who is committed to delivering exceptional results. If you are a team player, with a positive attitude and a desire to make a difference, then we want to hear from you.
Role Overview Description
Senior Site Reliability Engineer
The Senior Site Reliability Engineer (SRE) plays a critical role in bridging the gap between software development and operations, ensuring systems are scalable, reliable, and efficient. This role focuses on reducing manual work through automation, improving system resilience, and delivering seamless services to our customers. Monitoring, security, and compliance are key aspects of this role, as you will proactively address potential risks while ensuring the systems meet the required operational and regulatory standards. As an offshore SRE, you will collaborate across global teams to optimize operations, implement effective monitoring solutions, and drive innovation in reliability engineering. The role will also include supporting cloud environments, primarily on AWS, and managing Apigee API Gateway on Google Cloud Offshore
Responsibilities:
â Ensure Reliability and Performance: Maintain high availability and performance of production systems and applications to meet SLA commitments.
â Proactive Monitoring: Develop, implement, and enhance monitoring solutions, dashboards, and alerts to detect and address issues before they impact users.
â Automate Operational Tasks: Design and maintain tools and scripts to automate repetitive tasks, deployments, and incident response processes.
â Improve System Resilience: Collaborate with cross-functional teams to identify and resolve potential bottlenecks and single points of failure.
â Cloud and API Management: Support and optimize cloud infrastructure on AWS, including EC2, S3, RDS, Lambda, and networking. Manage and enhance Apigee API Gateway for seamless API performance and integration.
â Support PHP, Node.js, Serverless Framework Applications: Monitor, troubleshoot, and optimize PHP, Node.js, Serverless
â Framework-based applications for reliability and scalability.
â Linux Systems Administration: Provide high-level Linux support, including advanced troubleshooting, performance tuning, and system optimization for reliability and scalability.
â Security Implementation: Collaborate with security teams to ensure applications and systems are configured to meet security and compliance requirements.
â Compliance Adherence: Monitor, document, and enforce compliance with industry standards, regulations, and company policies, such as ISO 27001, SOC 2, or GDPR.
â Promote Infrastructure Scalability: Use Infrastructure as Code (IaC) tools like Terraform or CloudFormation to manage, scale, and improve infrastructure.
â Collaborate for Continuous Improvement: Partner with development, operations, and security teams to embed SRE best practices and enhance operational efficiency.
â Optimize Performance: Analyze system performance metrics to implement tuning measures for improved application and database efficiency.
â Plan for Capacity Growth: Monitor infrastructure trends and forecast requirements to ensure systems scale with business growth.
â Document Processes: Maintain clear, up-to-date documentation for systems, tools, and procedures to facilitate knowledge sharing and team alignment.
â System Uptime: Achieve and maintain 99.9% uptime for critical applications and infrastructure. Incident Resolution: Ensure incidents are resolved within defined SLAs, with clear root cause analysis and follow-up actions.
â Automation Coverage: Increase automation of operational tasks by at least 30% annually, reducing manual intervention.
â Monitoring Efficiency: Implement proactive monitoring tools with minimal false positives, ensuring issues are flagged before customer impact.
â Application Support: Provide consistent, high-quality support for PHP and Node.js applications, reducing downtime and performance issues.
â Linux Optimization: Ensure Linux systems operate at peak efficiency, with documented performance tuning and troubleshooting standards.
â Security Compliance: Ensure all systems and processes adhere to defined security standards and compliance requirements.
Key Result Areas:
â Infrastructure Scalability: Support seamless scaling of infrastructure to meet 100% of projected growth requirements without major disruptions.
â Collaboration Impact: Actively contribute to cross-team initiatives, enhancing overall reliability and operational efficiency.
â Documentation Quality: Maintain 100% up-to-date and accurate system documentation to support operational excellence and knowledge sharing.
Skills & Qualifications:
â Education: Bachelor''s degree in Computer Science, IT, or a related field (or equivalent experience).
Experience:
â 3+ years in an SRE, DevOps, or equivalent role.
â Hands-on experience with AWS services (e.g., EC2, S3, RDS, Lambda, CloudWatch).
â Experience managing and optimizing APIs using Apigee and AWS API Gateway.
â Experience supporting PHP, Node.js and Serverless
â Framework-based applications in production environments.
Technical Skills:
â High proficiency in Linux systems administration, including troubleshooting, performance tuning, and system optimization.
â Proficiency in scripting/programming (e.g., Python, Bash).
â Strong expertise in monitoring tools (e.g., Cloudwatch, New Relic, Prometheus, Grafana).
â Knowledge of security and compliance frameworks (e.g., ISO 27001, SOC 2, GDPR).
â Experience with CI/CD pipelines and tools and gitflow strategy (e.g., AWS Codebuild, AWS Codepipeline, GitLab).
â Understanding of containerization and orchestration (e.g., Docker, Kubernetes).
â Familiarity with networking, security, and database management.
Good to have:
â Continuous learning mindset to stay updated on emerging technologies and trends
â Strong communication
â Enjoys problem solving skills, task automation and analysis
â Ability to work across time zones
â Analytical mindset with a focus on measuring and optimising operational processes
â Relevant certifications in cloud computing, devops or related fields (e.g., AWS Certified DevOps Engineer, AWS Certified Solutions
â Architect, Certified Kubernetes Administrator)
Engagement: Indefinite contract with Compare Club
Interview Process: 2 rounds
How to apply for this opportunity
About Our Hiring Partner:
Lifesight Is empowering decisions with Advanced Data Intelligence. Lifesight is a fast-growing SaaS company focused on helping businesses leverage data & AI to improve customer acquisition and retention. We have a team of 130 serving 300+ customers across 5 offices in the US, Singapore, India, Australia, and the UK.
About Uplers:
Our goal is to make hiring reliable, simple, and fast. Our role will be to help all our talents find and apply for relevant opportunities and progress in their career. We will support any grievances or challenges you may face during the engagement. You will also be assigned to a dedicated Talent Success Coach during the engagement.
(Note: There are many more opportunities apart from this on the portal. Depending on the assessments you clear, you can apply for them as well).
So, if you are ready for a new challenge, a great work environment, and an opportunity to take your career to the next level, don't hesitate to apply today. We are waiting for you!
Employment Type: Full Time, Permanent
Read full job descriptionPrepare for Senior Site Reliability Engineer roles with real interview advice
3-8 Yrs
₹ 15 - 22.5L/yr
Bangalore / Bengaluru
4-8 Yrs
₹ 25 - 35L/yr
Bangalore / Bengaluru