i
Apexon
11 Apexon Jobs
SRE Team Lead and Engineer / 5
Apexon
posted 26d ago
Flexible timing
Key skills for the job
Lead and mentor a team of SRE engineers, fostering a reliability, efficiency, and continuous improvement culture.
Develop and execute SRE strategies to enhance our systems and services' reliability, availability, and performance.
Designed and implemented observability and monitoring solutions using tools like New Relic, Azure Application Insights, AWS X-Ray, and other relevant technologies.
Establish and maintain alerting systems to proactively identify and address potential issues before they impact our customers.
Collaborate with cross-functional teams, including development, operations, and security, to build and maintain resilient and scalable systems.
Drive initiatives for automating operational processes, reducing manual interventions, and enhancing system performance.
Provide leadership in incident response, ensuring swift resolution of issues and effective post-incident reviews to prevent recurrence.
Stay current with industry trends and advancements in site reliability engineering, applying best practices to improve our operations continually.
Promote a data-driven approach to decision-making, leveraging observability data to identify opportunities for optimization and innovation.
Qualifications:
5-10 years of experience in site reliability engineering, infrastructure management, or a related field.
Proven experience in leading and mentoring engineering teams, with a focus on reliability and performance.
Strong strategic thinking skills, with the ability to develop and execute SRE strategies aligned with business goals.
Expertise in observability and monitoring tools such as New Relic, Azure Application Insights, AWS X-Ray, or similar platforms.
Experience with cloud platforms, particularly AWS and Azure, and a strong understanding of their services and capabilities.
Hands-on experience with alerting systems, incident management, and performance optimization.
Strong scripting and automation skills (e.g., Python, Bash, PowerShell) to automate and streamline operations.
Excellent problem-solving skills with a proactive and analytical approach to identifying and resolving issues.
Effective communication skills, with the ability to convey technical concepts to both technical and non-technical audiences.
Relevant certifications (such as AWS Certified Solutions Architect, Azure Administrator Associate, or similar) are a plus but not required.
Employment Type: Full Time, Permanent
Read full job descriptionPrepare for Team Lead roles with real interview advice
Getting recognition for the work we are doing
Perks and benefits could be improved
Read 9 reviews