XenonStack
Site Reliability Engineer (SRE) - AWS
posted 1y ago
Fixed timing
As we expand our customer deployments, the XenonStack SRE team is seeking an experienced SRE to deliver insights from massive-scale data in real time. We are looking for someone who brings fresh ideas, demonstrates a unique and informed viewpoint, and enjoys collaborating with a cross-functional team to develop real-world solutions and positive user experiences at every interaction.
Job Responsibilities
Run the production environment by monitoring availability and taking a holistic view of system health
Build software and systems to manage platform infrastructure and applications
Improve reliability, quality, and time-to-market of our suite of software solutions
Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
Provide primary operational support and engineering for multiple large distributed software applications
Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
Partner with development teams to improve services through rigorous testing and release procedures
Participate in system design consulting, platform management, and capacity planning
Create sustainable systems and services through automation and uplifts
Balance feature development speed and reliability with well-defined service level objectives
Requirements
Ability to program (structured and OO) in one or more high-level languages, such as Python, Java, C/C++, Ruby, and JavaScript
Experience with distributed storage technologies such as NFS, HDFS, Ceph, and S3, as well as dynamic resource management frameworks (Mesos, Kubernetes, YARN)
A proactive approach to spotting problems, areas for improvement, and performance bottlenecks
Strong hands-on experience with Linux or Windows administration
Coding experience beyond simple scripts
Strong skills in release engineering and continuous delivery
Excellent communication skills
Attention to detail
Analytical mind and problem-solving aptitude
Strong organizational skills
Visual thinking
Educational Qualification: Bachelor's degree in computer science or another highly technical, scientific discipline
Employment Type: Full Time, Permanent
Experience: 2-3 years