i
UKG
86 UKG Jobs
Senior Site Reliability Engineer
UKG
posted 16d ago
Flexible timing
Key skills for the job
Site Reliability Engineers at UKG are critical team members that have a breadth of knowledge encompassing all aspects of service delivery. They develop software solutions to enhance, harden and support our service delivery processes. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance analysis, monitoring, alerting, chaos engineering and auto remediation.
Site Reliability Engineers must be passionate about learning and evolving with current technology trends. They strive to innovate and are relentless in pursuing a flawless customer experience. They have an automate everything mindset, helping us bring value to our customers by deploying services with incredible speed, consistency, and availability.
Job Responsibilities:
Engage in and improve the lifecycle of services from conception to EOL, including system design consulting, and capacity planning
Define and implement standards and best practices related to: System Architecture, Service
delivery, metrics and the automation of operational tasks
Support services, product & engineering teams by providing common tooling and frameworks to deliver increased availability and improved incident response
Improve system performance, application delivery and efficiency through automation, process
refinement, postmortem reviews, and in-depth configuration analysis
Collaborate closely with engineering professionals within the organization to deliver reliable
services Increase operational efficiency, effectiveness, and quality of services by treating operational challenges as a software engineering problem (reduce toil)
Guide junior team members and serve as a champion for Site Reliability Engineering
Actively participate in incident response, including on-call responsibilities
Required Qualifications
Engineering degree, or a related technical discipline, or equivalent work experience
Experience coding in higher-level languages (e.g., Python, JavaScript, C++, or Java)
Knowledge of Cloud based applications & Containerization Technologies
Demonstrated understanding of best practices in metric generation and collection, log aggregation pipelines, time-series databases, and distributed tracing
Demonstrable fundamentals in 2 of the following: Computer Science, Cloud architecture, Security, or Network Design fundamentals Demonstrable fundamentals in 2 of the following: Computer Science, Cloud architecture, Security, or Network Design fundamentals
(Experience, Education, Certification, License and Training)
Must have at least 3 years of hands-on experience working in Engineering or Cloud
Minimum 2 years' experience with public cloud platforms (e.g. GCP, AWS, Azure)
Minimum 2 years' Experience in configuration and maintenance of applications and/or
systems infrastructure for large scale customer facing company
Employment Type: Full Time, Permanent
Read full job descriptionPrepare for Senior Site Reliability Engineer roles with real interview advice
The company culture is good, benefits are good.
There is a constant fear of getting fired among everyone irrespective of your performance.