i
UKG
79 UKG Jobs
Lead Site Reliability Engineer
UKG
posted 4d ago
Flexible timing
Key skills for the job
The duties of a Site Reliability Engineer will be to support and maintain various Cloud Infrastructure Technology Tools in our hosted production/DR environments. He/she will be the subject matter expert for specific tool(s) or monitoring solution(s). Will be responsible for testing, verifying and implementing upgrades, patches and implementations. He/She will also partner with the other service and/or service functions to investigate and/or improve monitoring solutions. May mentor one or more tools team members or provide training to other cross functional teams as required. May motivate, develop, and manage performance of individuals and teams while on shift. May be assigned to produces regular and adhoc management reports in a timely manner.
Proficient in Splunk/ELK, and Datadog.
Experience with observability tools such as Prometheus/InfluxDB, and Grafana.
Possesses strong knowledge of at least one scripting language such as Python, Bash, Powershell or any other relevant languages.
Design, develop, and maintain observability tools and infrastructure.
Collaborate with other teams to ensure observability best practices are followed.
Develop and maintain dashboards and alerts for monitoring system health.
Troubleshoot and resolve issues related to observability tools and infrastructure.
Bachelors Degree in information systems or Computer Science or related discipline with relevant experience of 5-8 years
Proficient in Splunk/ELK, and Datadog.
Experience with Enterprise Software Implementations for Large Scale Organizations
Exhibit extensive experience about the new technology trends prevalent in the market like SaaS, Cloud, Hosting Services and Application Management Service
Monitoring tools like : Grafana, Prometheus, Datadog,
Experience in deployment of application & infrastructure clusters within a Public Cloud environment utilizing a Cloud Management Platform
Professional and positive with outstanding customer-facing practices
Can-do attitude, willing to go the extra mile
Consistently follows-up and follows-through on delegated tasks and actions
Employment Type: Full Time, Permanent
Read full job descriptionPrepare for Site Reliability Engineer Lead roles with real interview advice