UKG
Dir. Database Reliability Engineering
posted 10d ago
Flexible timing
Director Database Reliability Engineering
As a Director of Database Reliability Engineering, you will combine software and systems engineering to build and run robust, fault-tolerant database platforms that operate at scale. This pivotal role calls for transformational leadership: you will set the vision for running data platforms as a service and drive an automation-first and AI-first culture. You will be responsible for the end-to-end observability, availability, performance, and uptime of mission-critical data platforms.
Responsibilities:
This pivotal role will lead the DBRE and DBA team and be responsible for maintaining Data Platforms' reliability, performance, availability, and scalability.
Responsible for architecting, designing, and building platforms across various database technologies, including SQL (MySQL, PostgreSQL, Oracle), NoSQL (Cassandra, Astra), and SaaS DB instances (Cloud SQL, Astra).
Responsible for database maintenance, disaster recovery, and the implementation of techniques such as mirroring, replication, log shipping, and MS SQL Server/PostgreSQL clustering.
Drive the technical strategy and vision for major projects and initiatives, ensuring alignment with business goals and industry best practices.
Lead the team in driving further adoption of reliability practices such as chaos engineering, SLOs, error budgets, release safety, load testing, and disaster recovery strategies for data platforms.
Build teams through hiring and people growth while balancing your own workload through delegation. Define and review individual and team goals, fostering a culture of continuous improvement and innovation.
Stay current with emerging technologies and industry trends, advocating for their adoption where appropriate to drive innovation and productivity within the team (e.g., Database as a Service, AIOps, Copilot).
Collaborate across the organization with the broader functions, including but not limited to Security, Architecture, Operations, and Product Managers, to ensure successful delivery.
Develop and maintain comprehensive technical documentation for database systems. Continuously review and update existing best practices/SOPs.
Coach the organization on the principles of DBRE, including incident response, automation, observability improvements, toil reduction, self-healing, and root cause analysis.
Manage on-call rotations across the globe and implement a follow-the-sun model.
Minimum Qualifications:
Bachelor's or master's degree in engineering or a related technical field.
10+ years of experience as a production DBA or DBRE, with at least 5 years of management experience.
Hands-on experience and deep technical expertise in one or more of the following database platforms: SQL (MySQL, PostgreSQL, Oracle), NoSQL (Cassandra, Astra), and SaaS DB instances (Cloud SQL, Astra).
Hands-on experience with scripting and automation using Python, Go, PowerShell, Google Cloud Scripting, and Infrastructure as Code (IaC).
Experience with and knowledge of public cloud infrastructure such as Google Cloud, cloud-based applications, containerization, and microservices architecture.
Proficiency in building telemetry and observability implementations with tools such as Prometheus, Datadog, Splunk, SolarWinds, and Percona PMM.
Strong leadership and problem-solving skills, attention to detail, a focus on delivering high-quality solutions, and excellent communication and interpersonal skills, with the ability to influence and drive technical decisions across the organization.
Certification in one of the database platforms (Microsoft SQL Server, PostgreSQL, MySQL, Cassandra) is required; additional cloud certifications in GCP are preferred.
Employment Type: Full Time, Permanent