Oracle
Site Reliability Developer 3
posted 4d ago
Flexible timing
Key skills for the job
Responsible for the operation of production environments, including systems and databases, and for supporting critical business operations. Will perform administration and analysis for multiple production GPU environments and recommend novel solutions to improve availability, performance, and supportability. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. This is an opportunity to combine deep technical knowledge with administration and analysis knowledge of Oracle's Cloud Infrastructure to provide escalation support for a wide range of complex production environment problems related to immense growth and scaling, extremely high performance, and high availability requirements.
Career Level - IC3
Work with the Site Reliability Engineering (SRE) team on the shared full-stack ownership of GPU/AI environments. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Monitor, maintain, support, and optimize all production GPU server hardware and software. Provide escalated technical support for complex technical issues, which may include leading problem management cases and providing management status. Coordinate escalated support cases and lead appropriate internal technical resources and/or third-party vendors to resolution. Assist with server operating system and application upgrades, bug fixes, and patching; work on standardization projects for both hardware and software under the Oracle technology stack while providing consistent system uptime as expected in a Cloud environment. Provide on-call support on a rotating basis.
Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide development teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and dependencies required to troubleshoot issues and define mitigations. Understand and explain the effect of product architecture decisions on distributed systems. Bring professional curiosity and a desire to develop a deep understanding of services and technologies.
Employment Type: Full Time, Permanent
3-8 Yrs
₹ 13 - 23L/yr
Hyderabad / Secunderabad, Pune, Bangalore / Bengaluru
4-8 Yrs
Bangalore / Bengaluru
4-7 Yrs
₹ 15 - 20L/yr
Delhi/Ncr, Thiruvananthapuram
4-7 Yrs
₹ 15 - 20L/yr
Gandhinagar, Hyderabad / Secunderabad, Bangalore / Bengaluru