59 Natobotics Jobs
SRE , Site Reliable Engineer
Natobotics
posted 1hr ago
Fixed timing
Key skills for the job
Job Summary
SRE Production Support Engineer will be responsible for ensuring the smooth operations and performance of our production systems.
Monitor system performance, respond to incidents, and ensure minimal downtime.
Develop/maintain automation scripts to streamline operations and improve system reliability
This role requires expertise in troubleshooting and resolving complex technical issues to minimize downtime and ensure customer satisfaction.
Monitor and maintain the availability and performance of production systems.
Respond promptly to production incidents and provide quick resolutions.
Collaborate with development teams to identify and address root causes of recurring issues.
Implement proactive measures to prevent system failures and optimize performance.
Conduct regular system health checks and performance tuning.
Develop and maintain documentation of system configurations, processes, and troubleshooting guides.
Provide on-call support as required.
Bachelors degree in Computer Science or equivalent.
Minimum of 3 years of experience as a Production Support Engineer or similar role in a fast-paced, high-availability environment.
Monitoring and Incident Response: (Mandatory)
Proven experience in incident management and troubleshooting
Proficiency in monitoring tools (e.g., Prometheus, Datadog)
Exposure to AWS systems
Scripting and Automation:
Strong scripting skills (Python, Bash)
Experience with automation tools (Ansible, Terraform)
CI/CD Pipelines:
Exposure and understanding CI/CD pipelines (Jenkins, GitLab CI/CD) and deployment automation
Employment Type: Full Time, Permanent
Read full job description