Fulcrum Digital - System Reliability Engineer (3-5 yrs)
Fulcrum Digital
posted 23d ago
Flexible timing
Key Responsibilities:
- Design, implement, and maintain scalable and reliable systems.
- Identify and address system bottlenecks and performance issues.
- Implement strategies to improve system uptime and reduce MTTR.
- Conduct root cause analysis of system failures and implement corrective actions.
- Develop and maintain comprehensive monitoring solutions to proactively identify and resolve issues.
- Create and manage alerts and notifications for critical system events.
- Implement automated response mechanisms to minimize downtime.
- Lead incident response efforts, coordinating with various teams to quickly resolve issues.
- Conduct post-incident reviews to identify lessons learned and implement preventive measures.
- Forecast future system capacity needs and proactively scale infrastructure as required.
- Optimize resource utilization to maximize system efficiency.
- Develop and implement automation tools and scripts to streamline operations and reduce manual effort.
- Automate routine tasks to improve efficiency and reduce human error.
- Collaborate closely with development, operations, and infrastructure teams to ensure smooth system operations.
Required Skills and Experience:
- Strong understanding of system architecture and design principles.
- Proficiency in scripting languages (Python, Bash) and automation tools (Ansible, Puppet, Chef).
- Experience with cloud platforms (AWS, GCP, Azure).
- In-depth knowledge of monitoring tools (Prometheus, Grafana) and alerting systems.
- Experience with containerization technologies (Docker, Kubernetes).
- Strong problem-solving and troubleshooting skills.
- Excellent communication and collaboration skills.
- Experience with CI/CD pipelines and DevOps practices.
- Understanding of networking concepts (TCP/IP, DNS, load balancing).
- Experience with database technologies (MySQL, PostgreSQL).
Functional Areas: Other