Site Reliability Engineer - Applications Support (5-7 yrs)
Neerinfo Solutions
posted 9d ago
Flexible timing
Job Description :
We are seeking a talented Site Reliability Engineer (SRE) to join our team.
The ideal candidate will have a strong background in application support, microservices architecture, and automation, with hands-on experience in monitoring and dashboarding tools like Splunk and AppDynamics.
This role requires a proactive professional with a knack for solving complex production issues and driving continuous improvement in system reliability and performance.
Key Responsibilities :
Monitoring and Observability :
- Develop and manage dashboards on Splunk and AppDynamics for system performance and health monitoring.
- Monitor and analyze application and system performance to identify areas for improvement.
Automation and Configuration Management :
- Create and manage automation scripts using Ansible to streamline operational processes.
- Automate infrastructure provisioning and deployment tasks using Docker, Kubernetes, and OpenShift.
Application and System Support :
- Provide application support and resolve issues for production systems, ensuring minimal downtime.
- Troubleshoot microservices-based architectures and ensure their smooth operation.
- Collaborate with development and operations teams to enhance system reliability.
Database Management :
- Support and manage NoSQL databases like MongoDB in production environments.
- Optimize database performance and resolve operational bottlenecks.
Cloud Engineering :
- Use cloud platforms and tools such as Kubernetes, Docker, and APIGEE for deployment and scaling.
- Build and maintain cloud-based infrastructure for high availability and performance.
Front-End Development :
- Develop interactive web applications using AngularJS and Node.js as needed for operational tools and dashboards.
Continuous Improvement :
- Identify opportunities to improve system performance, reliability, and scalability.
- Implement best practices in site reliability engineering to ensure consistent system uptime and performance.
Required Skills and Qualifications :
Technical Skills :
- Monitoring Tools : Proficiency in Splunk and AppDynamics for dashboarding and performance monitoring.
- Automation Tools : Hands-on experience with Ansible for scripting and configuration management.
- Microservices : Strong understanding of microservices architecture and its operational nuances.
- Databases : Experience with NoSQL databases such as MongoDB.
- Cloud Technologies : Expertise in cloud engineering, including tools like Docker, Kubernetes, APIGEE, and OpenShift.
- Programming : Experience in AngularJS and Node.js for web app development.
Experience :
- Application Support : 3 years of experience in application support or site reliability engineering.
- Production Support : Proven experience in managing production systems and ensuring high availability.
Soft Skills :
- Strong problem-solving and analytical abilities.
- Excellent communication and collaboration skills to work effectively with cross-functional teams.
- Ability to work in a fast-paced, dynamic environment.
Preferred Qualifications :
- Experience with CI/CD pipelines and DevOps tools like Jenkins or GitLab.
- Knowledge of infrastructure-as-code (IaC) tools like Terraform.
- Familiarity with Agile/Scrum methodologies.
Functional Areas: Software/Testing/Networking