11 MARICI Solutions Jobs
Site Reliability Engineering - DevOps (5-7 yrs)
MARICI Solutions
posted 16hr ago
Key skills for the job
Responsibilities :
- Design, build, and maintain highly available and scalable infrastructure on AWS.
- Implement and manage CI/CD pipelines using Jenkins, Git/Bitbucket, and other relevant tools.
- Automate infrastructure provisioning and management using Terraform and Ansible.
- Monitor system performance and proactively identify and resolve issues using Grafana, Instana, Prometheus, and ELK stack.
- Troubleshoot and resolve production issues quickly and effectively.
- Participate in on-call rotations and provide 24/7 support for critical systems.
- Collaborate with software engineers to improve the reliability and performance of our applications.
- Contribute to the development and improvement of SRE best practices and processes.
- Stay up-to-date with the latest technologies and trends in the field of Site Reliability Engineering.
Required Skills :
- Strong understanding of DevOps principles and practices.
- Extensive experience with AWS services (EC2, VPC, S3, RDS, Lambda).
- Experience with Elasticsearch, Logstash, and Kibana for log management and analysis.
- Experience with configuring and managing NGINX for load balancing and reverse proxying.
- Experience with OpenSearch for search and analytics.
- Proficiency in using Terraform for infrastructure-as-code.
- Proficiency in using Ansible for automation and configuration management.
- Experience with building and managing Jenkins pipelines for continuous integration and continuous delivery.
- Proficiency in using Git/Bitbucket for source code management.
- Experience with Docker and containerization technologies.
- Experience with Kubernetes for container orchestration and management.
- Experience with Grafana for creating dashboards and monitoring system metrics.
- Experience with Instana for application performance monitoring.
- Experience with Prometheus for monitoring and alerting.
- Basics: Solid understanding of HTML, CSS, and JavaScript.
- Frontend: Experience with React, Angular, and TypeScript.
- Backend: Experience with Java, Java Spring Boot, Python, Node.js, and Redis.
- Experience with PostgreSQL.
- Experience with MongoDB and Elasticsearch.
- Message Queue: Experience with Kafka.
- Understanding of CDNs (i.e., Akamai).
- Networking: Strong understanding of networking concepts (TCP/IP, DNS, etc.
- Understanding of security best practices and concepts.
- Experience with API Gateways (i.e., Kong).
- Experience with code quality tools (i.e., Checkmarx, Sonar).
- Tools : Proficiency in using Jira, Confluence, Teams, ServiceNow, Lenses, Adobe Analytics, Amplitude, and Quantum Metrics.
Experience & Exposure :
- Strong awareness and experience of SRE principles and practices.
- Experience with building monitoring into the code.
- Experience with Agile software development and ITIL processes.
- Experience with building/working with REST APIs, API Integration, Microservices, Micro-frontends, and Web Services.
- Experience with SSL management.
- Experience with 24/7 high availability production environments.
- Experience with incident resolution and problem management.
Domain Knowledge :
- Preferred experience in the Retail and eCommerce domain.
Soft Skills :
- Excellent interpersonal and communication skills.
- Proficient spoken and written command of English.
- Strong teamwork and collaboration skills.
- A strong desire to learn and implement new technologies
Functional Areas: Other
Read full job descriptionPrepare for Site Reliability Engineer roles with real interview advice