We are an Artificial Intelligence Services and Solutions company that focuses on applying Machine Learning, Deep Learning and Advanced Analytics to solve the problems of businesses.
Amnet Digital has highly experienced talent from world-leading institutions and technology companies.
We have successfully applied AI Technologies in Enterprise Software, Retail, eCommerce and Healthcare.
Our digital product engineering teams design and deploy enterprise solutions that are robust, secure and scalable.
Job Level: Mid-Senior level
Experience: 5 - 8 years
Location: Hyderabad, India
About the Role
Our team is looking for a DevOps Engineer to join us in our mission.
In this role, you will contribute to the design, implementation, and maintenance of our DevOps infrastructure and processes.
The primary responsibility will be to ensure that our solutions can be reliably deployed to various environments (testing, staging, production) with minimal friction.
The ideal candidate will have experience in both software development and systems administration and be well-versed in modern DevOps practices such as infrastructure as code, containerization, continuous integration, and automation.
We currently have a couple of dozen production products based on a variety of technologies and several development teams, so the ideal candidate is someone who likes variety, is great at solving tough problems, and works and communicates well with people across countries and time zones.
Your Key Responsibilities
Collaborate with development teams to ensure seamless integration of new features and services into our DevOps environment, and mentor/educate developers in DevOps related practices in their teams and products.
Monitor and troubleshoot production systems, and lead incident response efforts as needed.
Continuously improving the scalability, security, and reliability of our infrastructure.
Candidates with 6+ years of relevant experience in DevOps practices, cloud computing, containerization (eg, Docker, Kubernetes), version control systems (eg, Git).
Knowledge and hands-on experience with Cloud Infrastructure such as GCP, AWS, or Azure on Network, Security, IAM, and DNS related services.
Git Platforms such as Gitlab (Preferred), GitHub, or similar for enabling Continuous Delivery and Release Management using provided CI tools and Databases such as NoSQL or Graph DB, Linux system administration or bash scripting.
Strong hands-on experience building enterprise scalable Kubernetes infrastructure.
Ability to demonstrate knowledge of applying Observability to a network of clusters using open-source stack.
Capable of debugging production outages related to resources such as network connectivity, DNS resolution, IP shortage, low disk space, crashing pods to keep the systems within strict SLAs while being cost effective.
Strong understanding with containerization technologies such as Docker.
Ability to containerize machine learning applications and manage containerized deployments efficiently.
Strong hands-on experience in building Gitlab CICD pipelines.
Strong Python knowledge is essential to develop and publish PyPi packages in addition to writing API services.
Knowledge of monitoring and logging tools for tracking the performance, health, and reliability of machine learning models and infrastructure components.
Experience with tools like Prometheus, Grafana, ELK stack (Elasticsearch, Logstash, Kibana) is beneficial.