We are seeking an experienced MLOps Engineer to join our team and help design, implement, and maintain scalable machine learning infrastructure. In this role, you will bridge the gap between data scientists and production environments, ensuring seamless integration, robust pipelines, and reliable operations of machine learning models in production.
Typical work week look like:-. Model Deployment:Build and maintain scalable pipelines for deploying machine learning models to production environments. Automation:Automate machine learning workflows, including model training, testing, monitoring, and retraining. Infrastructure Management:Design, build, and maintain cloud and/or on-premises infrastructure to support ML operations. Collaboration:Work closely with data scientists, data engineers, and software engineers to improve the overall lifecycle of machine learning models. Monitoring and Optimization:Establish monitoring systems to ensure model performance, accuracy, and efficiency, and address performance degradation or anomalies. Version Control:Implement versioning for datasets, models, and code, ensuring traceability and reproducibility of machine learning experiments. Data Pipelines:Build and maintain robust data pipelines for training and inference workflows. Security:Ensure ML pipelines and deployments adhere to security and compliance standards. Our ideal candidate should have:-. 3-5 Years of work experience. Experience in distributed computing and parallelization techniques for ML workloads. Knowledge of optimization techniques for large-scale data pipelines and LLMs. Understanding of DevOps principles applied to machine learning. Exposure to tools such as SageMaker, Amazon Bedrock, Azure , or Vertex AI. Technical Skills. Programming Languages:Strong proficiency in Python, with experience in ML libraries such as TensorFlow, PyTorch and CUDA. Infrastructure:Familiarity with Docker, Kubernetes, and cloud services (AWS, GCP, or Azure). CI/CD:Experience with CI/CD tools for ML pipelines (e.g., GitLab CI, Jenkins, or Airflow). Version Control:Hands-on experience with versioning tools like Git and ML-specific tools such as MLflow or DVC. Monitoring and Logging:Proficient in tools like Prometheus, Grafana, and ELK stack. Soft Skills. Strong analytical and problem-solving ability. Excellent communication skills to collaborate across cross-functional teams. Adaptability to work in a fast-paced, dynamic environment. What you can expect from ORI. Opportunity to work on cutting-edge AI/ML projects. A collaborative, innovation-driven work environment. Passion & happiness in the workplace with great people & open culture with amazing growth opportunities. An ecosystem where leadership is fostered which builds an environment where everyone is free to take necessary actions to learn from real experiences. Freedom to pursue your ideas and tinker. Show more Show less