4 LearnDesk Jobs

Machine Learning Engineer - Inference & Fine-Tuning

1-3 years

₹ 11 - 15L/yr

New Delhi

1 vacancy

LearnDesk

posted 18d ago

Job Description

Get hired by a US-based company focused on the US, UK, and European markets. The role includes travel to the US office in California.


What we're looking for

We are looking for a talented Machine Learning Engineer to lead the development of an Inference Service built on open-source technologies. You will design, deploy, and fine-tune machine learning models in the cloud using frameworks like Hugging Face, TensorFlow, PyTorch, and other open-source tools. The ideal candidate has hands-on experience building scalable inference services, fine-tuning pre-trained models, and working with cloud-native infrastructure.

In this role, you'll work on exciting machine learning applications spanning NLP, computer vision, and custom tasks, all with a focus on efficient deployment and real-time inference. If you're passionate about open-source technologies and cloud-based ML infrastructure, this is the perfect role for you!


Responsibilities

  • Build and optimize scalable inference pipelines using popular open-source frameworks (e.g. TensorFlow Serving, ONNX).
  • Design real-time API endpoints for model serving and integration using frameworks like FastAPI or Flask.
  • Implement and optimize batch processing and streaming data pipelines to handle large-scale workloads.
  • Fine-tune pre-trained models (e.g. Llama, GPT, YOLO) using open-source tools like Ray Tune and Hugging Face Trainer.
  • Deploy inference services on cloud platforms (GCP, AWS, Azure) using a containerized environment with Docker and Kubernetes.
  • Leverage cloud-native tools like Kubernetes and Kubeflow to manage and scale model deployment.
  • Implement robust monitoring and logging systems using Prometheus, Grafana, or ELK stack.
  • Continuously optimize models for latency and throughput, troubleshooting bottlenecks.

Qualifications / Required Skills and Experience

  • Educational Background:
    • Bachelor’s or Master’s degree in Computer Science, Engineering, Data Science, or a related field, or equivalent practical experience.
  • Experience:
    • 3+ years of hands-on experience deploying machine learning models into production, with a focus on inference services and fine-tuning.
    • Proficiency in Python or C/C++.
    • Proven track record in creating high-performance libraries and tools.
    • Proficiency in model serving frameworks such as TensorFlow Serving, TorchServe, and ONNX Runtime.
    • Familiarity with hyperparameter optimization libraries like Ray Tune, Optuna, and Hugging Face Trainer.
    • Experience with containerization tools (Docker) and orchestration platforms (Kubernetes, Helm).
    • Strong grasp of low-level OS concepts, including multi-threading, memory management, networking, storage, performance, and scalability.
    • Preferred: Knowledge of AI inference techniques like speculative decoding.
    • Preferred: Experience with CUDA/Triton programming, Rust, Cython, and compiler technologies.
  • Soft Skills:
    • Excellent communication skills with the ability to explain complex ML concepts to both technical and non-technical stakeholders.
    • Strong problem-solving and debugging skills, with the ability to analyze and resolve performance issues in production environments.
    • Comfortable working in an Agile environment and collaborating across teams to deliver results.

Employment Type: Full Time, Permanent

What people at LearnDesk are saying

What LearnDesk employees are saying about work life

Night Shift: 100% (based on 1 employee)

LearnDesk Benefits

Free Transport
Child care
Gymnasium
Cafeteria
Work From Home
Free Food +6 more

Similar Jobs for you

Machine Learning Engineer at Turing

Remote

3-7 Yrs

₹ 5-9 LPA

Machine Learning Engineer at PURECODE SOFTWARE R

Hyderabad / Secunderabad

3-8 Yrs

₹ 8-14 LPA

Machine Learning Engineer at Glance IT Solution

Bangalore / Bengaluru

3-5 Yrs

₹ 3-12 LPA

Machine Learning Engineer at Starkenn Technologies

3-6 Yrs

₹ 15-25 LPA

Machine Learning Engineer at RS Consultants

3-7 Yrs

₹ 12-25 LPA

Machine Learning Engineer at Precision AQ

Remote

2-5 Yrs

₹ 7-11 LPA

Machine Learning Engineer at Ezeiatech Systems Private Limited

Gurgaon / Gurugram

1-5 Yrs

₹ 8-14 LPA

Machine Learning Engineer - Inference & Fine-Tuning

1-3 Yrs

₹ 11 - 15L/yr

New Delhi

18d ago·via naukri.com

Senior IC Package Designer

3-5 Yrs

New Delhi

2d ago·via naukri.com

Lead Analyst - Creative Marketing

1-5 Yrs

₹ 8 - 18L/yr

Delhi/Ncr

9d ago·via naukri.com

Full Stack Developer

2-5 Yrs

Delhi/Ncr

10d ago·via naukri.com