Razorthink

3.3

based on 5 Reviews

AI/ML Developer (LLMs & Prompt Engineering) – Python & AI

3-7 years

Pune, Chennai, Bangalore / Bengaluru

posted 10d ago

Job Role Insights

Flexible timing

Key skills for the job

Python AWS Azure DevOps GCP TensorFlow PyTorch

Job Description

About the Role:
We are looking for a hands-on AI/ML Developer with experience in Large Language Models (LLMs), Prompt Engineering, and AI model integration. The ideal candidate should have practical experience working with AI models, fine-tuning them, optimizing prompts, and integrating them into real-world applications.

This role is perfect for someone who has already worked on AI-driven applications and wants to expand their expertise by researching and implementing new AI advancements. You will have the opportunity to experiment with different LLM architectures, improve AI model efficiency, and contribute to AI-driven solutions.

Key Responsibilities:
LLM Development & Implementation:
- Work hands-on with LLMs like GPT, LLaMA, Mistral, Claude, and Gemini.
- Fine-tune models using Hugging Face Transformers, PyTorch, or TensorFlow.
- Train, optimize, and deploy LLMs for tasks like text generation, summarization, and chatbots.
Prompt Engineering & Optimization:
- Design, test, and optimize prompts for various AI tasks.
- Apply few-shot, zero-shot, chain-of-thought (CoT), and ReAct prompting.
- Improve AI model accuracy by iterating and refining prompt strategies.
AI Model Integration & Deployment:
- Develop Python-based applications that interact with LLM APIs.
- Build and deploy AI models using FastAPI or Flask.
- Work with vector databases (FAISS, ChromaDB, Pinecone) for efficient retrieval.
- Deploy AI models on cloud platforms (AWS, Azure, GCP) using Docker and Kubernetes.
Data Processing & NLP:
- Preprocess and clean large-scale text datasets.
- Work with text embeddings, named entity recognition (NER), and knowledge retrieval.
- Implement vector search techniques for AI-enhanced applications.
AI Research & Experimentation:
- Stay up to date with the latest LLM advancements and AI research papers.
- Implement new AI techniques into existing workflows.
- Optimize models using quantization, vLLM, and low-rank adaptation (LoRA).

Required Skills & Experience:
Must-Have Hands-on Experience:
- Python programming with AI/ML frameworks (Hugging Face, PyTorch, TensorFlow).
- Hands-on experience working with LLMs and fine-tuning.
- Experience in prompt engineering and optimizing AI model outputs.
- Building APIs with FastAPI or Flask for AI model integration.
- Familiarity with vector databases and embedding models.

Nice to Have (or Learn on the Job):
- Experience with LangChain, LlamaIndex, or Retrieval-Augmented Generation (RAG).
- Knowledge of model-efficiency techniques and tooling (LoRA, GPTQ, vLLM, ONNX) for efficient model deployment.
- Experience working with knowledge graphs and reasoning-based AI.
- Background in MLOps for tracking and managing AI models.

Location: Remote, Delhi NCR, Bengaluru, Chennai, Pune, Kolkata, Ahmedabad, Mumbai, Hyderabad

Employment Type: Full Time, Permanent
