2 Razorthink Jobs
3-7 years
Pune, Chennai, Bangalore / Bengaluru
AI / ML Developer ( LLMs & Prompt Engineering ) – Python & AI
Razorthink
posted 10d ago
Flexible timing
Key skills for the job
About the Role:
We are looking for a hands-on AI/ML Developer with experience in Large Language Models (LLMs), Prompt Engineering, and AI model integration. The ideal candidate should have practical experience working with AI models, fine-tuning them, optimizing prompts, and integrating them into real-world applications.
This role is perfect for someone who has already worked on AI-driven applications and wants to expand their expertise by researching and implementing new AI advancements. You will have the opportunity to experiment with different LLM architectures, improve AI model efficiency, and contribute to AI-driven solutions.
Key Responsibilities:
LLM Development & Implementation:
Work hands-on with LLMs like GPT, LLaMA, Mistral, Claude, and Gemini.
Fine-tune models using Hugging Face Transformers, PyTorch, or TensorFlow.
Train, optimize, and deploy LLMs for tasks like text generation, summarization, and chatbots.
Prompt Engineering & Optimization:
Design, test, and optimize prompts for various AI tasks.
Apply Few-shot, Zero-shot, Chain of Thought (CoT), and ReAct prompting.
Improve AI model accuracy by iterating and refining prompt strategies.
AI Model Integration & Deployment:
Develop Python-based applications that interact with LLM APIs.
Build and deploy AI models using FastAPI or Flask.
Work with vector databases (FAISS, ChromaDB, Pinecone) for efficient retrieval.
Deploy AI models on cloud platforms (AWS, Azure, GCP) using Docker and Kubernetes.
Data Processing & NLP:
Preprocess and clean large-scale text datasets.
Work with text embeddings, named entity recognition (NER), and knowledge retrieval.
Implement vector search techniques for AI-enhanced applications.
AI Research & Experimentation:
Stay up to date with the latest LLM advancements and AI research papers.
Implement new AI techniques into existing workflows.
Optimize models using quantization, vLLM, and low-rank adaptation (LoRA).
Required Skills & Experience:
Must-Have Hands-on Experience:
Python programming with AI/ML frameworks (Hugging Face, PyTorch, TensorFlow).
Hands-on experience working with LLMs and fine-tuning.
Experience in prompt engineering and optimizing AI model outputs.
Building APIs with FastAPI or Flask for AI model integration.
Familiarity with vector databases and embedding models.
Nice to Have (or Learn on the Job):
Experience with LangChain, LlamaIndex, or Retrieval-Augmented Generation (RAG).
Knowledge of quantization techniques (LoRA, GPTQ, vLLM, ONNX) for efficient model deployment.
Experience working with knowledge graphs and reasoning-based AI.
Background in MLOps for tracking and managing AI models.
Location-Remote,Delhi NCR, Bengaluru,Chennai,Pune,Kolkata,Ahmedabad,Mumbai, Hyderabad
Employment Type: Full Time, Permanent
Read full job description3-7 Yrs
Pune, Chennai, Bangalore / Bengaluru