we're hiring a skilled and proactive Data Scientist - Gen AI (2-6 yrs) to join our remote team in building AI copilots, chatbots, and intelligent agent systems using the latest in LLMs, document AI, and NLP. This role places a strong emphasis on the design, development, and orchestration of agents and multi-agent systems, which are critical to our AI architecture and product strategy.
you'll play a central role in developing POCs and scalable, real-time production applications powe'red by collaborative agents that drive intelligent automation.
Key Responsibilities:
Build advanced GenAI applications using LLMs, Advanced RAG, and especially agent-based and multi-agent architectures (eg, LangGraph, LangChain).
Design, orchestrate, and scale agent workflows that coordinate tasks across copilots, document processing, and real-time systems.
Plug, play, test, and integrate Hugging Face models (NLP, OCR, NER, etc) into modular, extensible agent pipelines.
Work with OCR, NER, and document extraction processing pipelines for intelligent document understanding.
Build intelligent copilots, chatbots, and backend logic using Python, FastAPI, async programming, websockets, and parallel processing.
Deploy and manage agent-based applications using Docker, Azure, and modern CI/CD pipelines.
Implement and manage vector databases, semantic search, and retrieval workflows for high-quality contextual responses.
Conduct prompt engineering, LLM/RAG/Agent evaluation, and continuous system improvement.
Collaborate with cross-functional teams - product, engineering, QA - to translate ideas into production-ready, agent-enabled tools.
Build POCs, internal tools, and full-fledged production applications based on multi-agent designs.
Stay updated with cutting-edge research papers and trends in LLMs, agentic workflows, and GenAI.
Required Skills Experience:
2-6 years of experience in Python, NLP, machine learning, and transformers.
Strong hands-on experience with LangChain, LangGraph, agent orchestration, multi-agent system design, and retrieval-augmented generation.
Proven experience working with agents and multi-agent collaboration patterns in real-time applications.
Experience with OCR, NER, document extraction, and automated document workflows.
Proficiency with Hugging Face Transformers and model testing/integration.
Hands-on experience deploying scalable applications using Docker, Azure, and CI/CD pipelines.
Experience working with FastAPI, asyncio, and websockets for building real-time, responsive interfaces.
Familiarity with rapid prototyping tools like Streamlit or frontend stacks like React.
Strong problem-solving mindset with excellent debugging and optimization skills.
Comfortable working with both structured and unstructured databases.
Knowledge of LLM fine-tuning, semantic caching, and memory-enhanced agents.
Excellent communication and planning skills; comfortable working across cross-functional teams.