** Note: This position is with one of our hiring partners.
Seeking an AI Research Scientist who thrives on tackling high-level, abstract questions and navigating ambiguity in pursuit of breakthrough insights.
This role is ideal for someone with a strong research background in large language models (LLMs) and agent-based AI, whether through academic publications or direct industry contributions.
The ideal candidate is self-led, curious, and excels at framing hard-to-define problems, bringing an innovative and autonomous approach to driving research initiatives.
Responsibilities.
Conduct pioneering research in generative AI within a dynamic and fast-moving environment.
Develop novel agent architectures aimed at optimizing end-to-end AI workflow performance.
Design, execute, and analyze experiments to enhance large language models (LLMs) across a range of benchmarks.
Handle data engineering tasks to prepare data for LLM pre-training, fine-tuning, and retrieval-augmented generation (RAG) processes.
Integrate and evaluate models with multi-modal capabilities tailored to various industry sectors.
Stay informed on the latest in generative AI and knowledge retrieval research by reviewing conference publications.
Reproduce, evaluate, and implement theoretical approaches from research papers on data curation, fine-tuning, and agent architecture within practical applications.
Configure and maintain fine-tuning and evaluation pipelines on AWS, GCP, and other cloud providers.
Oversee AI workload resources and monitor experiment metrics, systematically tracking results and performance.
Required Skills.
Minimum of 3 years of practical experience in generative AI research or engineering, which may include industry work or academic research during a Master's or Ph.D. program with relevant publications.
Proven expertise in deep learning and transformer models.
Advanced proficiency in Python (especially PyTorch, NumPy, and agent frameworks) for building, fine-tuning, and evaluating AI workflows.
Strong grounding in data structures, algorithms, and core software engineering concepts.
Familiarity with LLM training methodologies, including model distillation, supervised fine-tuning, and policy optimization.
Exceptional analytical and problem-solving skills, with a proactive approach to tackling challenges.
Benefits.
Competitive compensation package, including salary, equity, and health insurance.
Flexible time off and remote work options.
Work with top-tier engineers and cutting-edge technology.
Opportunity to influence the direction of a forward-thinking AI platform.
About Pesto.
Pesto is where software developers go to build their career path for the next 5 years.
We don't just offer jobs; we provide unparalleled opportunities for your growth and success in the dynamic landscape of tech jobs.