Video SDK is a fast-growing technology company that specializes in video-conferencing integration solutions.
We are committed to delivering exceptional experiences to our customers by optimizing our online presence and driving organic traffic.
We are seeking a highly skilled AI Research Engineer (Intern) to contribute to the development of Speech-to-Text (STT), Text-to-Speech (TTS), and Large Language Model (LLM) pipelines.
As an intern, you will have the opportunity to work alongside experienced professionals, gaining valuable experience in building, training, and optimizing AI models at scale.
Responsibilities:
Collaborate with the research team to design and implement STT, TTS, and LLM pipelines using cutting-edge techniques and frameworks.
Assist in the research, development, and optimization of various AI models, including large language models and voice/speech foundation models.
Participate in refining foundation model infrastructure to support the deployment of optimized AI models, focusing on C/C++, CUDA, and kernel-level programming enhancements.
Contribute to the implementation of state-of-the-art optimization techniques, such as quantization, distillation, and sparsity, to improve model performance.
Work on designing and developing novel large language models and architectures leveraging transformers and other state-of-the-art techniques.
Collaborate with the engineering team to integrate and customize frameworks like PyTorch and TensorFlow for accelerated model training and inference.
Support the advancement of deployment infrastructure with MLOps and infrastructure tools like Kubeflow and Terraform for robust development and deployment cycles.
Collaborate with cross-functional teams to translate research advancements into scalable services and products.
Qualifications:
Currently pursuing a degree in Computer Science / Engineering with a focus on AI, Machine Learning, or related fields.
Basic understanding of speech-to-text, text-to-speech, and large language model pipelines.
Proficiency in programming languages such as Python and C/C++, and experience with CUDA and kernel-level programming for AI applications.
Prior exposure to large-scale distributed training and fine-tuning of foundation models like LLaMA2 and Whisper is a plus.
Strong programming skills in Python and proficiency in PyTorch and other ML frameworks and tools.
Benefits:
Strategic Impact: Engage in meaningful work where you can shape and implement strategic solutions, making a significant impact on our clients' success and contributing to the overall growth of the organization.
Collaborative Innovation: Thrive in a collaborative and dynamic work environment that encourages innovation.
Work closely with cross-functional teams to architect solutions that address complex business challenges.
Continuous Professional Advancement: Access continuous learning opportunities and professional growth initiatives.
Competitive Compensation Package: Receive a competitive salary and benefits package, which includes Employee Stock Ownership Plans (ESOPs) and performance-based incentives, recognizing the value of your strategic contributions.
Comprehensive Benefits Coverage: Enjoy a comprehensive benefits package that includes medical insurance and retirement plans.
We prioritize your well-being, ensuring you have the support you need for a successful and fulfilling career.
Flexibility and Work-Life Balance: Benefit from flexible work arrangements that allow you to balance your professional responsibilities with your personal life.
Achieve a harmonious work-life integration that suits your individual preferences.
Learn More: To learn more about our company and the work we do, please visit our website.