The primary purpose of this role is to develop an artificial intelligence (AI) platform that supports a wide array of machine learning (ML) models, including sophisticated deep learning frameworks and large language models (LLMs). This role will work on scaling model performance, building essential tools and frameworks, and managing compute and storage resources. The role involves close collaboration with cross-functional teams to identify new opportunities leveraging AI platform capabilities across different domains to accelerate AI infused product development.
Roles & Responsibilities:
Scales the platform for high performance and integrates new AI capabilities as APIs to ensure the platform remains adaptable and efficient in hosting a variety of ML models.
Designs, develops, and implements tools and frameworks that support ML experimentation and deployment.
Manages GPU and CPU resources to optimize the execution of AI models to ensure the platform runs efficiently, balancing performance with cost-effectiveness.
Works closely with data scientists to integrate AI models smoothly into platform.
Creates and manages efficient data movement and pipelines for the AI platform to operate smoothly. Optimizes data flows to support the demands of high-velocity AI model training and inference. user satisfaction. Analyzes platform performance metrics and user feedback to drive continuous improvement initiatives. Utilizes insights to guide platform enhancements, ensuring the AI platform remains at the forefront of technological advancements any
Collaborates effectively with diverse teams, integrating technical expertise with business insights and user needs.
Implements security protocols and governance measure for AI platform, ensuring data integrity and compliance with industry standards and best practices.
Years of Experience:
5-8 yrs of experience working AI/ML Platform Engineering