14 Wenger & Watson Inc. Jobs
12-15 years
Mumbai
Machine Learning Compiler Engineer - Deep Learning Models (12-15 yrs)
Wenger & Watson Inc.
posted 2mon ago
Flexible timing
Key skills for the job
Overview :
- Development and support of new AI/ML compiler features/technologies to accelerate deep learning models.
- Work closely with AI hardware accelerator teams and add support for compiler features covering optimization algorithms, code generation, etc.
to fully utilize the hardware features for maximum efficiency.
- Be well acquainted with the latest trends in ML models and compiler technologies to build innovative solutions in our products.
Roles and responsibilities :
- Develop end-to-end ML compiler leveraging standard compiler infrastructures like TVM, MLIR, Torch Dynamo/Inductor taking advantage of both intra-operator parallelism and graph/pipeline/dataflow parallelism while mapping to hardware compute/processing elements of custom AI accelerator
- Implement low level parallel programming model for development/deployment of high performance kernels fully utilizing the hardware capabilities
- Adapt advance techniques/algorithms for placement/scheduling and parallelization of model graphs to improve the performance of ML applications optimizing execution speed and resource utilization
- Code generation leveraging LLVM for the AI accelerator compute elements/cores of targeted ISA with custom instructions
- Debugging and profiling of features of compiler to identify issues and performance hot spots
- Own responsibility throughout the product lifecycle in solving functional/performance issues during execution phase
- Work closely with hardware architecture team for efficient HW/SW Co-design of AI accelerator IP and overall AI server-class SoC
- Keep up-to-date with the industry trend in compiler frameworks in terms of feature advancements, design methods and approaches
Qualifications :
- Should have minimum 12+ years of relevant experience in AI/ML
- Should have deep practical experience of developing end-to-end ML compiler with any AI accelerator architecture (GPU/ASIC/many-core heterogenous)
- Should have good experience with handling complex hierarchies of compute elements and memories of hardware accelerator and their mapping in backend compiler for efficient power/performance
- Should have very good knowledge in any one of compiler frameworks like TVM, MLIR, Torch Dynamo/Inductor or equivalent
- Good knowledge on SOTA Generative AI model architectures like LLM and model optimization techniques will be helpful
- Good knowledge on popular ML framework ecosystems (PyTorch/TensorFlow/ONNX)
- High proficiency in C/C++, Python, domain-specific languages and parallel programming languages like OpenCL/CUDA
Functional Areas: Other
Read full job description12-15 Yrs
Mumbai
5-10 Yrs
5-8 Yrs
Bangalore / Bengaluru