We are seeking a talented engineer to implement and optimize machine learning, computer vision,and numeric libraries for target hardware architecture, including CPUs, GPUs, DSPs, and otheraccelerators.
Your expertise will be instrumental in enabling efficient and high-performanceexecution of algorithms on these hardware platforms.
Key Responsibilities:
Implement and optimize machine learning, computer vision, and numeric libraries fortarget hardware architectures, including CPUs, GPUs, DSPs, and other accelerators.
Work closely with software and hardware engineers to ensure optimal performance ontarget platforms.
Implement low-level optimizations, including algorithmic modifications, parallelization,vectorization, and memory access optimizations, to fully leverage the capabilities of thetarget hardware architectures.
Work with customers to understand their requirements and implement libraries to meettheir needs.
Develop performance benchmarks and conduct performance analysis to ensure theoptimized libraries meet the required performance targets.
Stay current with the latest advancements in machine learning, computer vision, andhigh-performance computing.
Qualifications:
BTech/BE/MTech/ME/MS/PhD degree in CSE/IT/ECE
> 2 years of experience working in Algorithm Development, Porting, Optimization &Testing
Proficient in programming languages such as C/C++, CUDA, OpenCL, or other relevantlanguages for hardware optimization.
Hands-on experience with hardware architectures, including CPUs, GPUs, DSPs, andaccelerators, and familiarity with their programming models and optimization techniques.
Knowledge of parallel computing, SIMD instructions, memory hierarchies, and cacheoptimization techniques.
Experience with performance analysis tools and methodologies for profiling andoptimization.
Knowledge of deep learning frameworks and techniques is good to have
Strong problem-solving skills and ability to work independently or within a team.