Yotta Infrastructure Solutions

Yotta - L3 HPC Administrator (5-7 yrs)

Yotta Infrastructure Solutions

Posted 1mon ago via hirist.com

Job Role Insights

Flexible timing

Job Description

Job Scope:

As an HPC Admin L3, you will be responsible for the provisioning, management, and maintenance of GPU Supercomputing clusters on NVIDIA reference architecture.

You will ensure optimal performance and uptime of these critical systems, supporting high-performance computing (HPC) requirements.

Job Responsibilities:

- Provision, configure, and maintain GPU Supercomputing clusters and associated networking configuration.

- Collaborate with NVIDIA Solution Architect & Engineering teams on large-scale GPU-as-a-service projects, both on-premises and in cloud deployments.

- Implement and optimize software stacks including MaaS (metal-as-a-service), Job Scheduler (SLURM/PBS), Cloud Orchestration (Kubernetes), and Network Management (NetQ for Ethernet fabric and UFM for InfiniBand).

- Conduct performance engineering activities such as debugging, profiling, benchmarking, and tuning of GPU applications on large-scale supercomputing clusters.

- Run benchmarks and workloads from widely used suites and frameworks: MLPerf Training and Inference, AI training frameworks (PyTorch, TensorFlow, NeMo, Megatron-LM), and AI inference stacks (TensorRT-LLM, Triton Inference Server, vLLM).

Must-Have Skills:

- Hands-on experience with NVIDIA GPUs, particularly NVIDIA data centre GPUs (A100/H100).

- Proficiency in provisioning and managing software stacks like MaaS, Job Scheduler (SLURM/PBS), Cloud Orchestration (Kubernetes), and Network Management (NetQ for Ethernet fabric and UFM for InfiniBand).

- Prior experience collaborating with NVIDIA Solution Architect & Engineering teams on large-scale GPU-as-a-service projects.

- Familiarity with benchmarking applications from widely used platforms and frameworks, including MLPerf, PyTorch, TensorFlow, NeMo, Megatron-LM, TensorRT-LLM, Triton Inference Server, and vLLM.

- Experience in performance engineering, including debugging, profiling, benchmarking, and tuning various GPU applications on large-scale supercomputing clusters.

Good-to-Have Skills:

- Knowledge of other HPC technologies and architectures beyond NVIDIA, broadening expertise in the field.

- Good knowledge of InfiniBand and other network switches.

- Experience with other cloud platforms and orchestration tools, expanding versatility in deployment environments.

- Strong problem-solving and troubleshooting abilities, enabling quick resolution of complex technical issues.

- Excellent communication and collaboration skills to work effectively within cross-functional teams and with external partners.

Behavioral Attributes:

- Strong problem-solving skills with a proactive and solution-oriented approach.

- Excellent communication and collaboration skills for effective customer support.

- Adaptability to handle a dynamic and fast-paced cloud administration environment.

- Commitment to security best practices and continuous improvement.

Qualification and Experience:

- Bachelor's degree in Engineering, or equivalent.

- Minimum 10 years of experience in IT, with 5+ years of relevant experience in HPC engineering roles focused on NVIDIA GPU and networking technologies.

- Demonstrated success in deploying and managing large-scale GPU Supercomputing clusters, preferably in collaboration with NVIDIA teams.

- Proven track record of performance engineering activities and optimizing GPU applications for high-performance computing workloads.


Functional Areas: Other


Yotta Infrastructure Solutions Benefits

Submitted by company: Free Transport, Job Training, Health Insurance, Work From Home
Submitted by employees: Work From Home, Job Training, Free Transport, Cafeteria, Soft Skill Training, Health Insurance, and 6 more

Yotta Infrastructure Solutions Mumbai Office Location

Mumbai, Maharashtra Office (Headquarters)
5th Floor, Scorpio Building, Hiranandani Gardens, Powai, Mumbai 400076, Maharashtra, India
