1 Nextgen Innovation Labs Job
10-20 years
NEXTGEN Innovation Labs - System Administration/Engineer - HPC (10-20 yrs)
Nextgen Innovation Labs
posted 1mon ago
Flexible timing
Key skills for the job
Job Description :
Responsibilities :
Run :
- Workload scheduler management
- System deployment using cluster management tools
- OS repository management along with compatibility matrix for various device drivers
- Configuration and maintenance of network services
- Deployment and management of monitoring systems to automate services and hardware alerts
- Internal cluster network connectivity
- Cloud migration of HPC cluster (Core HPC system)
- Installation and configuration of HPC workload managers
- HPC Application integration with job scheduler
Skills / Expertise :
- Operating systems : Linux : RHEL, Rocky, CentOS, SuSE, Windows
- Schedulers & Resource Managers : PBS Pro, LSF, SLURM, Open Grid Scheduler [OGS]
- Provisioning : HP-CMU, xCAT, Bright Cluster Manager
- Monitoring : Ganglia, Nagios, Zabbix, Grafana
- Configuration Management : Chef, Puppet, Ansible, CFEngine.
- HPC Application : Openfoam, Star-CCM+,Abaqus, Ansys, Ls-Dyna and other CAE & CFD applications
- Linux operating system fundamentals, architecture, administration, native service configuration and advanced debugging skills
- Knowledge of x86 hardware, system software and system services
- Experience in HPC cluster configuration, management, upgrade and migration
- Knowledge of Managing parallel file system Like Luster, BeeGFS, GPFS
- Scripting and automation - bash, Perl, Python
- Knowledge of ITSM processes
Functional Areas: IT Hardware & Telecom
Read full job descriptionPrepare for System Administrator roles with real interview advice
10-20 Yrs