Cerebras has developed a radically new chip and system to dramatically accelerate deep learning applications. Our system runs training and inference workloads orders of magnitude faster than contemporary machines, fundamentally changing the way ML researchers work and pursue AI innovation.
We are innovating at every level of the stack - from chip, to microcode, to power delivery and cooling, to new algorithms and network architectures at the cutting edge of ML research. Our fully-integrated system delivers unprecedented performance because it is built from the ground up for deep learning workloads.
About The Role
As an IT/DevOps engineer, you will be part of a global team responsible for our corporate, engineering, on-prem AI/ML and hybrid cloud computing environments. You will be a primary point of contact for in-geo IT support to Cerebras AI/ML and other staff. You will develop and maintain the IT/DevOps infrastructure required to run our day-to-day operations and help develop and implement future strategic initiatives. A proven ability to automate of day-to-day tasks is critical to the role.
Responsibilities
Design and install computer hardware configurations
Install software and networking systems
Troubleshoot network and software issues
Install and support high-level software security systems
Troubleshoot hardware, software, and networking issues
Provide local IT support and participate in global IT support functions
Participate in on-call rotation for follow-the-sun IT/DevOps support
Ensure security software is kept up-to-date
Collaborate with engineering, AI/ML and development teams to evaluate and identify optimal hybrid cloud solutions
Modify and improve existing systems
Develop and maintain cloud solutions in accordance with best practices
Skills Qualifications
Minimum 5+ years
Masters or Bachelor Degree in Computer Science
Scripting/programming - include GO and Python
Linux Administration (RHEL/Centos)
Familiarity with infrastructure and automation tooling such as Ansible, Terraform, Packer, Jenkins
AWS experience including VPC, Security Groups, EC2, EFS, ASG
AWS Certification a plus
Monitoring, dashboarding, and alerting in ELK/Grafana/Zabbix
Experience working in Atlassian Jira ServiceDesk/Confluence/Slack
Experience with MDM (Intune, JAMF, Airwatch)
Familiarity with administration and deployment of Endpoint security tools
Experience with Microsoft Azure AD/OKTA to enable SSO across SAAS
Applications
Ability to build trusting relationships with peers, internal and external customers.
Ability to prioritize with a keen sense of knowing when to escalate
Why Join Cerebras
People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we ve reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras:
Build a breakthrough AI platform beyond the constraints of the GPU
Publish and open source their cutting-edge AI research
Work on one of the fastest AI supercomputers in the world
Enjoy job stability with startup vitality
Our simple, non-corporate work culture that respects individual beliefs
Read our blog: Five Reasons to Join Cerebras in 2024 .
Apply today and become part of the forefront of groundbreaking advancements in AI.
Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer.
We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies.
We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.
This website or its third-party tools process personal data. For more details, click here to review our CCPA disclosure notice.