Cloud Destinations
System Engineer - Azure Cloud Platform (7-10 yrs)
posted 14hr ago
Flexible timing
About the Job :
We are seeking a highly motivated and experienced Cloud Systems Engineer to join our growing team.
In this role, you will play a crucial part in architecting, planning, maintaining, implementing, and optimizing our cloud environment within Microsoft Azure. You will be instrumental in leveraging and promoting cloud-native solutions, empowering our engineers to effectively utilize cloud-based services.
A strong focus on High-Performance Computing (HPC) workloads and job scheduling is essential for this position.
Mandatory Skills :
- Extensive experience with Microsoft Azure
- Proven experience with HPC workloads and job scheduling systems
Position Overview :
The ideal candidate possesses a deep understanding of cloud computing principles and a passion for building and maintaining robust, scalable, and secure cloud infrastructure. You will collaborate with various engineering teams to design and implement solutions that meet their specific needs, while adhering to best practices and security guidelines. Your expertise in automation and infrastructure as code will be critical to our success.
Essential Duties and Responsibilities :
- Cloud Architecture and Design : Architect and design cloud-native solutions on Azure, considering scalability, performance, security, and cost optimization. Specifically, design and implement solutions for HPC workloads, leveraging appropriate job scheduling systems.
- Cloud Infrastructure Management : Implement, maintain, and optimize Azure cloud infrastructure components, including virtual machines, storage, networking, security groups, and other relevant services.
- Automation and Orchestration : Develop and maintain automation scripts and tools using Ansible, Terraform, or similar technologies to automate infrastructure provisioning, configuration, and management.
- Operating System Expertise : Possess a solid understanding of operating systems (Windows, RHEL, Ubuntu, CentOS, and other Linux/Unix distributions), including system builds, configurations, troubleshooting, and performance tuning.
- Networking and Security : Configure and troubleshoot network issues in a hybrid cloud environment. Implement and maintain security features such as LDAP, ADFS, SSL certificates, and other security best practices.
- Monitoring and Observability : Implement and utilize monitoring tools like Splunk to proactively monitor cloud resources, identify performance bottlenecks, and ensure optimal usage. Develop dashboards and alerts to provide real-time insights into system health.
- Troubleshooting and Problem Solving : Effectively troubleshoot network, security, and performance issues in the cloud environment. Identify root causes and implement effective solutions.
- Collaboration and Communication : Work closely with engineering teams to understand their requirements and provide technical guidance on cloud-native solutions. Communicate effectively with both technical and non-technical stakeholders.
- Documentation : Create and maintain comprehensive documentation for cloud infrastructure, processes, and best practices.
- On-Call Support : Participate in an on-call rotation to provide system administration support, including weekends, holidays, and outside business hours as needed.
- Infrastructure as Code : Develop and maintain infrastructure as code using Python or other programming languages to automate infrastructure management and ensure consistency.
- Project Leadership : Lead small team projects related to cloud infrastructure implementations and upgrades.
Required Qualifications :
- 7+ years of hands-on technical experience in system design and implementation.
- 3+ years of experience architecting and implementing cloud-native solutions, specifically on Microsoft Azure.
- Proven experience with HPC workloads and job scheduling systems.
- Deep understanding of cloud computing technologies, business drivers, and emerging trends.
- Extensive experience provisioning, operating, and implementing cloud-based solutions on Azure.
- Hands-on experience with Splunk for monitoring and observability.
- Strong understanding of networking concepts and security best practices in a cloud environment.
- Experience implementing security features such as LDAP, ADFS, SSL, etc.
- Proficiency in at least one scripting language (e.g., Python, Bash) for automation and infrastructure as code.
- Experience with automation tools like Ansible and Terraform.
- Excellent troubleshooting and problem-solving skills.
- Ability to work effectively in a team-oriented and collaborative environment.
- Strong communication and documentation skills.
- Degree in Computer Science or equivalent work experience (preferred).
Preferred Qualifications :
- Experience with other cloud platforms (AWS, GCP).
- Certifications related to Azure and/or HPC.
- Experience with containerization technologies (Docker, Kubernetes).
- Knowledge of DevOps principles and practices.
Functional Areas: Software/Testing/Networking