5+ years of professional experience handling large-scale production systems.
Experience in designing new services on AWS or comparable cloud providers.
Migration of services to the cloud, and deployment of new services on AWS (at least EC2, IAM, S3).
Hands-on experience in Windows Administration and Linux.
Knowledge of IIS and nginx.
Knowledge of Active Directory services.
Hands-on Experience with Kubernetes and Docker.
Experience with Powershell scripting language.
Excellent knowledge of large-scale web applications/distributed systems.
Critical thinking is continuously challenging how and why we do things to help us improve.
Excellent English communication skills both verbal and in writing.
Nice to have
Mastery in a programming language preferably .NET and development best practices.
Willingness to live our values: Ensure Customer Success, Focus on Results, and Strive for Excellence.
Experience with Terraform, and configuration management tools like Chef, Ansible, Github, or equivalent.
Responsibilities
Administration of Windows, Web servers, Application servers, Kubernetes clusters, and cloud infrastructure support for customer production environments.
Own end-to-end availability and performance of mission-critical services and build automation to prevent problem recurrence.
Work closely with Product and Development to bring availability and scalability forward with all future developments.
Tools development and automation to increase availability, performance, and deployments.
Coordinate incident, problem, and change management.
Collaborate with Product and Support teams to plan and deploy product releases.
Candidates need to be willing to participate in an on-call rotation that includes maintenance schedules and incident response outside of office hours.