We support them by building creative and robust solutions to operations problems. We use our background as generalists to work closely with product development teams from the early stages of design all the way through identifying and resolving production issues. We see the big picture. We help create and enforce standards while facilitating an agile and learning culture. We use SRE principals such as blameless postmortems and operational load caps to ensure we re constantly improving our knowledge and maintaining a good quality of life. Overall, we re passionate about automation, learning and participating in dynamic day to day work. In This Role, You Will: Enlighten, Enable and Empower a fast-growing set of multi-disciplinary teams, across multiple applications and locations. Tackle complex development, automation and business process problems. Champion Cvent standards and best practices. Ensure the scalability, performance, and resilience of Cvent products and processes. Work with product development teams, Cloud Automation and other SRE teams to ensure a holistic understanding of observability gaps and their effective and efficient identification and resolution. Identify recurring problems and anti-patterns in development, operational and security processes and help respective team to build observability for those. Develop build, test and deployment automation that seamlessly targets multiple on-premises and AWS regions. Give back by working on and contributing to Open-Source projects. Heres What You Need: Experience managing AWS services / operational knowledge of managing applications in AWS - ideally via automation. Fluent in at least one scripting languages like Typescript, Javascript, Python, Ruby and Bash. Experience with SDLC methodologies (preferably Agile). Experience with Observability (Logging, Metrics, Tracing) and SLI/SLO Working with APM, monitoring, and logging tool (Datadog, New Relic, Splunk) Good understanding of containerization concepts - docker, ECS, EKS, Kubernetes. Self-motivation and the ability to work under minimal supervision Troubleshooting and responding to incidents, set a standard for others to prevent the issues in future. Good to have skills: Experience with Infrastructure as Code (IaC) tools such as CloudFormation, CDK (preferred) and Terraform. Experience managing 3 tier application stacks. Understanding of basic networking concepts. Experience on Server configuration through Chef, Puppet, Ansible or equivalent Working experience with NoSQL databases such as MongoDB, Couchbase, Postgres etc