i
Aurigo
SSE II - CloudOps
Aurigo
posted 3d ago
Flexible timing
Key skills for the job
Senior Software Engineer 2 - CloudOps
Location: Bangalore (India)
Experience: 6 to 10 years
About Aurigo: Aurigo is an American technology company founded in 2003 with a mission to help public sector agencies and facility owners plan, deliver and maintain their capital projects and assets safely and efficiently. Our award-winning Aurigo Masterworks Cloud software is now the industry- leading solution for both public and private agencies funding and maintaining large capital, infrastructure, and facilities investments. We are a privately held U.S. corporation proudly headquartered in Austin, Texas, with software development and support centers in Canada and India.If you are ready to work for a fast-paced software company growing at over 100% YOY and interact with some of the brightest minds in the industry to solve real problems, we want to talk to you.
Position Description:
The SRE team provides hosting, operations, database, security, and scaling support to Aurigo flag ship products hosted on AWS Cloud Infrastructure. SRE role at Aurigo is to enable the business to deliver, operate, maintain, and scale our flagship products. To sustain a high-performing
product, Aurigo must design, implement, and maintain highly available and responsive cloud infrastructure. Aurigo requires a dynamic Site Reliability Engineer with both Application
management and Infrastructure administration skills. The engineer should be capable of delivering a highly available and reliable Application environment and will be responsible for a variety of technical, operational, and consultative activities, including system administration, release engineering tasks for our flagship products. It is critically important to the company that the applications and database systems offer the highest levels of reliability and performance. We are committed to providing 99.99% uptime.
Required Skills and Experience:
Education Qualifications: BE/Btech/ ME / Mtech only
AWS Cloud Management: Minimum 5 years of hands-on experience in deploying and
managing AWS services such as EC2, VPC, RDS, S3, ALB, Route 53, API Gateway, and
CloudFront in production environments. AWS Certified Solutions Architect Associate
(SAA) or Solutions Architect Professional (SAP) is preferred.
Infrastructure as Code (IaC): Minimum 4 years of experience with Terraform for
provisioning, managing, and scaling cloud infrastructure. HashiCorp Certified: Terraform
Authoring and Operations Professional or HashiCorp Certified: Terraform Associate
(003) is preferred.
Linux Administration: Proficiency in Linux system administration, including patching,
security hardening, and performance tuning in cloud environments.
Observability & Monitoring: Hands-on experience with CloudWatch, New Relic,
OpenSearch, Sumo Logic, or similar tools for log analysis, monitoring, and
troubleshooting.
Scripting & Automation: Proficiency in Python, Shell, or PowerShell for automating
cloud infrastructure tasks, deployments, and operational workflows.
Cloud Networking & Security: Strong understanding of VPC networking, VPNs, load
balancers, and security best practices, including WAFs (Web Application Firewalls) and
endpoint protection tools.
Site Reliability Engineering (SRE) Practices: Experience with SLA, SLI, and SLO, incident
management, and ensuring system reliability for production workloads.
Disaster Recovery & Resilience: Experience conducting DR and BCP tests for AWS
workloads and implementing high-availability architectures.
Roles & Responsibilities: AWS Cloud Administration: Own the troubleshooting, administration, and optimization of AWS environments, applications, databases, and servers.
Infrastructure as Code (IaC): Design, implement, and maintain cloud infrastructure
using Terraform.
Monitoring & Performance Optimization: Continuously monitor infrastructure health,
application performance, and security gaps, ensuring proactive resolution.
Compliance & Security Audits: Support the implementation and auditing of security and
compliance frameworks such as SOC 2, StateRAMP, and FedRAMP.
Automation & Scripting: Develop and optimize automation scripts for infrastructure
provisioning, patching, and operational tasks.
Production Readiness & Reliability: Conduct regular DR tests, enforce reliability best
practices, and optimize cloud resource utilization.
Incident & Change Management: Lead incident response for production issues,
perform root cause analysis, and collaborate with teams to implement preventive
measures.
On-Call & Maintenance: Participate in a 24/7 On-Call rotation (one week per month)
and monthly patching activities on one Saturday per month (compensatory time off
provided).
Employment Type: Full Time, Permanent
Read full job descriptionPrepare for Sales Executive 2 roles with real interview advice