
12 AgileEngine Jobs

AgileEngine - Lead DevOps Engineer - Site Reliability (6-8 yrs)

6-8 years

AgileEngine

posted 3+ weeks ago

Job Description

Job Title: SRE - DevOps Lead


Responsibilities:

- Lead and mentor a team of SRE and DevOps engineers, fostering a culture of collaboration, ownership, and continuous improvement.

- Architect, design, and implement scalable, reliable, and highly available infrastructure on AWS.

- Manage and maintain our Kubernetes clusters, ensuring optimal performance, security, and resource utilization.

- Develop and maintain infrastructure-as-code (IaC) using Terraform for automated provisioning and management of resources.

- Design, implement, and manage CI/CD pipelines using GitLab for automated builds, testing, and deployments.

- Configure and manage VPN tunnels and SFTP setups for secure data transfer and connectivity.

- Design and implement cloud-based networking solutions, including VPCs, subnets, routing, and security groups.

- Develop and maintain scripting solutions using Shell/Bash (and ideally Python) to automate routine tasks and system administration.

- Lead incident management processes, including root cause analysis, post-incident reviews, and preventative measures.

- Implement and maintain observability solutions, including monitoring, logging, and alerting, to proactively identify and address system issues.

- Coordinate effectively with cross-functional teams across multiple time zones to ensure smooth operations and project delivery.

- Ensure compliance with relevant industry regulations and standards, including HIPAA and GDPR.

- Train, mentor, and support junior team members, fostering their technical growth and development.

- Drive process improvement initiatives and implement automation strategies to enhance system reliability and operational efficiency.

- Participate in on-call rotations to provide 24/7 support for critical systems (approximately 2-3 days per week, including every other weekend).

- Work a Panama schedule with 8-hour shifts daily.

Requirements:

- 6+ years of experience in Site Reliability Engineering (SRE), DevOps, or infrastructure roles, with increasing levels of responsibility.

- Proven experience in leading distributed engineering or support teams, including performance management, mentoring, and team development.

- Deep knowledge of Amazon Web Services (AWS), including core services such as EC2, S3, RDS, VPC, and IAM.

- Extensive hands-on experience with Terraform for infrastructure provisioning and management.

- Strong proficiency in GitLab, including CI/CD pipeline design, implementation, and maintenance.

- Expertise in Kubernetes, including cluster management, deployment strategies, and troubleshooting.

- Solid understanding of Docker and containerization technologies.

- Practical experience with VPN tunnel configuration, SFTP setup, and cloud-based networking principles and practices.


- Familiarity with scripting languages, particularly Shell/Bash, for system automation.

- Strong incident management skills, including experience in leading incident response, conducting root cause analysis, and implementing corrective actions.


- Proven experience in implementing and utilizing observability tools and practices for monitoring, logging, and alerting.

- Excellent verbal and written communication skills, with the ability to articulate complex technical issues clearly and concisely.

- Ability to coordinate effectively with teams across multiple time zones and cultural backgrounds.

- Familiarity with working in compliance-heavy environments, with specific experience in HIPAA and GDPR regulations.

- Demonstrated ability to train, mentor, and support junior team members, fostering their technical growth.

- Proven track record of driving process improvement and implementing automation solutions to enhance system reliability and efficiency.

- Willingness to work a Panama schedule with 8-hour shifts daily and participate in on-call rotations (2-3 days per week, including every other weekend).

Nice to Have:


- Experience with Mirth Connect and/or Epic IRIS EHR systems.

- Familiarity with Bitbucket and Codefresh.

- Knowledge of Ansible for configuration management and automation.

- Familiarity with Python for scripting and automation tasks.

- Previous involvement in healthcare or medical device environments, with an understanding of relevant regulations and best practices.

- Strong understanding of high-availability infrastructure patterns and design principles.



Functional Areas: Other


What AgileEngine employees are saying about work life

Based on 51 employees:

- Flexible timing: 89%
- Monday to Friday: 98%
- No travel: 100%
- Day Shift: 67%

AgileEngine Benefits

Submitted by Company:

- Flextime
- Competitive compensation
- A selection of exciting projects
- Professional growth
- Work From Home

Submitted by Employees:

- Free Transport
- Child care
- Gymnasium
- Cafeteria
- Work From Home
- Free Food (+6 more)

Compare AgileEngine with

- Primus Global Technologies: 3.9
- Accel Frontline: 4.1
- Pitney Bowes: 3.8
- DynPro: 3.8
- Apex CoVantage: 3.1
- Plada Infotech Services: 3.5
- Riddhi Corporate Services: 3.7
- Ind Innovations: 3.6
- CGS: 3.6
- Affiliated Computer Services: 3.6
- Pioneer e Solutions: 3.6
- Liferay: 4.1
- Gemini Communication: 4.6
- Anthelio Business Technologies: 4.6
- Oasis TechnoSoft: 4.7
- MASTEK ENGINEERING: 4.0
- Ausy: 3.9
- Aspect Software: 3.9
- Odisha Computer Application Centre (OCAC): 4.7
- QNAP Systems: 3.1

Similar Jobs for you

- Senior AWS DevOps Engineer at Techolution | 5-6 Yrs | ₹ 15-20 LPA
- Site Reliability Engineer Lead at Infraveo Technologies | 4-6 Yrs | ₹ 18-35 LPA
- Site Reliability Engineer 2 at Zinnia | 5-7 Yrs | ₹ 15-19 LPA
- Site Reliability Engineer Lead at athenaHealth Technology Private Limited. | 8-12 Yrs | ₹ 22-34 LPA
- Site Reliability Engineer Lead at Pocket FM | 6-8 Yrs | ₹ 20-25 LPA
- Site Reliability Engineer 2 at TALENT XO | 1-5 Yrs | ₹ 25-30 LPA
- Lead DevOps Engineer at IOWeb3 Technologies | 5-8 Yrs | ₹ 15-24 LPA
- Reliability Lead at HyreSnap | 6-8 Yrs | ₹ 18-24 LPA
- Lead DevOps Engineer at Velodata Global Pvt Ltd | 7-10 Yrs | ₹ 15-20 LPA
- Service Reliability Engineer at bounteous x Accolite Digital | 7-9 Yrs | ₹ 21-27 LPA

- AgileEngine - Lead DevOps Engineer - Site Reliability (6-8 yrs) | 6-8 Yrs | Software Configuration Management, DevOps, AWS +6 more | 3+ weeks ago · via hirist.com
- Automation QA Engineer (Middle/Senior) | 3-8 Yrs | Kolkata, Mumbai, New Delhi +4 more | Manual Testing, Recruitment, Javascript +7 more | 1 day ago · via naukri.com
- Automation QA Engineer (Middle/Senior) | 3-5 Yrs | Indore | Automation Testing, Javascript, Automation +3 more | 2 days ago · via naukri.com
- InRiver PIM Operations Manager | 5-8 Yrs | ₹ 275L/yr - 400L/yr | Indore | Excel, Clinical Data Management, Power Point Presentation +2 more | 1 week ago · via naukri.com
- Lead Full Stack Developer - Node.js/React.js (6-8 yrs) | 6-8 Yrs | Javascript, Nestjs, Full Stack +2 more | 2 weeks ago · via hirist.com
- Java Engineer - Spring Boot/Hibernate (5-8 yrs) | 5-8 Yrs | Cloud Computing, Java, Java Spring Boot +4 more | 2 weeks ago · via hirist.com
- AgileEngine - Senior Network Infrastructure Engineer - CCNA/CCNP (4-6 yrs) | 4-6 Yrs | Linux Administration, VMware, CCNA +6 more | 3+ weeks ago · via hirist.com
- AgileEngine - Frontend Engineer - AngularJS (3-6 yrs) | 3-6 Yrs | UI and UX, Javascript, HTML +4 more | 3+ weeks ago · via hirist.com
- AgileEngine - Lead Full Stack Developer - Node.js/React.js (8-10 yrs) | 8-10 Yrs | Javascript, Full Stack, Postgresql +2 more | 3+ weeks ago · via hirist.com
- InRiver PIM Operations Manager (5-7 yrs) | 5-7 Yrs | Clinical Data Management, Data Governance, Data Modeling +1 more | 3+ weeks ago · via hirist.com