Home
Communities
Companies
- Companies
  
  Discover best places to work
- Compare Companies
  
  Compare & find best workplace
- Add Office Photos
  
  Bring your workplace to life
- Add Company Benefits
  
  Highlight your company's perks
Reviews
- Company reviews
  
  Read reviews for 6L+ companies
- Write a review
  
  Rate your former or current company
Salaries
- Browse salaries
  
  Discover salaries for 6L+ companies
- Salary calculator
  
  Calculate your take home salary
- Are you paid fairly?
  
  Check your market value
- Share your salary
  
  Help other jobseekers
- Gratuity calculator
  
  Check your gratuity amount
- HRA calculator
  
  Check how much of your HRA is tax-free
- Salary hike calculator
  
  Check your salary hike
Interviews
- Company interviews
  
  Read interviews for 40K+ companies
- Campus placements
  
  Interviews questions for 2K+ colleges
- Share interview questions
  
  Contribute your interview questions
Jobs
Awards

WINNERS AWAITED!
- ABECA 2025
  
  WINNERS AWAITED!
  
  AmbitionBox Employee Choice Awards - 4th Edition
- ABECA 2024
  
  AmbitionBox Employee Choice Awards - 3rd Edition
- AmbitionBox Best Places to Work 2022
  
  2nd Edition
- AmbitionBox Best Places to Work 2021
  
  1st Edition

For Employers

Add office photos

Employer? Claim Account for FREE

Trimble Applanix

Compare

No reviews yet

129 Trimble Applanix Jobs

Site Reliability Engineering Lead

Applanix

6-10 years

Chennai

1 vacancy

Trimble Applanix

posted 5mon ago

Job Role Insights

Key skills for the job

Logistics SQL Linux Administration UNIX Business Continuity Troubleshooting

+ 2 more

Job Description

Design, implement, and maintain high-availability and scalable systems, ensuring our platforms run smoothly 24/7 with minimal downtime
Emphasize SRE as an engineering discipline, driven by automation. Create and improve IaC, automation tools for continuous integration, deployment, and incident response, reducing manual work and improving response times.
Develop and maintain comprehensive monitoring, alerting, and logging systems to provide deep insights into system performance, identifying potential issues before they impact users.
Monitor system performance and usage, conducting capacity planning and scaling efforts to meet growing demands.Design cost controls and rollout the cost optimization strategy.
Own KPIs for site stability, performance, and root cause analysis (RCA) for production issues.Develop services for automatic incident and disaster recovery.
Participate in troubleshooting, capacity analysis, planning, and performance analysis.
Lead incident response efforts, perform root cause analyses, and implement post-mortem processes to prevent future issues and improve system resilience
Handle escalations from internal stakeholders and manage critical issues to resolution.
Identify problems and opportunities for improvements that are common across many teams and services.
Responsible for fixing compliance issues and requirements raised by CyberSecurity tools
Adopt reliability engineering practices such as error budgets, blameless retrospectives, chaos engineering, etc.
Production operational support of our global service catalog
Foster collaboration with software product development, architecture, and engineering team to ensure releases are delivered with repeatable and auditable processes
Ensure 24x7 coverage with business continuity principles.
Learn and be passionate about cloud computing
Evaluate and utilize the newer technologies coming in the industry to keep the solution on the cutting edge
Mentor junior SREs and other engineering team members, sharing knowledge and promoting a culture of reliability, efficiency, and continuous learning.

What Skills & Experience You Should Bring

Bachelors/Master s degree in Computer Engineering, or related field
Minimum 6+ years experience in technical.
History of supporting applications and infrastructure in Production
Experience in Capacity planning and Cost optimization
Experience with Amazon Web Services (Azure or GCP acceptable)
Deep understanding of Linux/Unix operating systems
Experience building and deploying containers and serverless architecture.
Familiarity with modern web application development and architecture
Experience using a high-level scripting language (Python preferred) and IaC tools(Terraform , CloudFormation) and containerization

Desired Skills

AWS Certification (or equivalent in another public cloud)
Experience with microservice architecture
Expertise in Python or another high-level programming language
Experience with SaaS monitoring tool sets (Datadog, SumoLogic, PagerDuty, InfluxDB , Grafana)
Experience in CloudFormation, SAM Template and Terraform
Experience in Github, Atlassian tools , Bitbucket , Jira and Confluence
Experience in Ansible and Packer
Experience using SQL and NoSQL databases
Experience with Github actions, Jenkins, Azure DevOps and Gradle for CI/CD