Home
Communities
Companies
- Companies
  
  Discover best places to work
- Compare Companies
  
  Compare & find best workplace
- Add Office Photos
  
  Bring your workplace to life
- Add Company Benefits
  
  Highlight your company's perks
Reviews
- Company reviews
  
  Read reviews for 6L+ companies
- Write a review
  
  Rate your former or current company
Salaries
- Browse salaries
  
  Discover salaries for 6L+ companies
- Salary calculator
  
  Calculate your take home salary
- Are you paid fairly?
  
  Check your market value
- Share your salary
  
  Help other jobseekers
- Gratuity calculator
  
  Check your gratuity amount
- HRA calculator
  
  Check how much of your HRA is tax-free
- Salary hike calculator
  
  Check your salary hike
Interviews
- Company interviews
  
  Read interviews for 40K+ companies
- Campus placements
  
  Interviews questions for 2K+ colleges
- Share interview questions
  
  Contribute your interview questions
Jobs
Awards

WINNERS AWAITED!
- ABECA 2025
  
  WINNERS AWAITED!
  
  AmbitionBox Employee Choice Awards - 4th Edition
- ABECA 2024
  
  AmbitionBox Employee Choice Awards - 3rd Edition
- AmbitionBox Best Places to Work 2022
  
  2nd Edition
- AmbitionBox Best Places to Work 2021
  
  1st Edition

Add office photos

Employer? Claim Account for FREE

Riskspan

Compare

4.0

based on 3 Reviews

3 Riskspan Jobs

Site Reliability Engineer

RiskSpan

4.0

based on 3 Reviews

2-5 years

Bangalore / Bengaluru

1 vacancy

Site Reliability Engineer

Riskspan

posted 13hr ago

Job Role Insights

Key skills for the job

Computer Networking Automation Testing Clinical Data Management Linux Administration Incident Management Scheduling

+ 2 more

Job Description

Description

Site Reliability Engineer (SRE)

Location: Bangalore, India (Hybrid)

Shift Timings: Rotational shifts between 8:00 AM to 2:00 AM (next day)

About RiskSpan Technologies

RiskSpan Technologies is a leading technology and data solutions company specializing in delivering innovative and scalable solutions to complex challenges in the financial services and technology sectors. We pride ourselves on a collaborative culture, technical excellence, and a passion for problem-solving. Join us to enhance system reliability, observability, and performance at scale!

Job Overview

We are looking for a Site Reliability Engineer (SRE) with 2.5 to 5 years of experience to join our team. The ideal candidate will be responsible for ensuring the availability, scalability, and reliability of our distributed systems, improving observability, automating infrastructure, and enhancing system performance. This role provides an opportunity to work on high-scale, mission-critical environments and contribute to building a resilient infrastructure.

Key Responsibilities

Improve observability by implementing and managing monitoring, logging, and alerting solutions using Prometheus, ELK stack, and Grafana.
Work with APMs like Dynatrace, New Relic to monitor performance metrics, define SLIs, SLOs, and error budgets.
Participate in incident management, including on-call rotation, and Root Cause Analysis (RCA).
Automate infrastructure provisioning using Terraform and Infrastructure as Code (IaC) principles.
Ensure system scalability, reliability, and performance in a distributed environment.
Strengthen security by applying cybersecurity best practices, vulnerability assessments, and compliance policies.
Collaborate with cross-functional teams to establish SRE best practices, improve release pipelines, and minimize deployment risks.
Maintain and improve disaster recovery plans to enhance resilience.
Manage and optimize workflows using Apache Airflow to ensure efficient scheduling and execution of data pipelines.
Support Snowflake data operations, ensuring high availability, performance optimization, and security compliance.

Qualifications Certifications

Education:

Bachelors degree in Computer Science, Engineering, or related fields.

Experience:

2.5 to 5 years of experience in Site Reliability Engineering, Observability, or Performance Monitoring.

Hands-on experience in:

Monitoring and observability using Prometheus, ELK, Grafana.
Application Performance Monitoring (APM) tools like Dynatrace, New Relic, or Datadog.
Incident response and on-call rotation management.
Infrastructure automation using Terraform.
Distributed systems operations and scaling.
Load testing and performance analysis using tools like JMeter, k6, or Locust.
Security at scale, including vulnerability scanning and compliance automation.
Workflow automation and orchestration using Apache Airflow .
Experience with Snowflake , including query optimization, data management, and security controls.
Technical Skills:
Strong knowledge of cloud platforms (AWS preferred).
Experience with troubleshooting distributed systems and high-traffic environments.
Hands-on knowledge of Linux, networking, and security fundamentals.
Familiarity with container orchestration (Kubernetes, Docker).
Ability to write automation scripts using Python, Bash, or Go.

Preferred Certifications:

AWS Certified DevOps Engineer - Professional (or equivalent AWS certification).
HashiCorp Certified: Terraform Associate.
Certified Kubernetes Administrator (CKA).
Google SRE Professional Certificate (preferred but not mandatory).

Why Join RiskSpan Technologies

Work in an innovative and fast-paced environment focusing on reliability, observability, and automation. Opportunity to learn and grow in an expanding team dedicated to scaling distributed systems. Hybrid work model - Work from Bangalore office and remote as per business needs. Competitive salary and benefits. Collaborative culture that encourages continuous learning and professional development.

Join RiskSpan Technologies and play a key role in ensuring highly available, scalable, and secure systems. Apply now!

Employment Type: Full Time, Permanent

Read full job description