i
Xebia
32 Xebia Jobs
3-8 years
Gurgaon / Gurugram, Chennai, Bangalore / Bengaluru
1 vacancy
Application Support / Incident Response Analyst / System Administrator
Xebia
posted 5d ago
Flexible timing
Key skills for the job
Incident Response Analyst
As an Incident Response analyst, you will work closely with the Site Reliability Engineering, Data Engineering, and multiple support teams as you focus on ensuring the operational health of Global Data & Analytics production environments. We need someone who will bring strong communication and collaboration skills, thoughtful perspective, empathy, creativity, and a positive attitude to solve problems at scale.
You will have the opportunity to analyze problems and root cause to help identify solutions that proactively identify and prevent issues impacting Data & Analytics. You will also be able to gain experience in understanding and supporting high throughput data pipelines that feed our data lake and the use cases beyond.
Job Responsibilities
Youll manage production incidents, lead troubleshooting efforts, and coordinate responses for immediate resolution.
Youll effectively communicate with business and technology leaders, data engineers, and end users to consistently provide incident status updates and potential impacts.
Youll create and publish documentation, including root cause analyses, corrective actions, and operating procedures.
Youll develop and maintain expertise in existing application and platform performance monitoring systems.
You'll proactively research technical solutions and/or industry state-of-the art initiatives applicable to team initiatives.
Qualifications & Required Skills
Possess the excellent communication and presentation skills required to clearly articulate problem statements and solutions to multiple audiences in a corporate setting.
Excellent organizational and time management skills with strong attention to detail
ServiceNow (proficiency in ticketing/incident management within this tool)
A strong passion for learning new technologies and tools.
Knowledge of SQL, ETL, cloud computing, networking, infrastructure, and security.
High levels of creativity and quick problem solving capabilities.
Additional/Desired Skills
Project-planning skills
Basic knowledge of SSL, HTTP response codes, and network technology concepts.
Experienced in writing clear and technically detailed user stories
Basic knowledge of specific elements of the Google Cloud Platform (GCS, Pub/Sub, GKE, Vertex AI)
Basic knowledge of Apache Airflow
Ability to read code (especially Python) and identify/troubleshoot corner case scenarios
New Relic skills:
- Experience in writing JavaScript [for custom synthetic monitors];
- Experience in calling APIs;
- Basic proficiency in NRQL;
- Experience in infrastructure monitoring;
- Experience in creating custom dashboards
Employment Type: Full Time, Permanent
Read full job descriptionPrepare for Incident Response Analyst roles with real interview advice
3-8 Yrs
Gurgaon / Gurugram, Chennai, Bangalore / Bengaluru
5-10 Yrs
Hyderabad / Secunderabad, Chennai, Bangalore / Bengaluru
10-13 Yrs
Chennai, Bangalore / Bengaluru, Delhi/Ncr