Spark is a distributed computing framework designed for big data processing.
Spark is built around the concept of Resilient Distributed Datasets (RDDs) which allow for fault-tolerant parallel processing of data.
It provides high-level APIs in Java, Scala, Python, and R for ease of use.
Spark can run on top of Hadoop, Mesos, Kubernetes, or in standalone mode.
It includes modules for SQL, streaming, machine learning (MLlib), and graph processing (GraphX).
MapReduce is a programming model and processing technique for parallel and distributed computing.
MapReduce is used to process large datasets in parallel across a distributed cluster of computers.
It consists of two main functions - Map function for processing key/value pairs and Reduce function for aggregating the results.
Popularly used in big data processing frameworks like Hadoop for tasks such as sorting and searching large datasets (a minimal sketch of the model follows below).
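To make the model concrete, here is a minimal word-count sketch of the map and reduce phases in plain Python; the function names and sample lines are illustrative, not part of any Hadoop API:

```python
from collections import defaultdict

def map_fn(line):
    # Map: emit a (word, 1) pair for every word in the input line
    return [(word, 1) for word in line.split()]

def reduce_fn(word, counts):
    # Reduce: aggregate all counts emitted for one key
    return word, sum(counts)

lines = ["big data big cluster", "data pipeline"]
grouped = defaultdict(list)
for line in lines:
    for word, one in map_fn(line):   # map phase
        grouped[word].append(one)    # shuffle: group values by key
result = [reduce_fn(word, counts) for word, counts in grouped.items()]
print(result)  # e.g. [('big', 2), ('data', 2), ('cluster', 1), ('pipeline', 1)]
```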
Lazy evaluation in Spark delays the execution of transformations until an action is called.
Lazy evaluation allows Spark to optimize the execution plan by combining multiple transformations into a single stage.
Transformations are not executed immediately, but are stored as a directed acyclic graph (DAG) of operations.
Actions trigger the execution of the DAG and produce results.
Example: map() and filter() are transformations; actions such as count() or collect() trigger the actual computation (see the sketch below).
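A minimal PySpark sketch of this behaviour, assuming an existing SparkSession named spark:

```python
rdd = spark.sparkContext.parallelize(range(10))
doubled = rdd.map(lambda x: x * 2)            # transformation: recorded in the DAG, not run
evens = doubled.filter(lambda x: x % 4 == 0)  # another transformation: still nothing executes
print(evens.count())                          # action: triggers execution of the whole DAG
```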
Imputer function in PySpark is used to replace missing values in a DataFrame.
Imputer is an estimator in the PySpark ML library; fitting it produces an ImputerModel that transforms the DataFrame.
It replaces missing values in a DataFrame with the mean, median, or mode of the column.
It operates on numeric columns only; categorical features are not supported by Imputer and must be handled separately (for example with fillna()).
Example: imputer = Imputer(inputCols=['col1', 'col2'], outputCols=['col1_imputed', 'col2_imputed'], strategy='mean')
Example: imputed_df = imputer.fit(df).transform(df)
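Putting the example together as a runnable sketch (the DataFrame and column names are hypothetical):

```python
from pyspark.sql import SparkSession
from pyspark.ml.feature import Imputer

spark = SparkSession.builder.getOrCreate()
df = spark.createDataFrame(
    [(1.0, None), (None, 4.0), (3.0, 6.0)], ["col1", "col2"]
)
imputer = Imputer(
    inputCols=["col1", "col2"],
    outputCols=["col1_imputed", "col2_imputed"],
    strategy="mean",  # "median" and "mode" are also supported
)
imputed_df = imputer.fit(df).transform(df)  # fit computes the statistics, transform fills the nulls
imputed_df.show()
```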
To delete duplicate rows from a table, either select only the unique rows into a new table or delete the redundant copies in place.
The DISTINCT keyword selects the unique rows, but on its own it does not modify the table.
A GROUP BY over all columns likewise collapses duplicates to one row per group.
A DELETE statement with a subquery (for example, keeping only the minimum id per duplicate group) removes the duplicates in place.
Alternatively, create a new table from the unique rows and drop the old one, as sketched below.
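A minimal sketch of the "new table from unique rows" approach in Spark SQL, assuming an existing SparkSession spark and a hypothetical employees table; the commented DELETE shows the in-place pattern for engines that support it:

```python
spark.sql("""
    CREATE TABLE employees_dedup AS
    SELECT DISTINCT * FROM employees
""")

# In-place alternative on databases that support DELETE with a subquery,
# keeping the lowest id per (name, department) group:
# DELETE FROM employees
# WHERE id NOT IN (SELECT MIN(id) FROM employees GROUP BY name, department);
```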
Null values in PySpark are handled using functions such as dropna(), fillna(), and replace().
dropna() function is used to drop rows or columns with null values
fillna() function is used to fill null values with a specified value or method
replace() function is used to substitute specified values with new ones (nulls themselves are normally filled with fillna())
coalesce() function is used to replace null values with the first non-null value in a list of columns
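A short sketch of these functions, assuming an existing SparkSession spark and illustrative column names:

```python
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.getOrCreate()
df = spark.createDataFrame(
    [("a", None, 1.0), (None, "x", None)], ["c1", "c2", "c3"]
)

df.dropna(subset=["c1"]).show()                  # drop rows where c1 is null
df.fillna({"c2": "unknown", "c3": 0.0}).show()   # fill nulls column by column
df.na.replace("x", "y", subset=["c2"]).show()    # swap specific (non-null) values
df.withColumn("first_non_null", F.coalesce("c1", "c2")).show()  # first non-null across columns
```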
SQL query to retrieve the second highest salary from each department
Use the RANK() function to assign a rank to each salary within each department
Filter the results to only include rows with a rank of 2
No separate GROUP BY is needed: the PARTITION BY department in the window already computes the ranking per department, so the filter yields one row per department (see the sketch below)
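A sketch of the query, run here through spark.sql against a hypothetical employees table; DENSE_RANK() is often preferred over RANK() so that salary ties do not cause rank 2 to be skipped:

```python
spark.sql("""
    SELECT department, salary
    FROM (
        SELECT department, salary,
               DENSE_RANK() OVER (PARTITION BY department ORDER BY salary DESC) AS rnk
        FROM employees
    ) ranked
    WHERE rnk = 2
""").show()
```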
Window functions are used to perform calculations across a set of rows that are related to the current row.
Commonly used window functions include ROW_NUMBER, RANK, DENSE_RANK, LAG, LEAD, FIRST_VALUE, LAST_VALUE, and NTILE.
Window functions are used in conjunction with the OVER clause to define the window or set of rows to perform the calculation on.
Window functions can be used to calculate running totals, moving averages, and per-group rankings without collapsing rows the way GROUP BY does (a PySpark sketch follows below).
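A PySpark sketch over a hypothetical df with department, hire_date, and salary columns:

```python
from pyspark.sql import Window, functions as F

w = Window.partitionBy("department").orderBy("hire_date")
df = (
    df.withColumn("running_total", F.sum("salary").over(w))   # running total per department
      .withColumn("prev_salary", F.lag("salary", 1).over(w))  # previous row's salary (LAG)
      .withColumn("row_num", F.row_number().over(w))          # sequential numbering (ROW_NUMBER)
)
```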
Merge two unsorted lists into a sorted list using inbuilt sorting functions.
Use inbuilt sorting functions to sort the input lists
Merge the sorted lists using a merge algorithm
Return the merged and sorted list
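A minimal Python sketch of this approach; heapq.merge could replace the manual merge loop:

```python
def merge_sorted(a, b):
    a, b = sorted(a), sorted(b)  # built-in sort on each input list
    out, i, j = [], 0, 0
    while i < len(a) and j < len(b):  # standard two-pointer merge
        if a[i] <= b[j]:
            out.append(a[i]); i += 1
        else:
            out.append(b[j]); j += 1
    out.extend(a[i:])  # append whatever remains of either list
    out.extend(b[j:])
    return out

print(merge_sorted([5, 1, 3], [4, 2]))  # [1, 2, 3, 4, 5]
```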
Skewness is a measure of asymmetry in a distribution. Skewed tables are tables with imbalanced data distribution.
Skewness is a statistical measure that describes the asymmetry of the data distribution around the mean.
Positive skewness indicates a longer tail on the right side of the distribution, while negative skewness indicates a longer tail on the left side.
Skewed tables in data engineering refer to tables with imbalanced data distribution across keys or partitions, which can cause uneven load and straggler tasks in distributed jobs (see the checks sketched below).
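Two quick PySpark checks for this, assuming a hypothetical df with a numeric amount column and a join key:

```python
from pyspark.sql import functions as F

df.select(F.skewness("amount")).show()  # > 0 suggests a long right tail, < 0 a long left tail
df.groupBy("join_key").count().orderBy(F.desc("count")).show(5)  # hot keys indicate a skewed table
```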
I applied via Walk-in and was interviewed in Apr 2024. There were 3 interview rounds.
I applied via Naukri.com and was interviewed in Mar 2024. There was 1 interview round.
I applied via Job Portal and was interviewed in Jul 2024. There was 1 interview round.
English, aptitude test, reasoning
I applied via Naukri.com and was interviewed in Apr 2022. There were 4 interview rounds.
UDF stands for User-Defined Function in Spark. It allows users to define their own functions to process data.
UDFs can be written in different programming languages like Python, Scala, and Java.
UDFs can be used to perform complex operations on data that are not available in built-in functions.
PySpark code to check the validity of a mobile_number column can be written using regular expressions and the `regexp_extract` function, or as a UDF; both are sketched below.
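A sketch of both routes, assuming a 10-digit Indian mobile format starting with 6-9 and a hypothetical df with a mobile_number column:

```python
import re
from pyspark.sql import functions as F
from pyspark.sql.types import BooleanType

# Built-in route: regexp_extract returns "" when the pattern does not match
df = df.withColumn(
    "is_valid_mobile",
    F.regexp_extract("mobile_number", r"^[6-9]\d{9}$", 0) != ""
)

# UDF route: the same check expressed as a user-defined function
@F.udf(returnType=BooleanType())
def valid_mobile(s):
    return s is not None and re.fullmatch(r"[6-9]\d{9}", s) is not None

df = df.withColumn("is_valid_udf", valid_mobile("mobile_number"))
```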
I applied via Recruitment Consultant and was interviewed before Feb 2023. There were 3 interview rounds.
This round was based on behavioural and cognitive questions.
I applied via Campus Placement
I appeared for an interview in Oct 2016.
I cannot provide investment advice, but here are five companies that have shown strong financial performance in recent years.
Apple - consistently high revenue and profit margins
Amazon - dominant player in e-commerce and cloud computing
Microsoft - strong growth in cloud computing and enterprise software
Alphabet (Google) - diversified revenue streams and strong advertising business
Visa - dominant player in the payments industry
The Brexit vote could have both positive and negative effects on the Indian economy.
Positive effects: Increased trade opportunities with the UK, potential for attracting foreign investments from companies relocating from the UK.
Negative effects: Uncertainty in global markets leading to volatility in exchange rates, potential decline in exports to the UK.
Example: Indian IT companies may face challenges due to stricter immigration and visa rules in the UK.
Designation | Salaries reported | Salary range
Assistant Manager | 2.8k | ₹6 L/yr - ₹13.5 L/yr
Manager | 2.2k | ₹13.9 L/yr - ₹24 L/yr
Senior Software Engineer | 1.7k | ₹13 L/yr - ₹23.7 L/yr
Assistant Vice President | 1.7k | ₹25 L/yr - ₹42.5 L/yr
Software Engineer | 1.5k | ₹7.8 L/yr - ₹14 L/yr