HSBC Group Data Engineer Interview Questions and Answers

Updated 31 Aug 2024

11 Interview questions

A Data Engineer was asked
Q. What is Spark, and how does it work?
Ans. 

Spark is a distributed computing framework designed for big data processing.

  • Spark is built around the concept of Resilient Distributed Datasets (RDDs) which allow for fault-tolerant parallel processing of data.

  • It provides high-level APIs in Java, Scala, Python, and R for ease of use.

  • Spark can run on Hadoop YARN, Mesos, Kubernetes, or in standalone mode.

  • It includes modules for SQL, streaming, machine learning, an...

A Data Engineer was asked
Q. What is MapReduce?
Ans. 

MapReduce is a programming model and processing technique for parallel and distributed computing.

  • MapReduce is used to process large datasets in parallel across a distributed cluster of computers.

  • It consists of two main functions: a Map function that turns input records into intermediate key/value pairs, and a Reduce function that aggregates the values grouped under each key.

  • Popularly used in big data processing frameworks like Hadoop for tasks like data sorting, sear...
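The map, shuffle, and reduce steps can be sketched in plain Python as a word count. This is only a single-process illustration of the programming model, not Hadoop code; the function names (map_phase, shuffle, reduce_phase, word_count) are made up for this example.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values under their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: aggregate the values collected for one key."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over the input lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())
```

In a real cluster the map and reduce calls run on different machines and the shuffle moves data over the network; here everything runs in one process so the data flow is easy to follow.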

A Data Engineer was asked
Q. What is lazy evaluation in Spark?
Ans. 

Lazy evaluation in Spark delays the execution of transformations until an action is called.

  • Lazy evaluation allows Spark to optimize the execution plan, for example by pipelining narrow transformations such as map() and filter() into a single stage.

  • Transformations are not executed immediately, but are stored as a directed acyclic graph (DAG) of operations.

  • Actions trigger the execution of the DAG and produce results.

  • Example: map() and filter() are transf...
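The idea can be illustrated without Spark, because Python's built-in map() and filter() are also lazy. The sketch below is an analogy only, not Spark code: building the pipeline does no work, and the list() call plays the role of an action. The run_demo name is hypothetical.

```python
def run_demo():
    """Show that no element is processed until the pipeline is consumed."""
    log = []

    def traced_double(x):
        log.append(x)  # record every element that is actually processed
        return x * 2

    # "Transformations": these calls only build the pipeline.
    doubled = map(traced_double, [1, 2, 3, 4])
    kept = filter(lambda x: x > 4, doubled)
    calls_before = len(log)

    # "Action": consuming the pipeline triggers the work.
    result = list(kept)
    calls_after = len(log)
    return calls_before, result, calls_after
```

Calling run_demo() shows zero processed elements before the list() call and four afterwards.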

A Data Engineer was asked
Q. What is the imputer function in PySpark?
Ans. 

Imputer in PySpark is used to fill in missing values in the numeric columns of a DataFrame.

  • Imputer is an estimator in the PySpark ML library (pyspark.ml.feature); fitting it produces a model that transforms the DataFrame.

  • It replaces missing values with the mean or median of the column; newer Spark releases also offer a mode strategy.

  • It supports numeric columns only; categorical columns are not supported.

  • Example: imputer = Imputer(inputCols=['col1', 'col2'], outputCols=['col1_imputed', 'col2_imputed'], strategy='mean')

  • Example: imputed_df = impute...
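What the mean strategy computes can be sketched in plain Python, with rows modelled as dicts and None standing in for a missing value. This is a conceptual illustration and not the PySpark API; fit_means and impute are hypothetical helper names, and the "_imputed" suffix simply mirrors the output column names in the example above.

```python
from statistics import mean


def fit_means(rows, columns):
    """'Fit' step: mean of the non-missing values in each column."""
    return {c: mean(r[c] for r in rows if r[c] is not None) for c in columns}


def impute(rows, fill_values):
    """'Transform' step: add <col>_imputed with missing values filled in."""
    out = []
    for row in rows:
        new_row = dict(row)
        for col, fill in fill_values.items():
            new_row[col + "_imputed"] = fill if row[col] is None else row[col]
        out.append(new_row)
    return out
```

A column whose values are all missing makes statistics.mean raise an error in this sketch, so such columns would need separate handling.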

A Data Engineer was asked
Q. How do you delete duplicate rows from a table?
Ans. 

DISTINCT and GROUP BY only return unique rows in a query result; to actually delete duplicates, use a DELETE with a subquery or rebuild the table.

  • Use the DISTINCT keyword to select unique rows from the table.

  • Use the GROUP BY clause on the columns that define a duplicate, with HAVING COUNT(*) > 1, to find the duplicated rows.

  • Use the DELETE statement with a subquery (for example, one that keeps the lowest row identifier in each group) to delete the duplicate rows.

  • Create a new table with the unique rows, drop the old table, and rename the new one.
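The DELETE-with-subquery approach can be tried with Python's built-in sqlite3 module. This is a minimal sketch: the employees table, its columns, and the sample rows are invented for illustration, and rowid is SQLite's implicit row identifier (other databases need a primary key or ROW_NUMBER() instead).

```python
import sqlite3


def delete_duplicates(conn):
    """Keep the lowest rowid in each (name, dept) group and delete the rest."""
    conn.execute(
        """
        DELETE FROM employees
        WHERE rowid NOT IN (
            SELECT MIN(rowid) FROM employees GROUP BY name, dept
        )
        """
    )
    conn.commit()


def demo():
    """Build a small table with duplicates, clean it, and return what is left."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE employees (name TEXT, dept TEXT)")
    conn.executemany(
        "INSERT INTO employees VALUES (?, ?)",
        [("Asha", "IT"), ("Asha", "IT"), ("Ravi", "HR"), ("Asha", "IT")],
    )
    delete_duplicates(conn)
    rows = conn.execute(
        "SELECT name, dept FROM employees ORDER BY name"
    ).fetchall()
    conn.close()
    return rows
```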

A Data Engineer was asked
Q. How do you handle null values in PySpark?
Ans. 

Null values in PySpark are handled using functions such as dropna(), fillna(), and coalesce().

  • dropna() drops rows that contain null values; its how, thresh, and subset parameters control which rows qualify

  • fillna() fills null values with a specified value, optionally per column by passing a dict

  • replace() substitutes one value for another (for example, turning a sentinel such as -1 into null); nulls themselves are filled with fillna()

  • coalesce() from pyspark.sql.functions returns the first non-null value from a list of columns
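The semantics of dropping, filling, and coalescing can be sketched in plain Python, with rows as dicts and None standing in for null. These helpers (drop_nulls, fill_nulls, coalesce) are illustrative stand-ins written for this example, not the PySpark functions themselves.

```python
def drop_nulls(rows, subset):
    """Keep only rows where every column in `subset` is non-null."""
    return [r for r in rows if all(r[c] is not None for c in subset)]


def fill_nulls(rows, defaults):
    """Replace nulls with a per-column default; other columns are untouched."""
    return [
        {c: (defaults.get(c) if v is None else v) for c, v in r.items()}
        for r in rows
    ]


def coalesce(*values):
    """Return the first non-null value, or None if all are null."""
    return next((v for v in values if v is not None), None)
```

Note that coalesce treats 0 and empty strings as real values; only None counts as null.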

A Data Engineer was asked
Q. Write an SQL query to get the second highest salary from each department.
Ans. 

SQL query to retrieve the second highest salary from each department

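  • Use DENSE_RANK() OVER (PARTITION BY department ORDER BY salary DESC) to rank salaries within each department; RANK() skips rank 2 when the top salary is tied

  • Filter the results to only include rows with a rank of 2

  • No GROUP BY is needed, since PARTITION BY already scopes the ranking to each department; add DISTINCT if several employees share the second highest salary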
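A possible query, run here through Python's built-in sqlite3 module so it can be tried locally. The employees table, its columns, and the helper name second_highest are assumptions made for this sketch, and window functions require SQLite 3.25 or newer.

```python
import sqlite3

SECOND_HIGHEST_QUERY = """
SELECT department, salary
FROM (
    SELECT DISTINCT department, salary,
           DENSE_RANK() OVER (
               PARTITION BY department ORDER BY salary DESC
           ) AS rnk
    FROM employees
) AS ranked
WHERE rnk = 2
ORDER BY department
"""


def second_highest(employees):
    """Return (department, salary) pairs for the second highest salary."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE employees (name TEXT, department TEXT, salary INTEGER)"
    )
    conn.executemany("INSERT INTO employees VALUES (?, ?, ?)", employees)
    rows = conn.execute(SECOND_HIGHEST_QUERY).fetchall()
    conn.close()
    return rows
```

Departments with only one distinct salary produce no row, which is usually the expected answer to this question.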

A Data Engineer was asked
Q. What are the window functions you have used?
Ans. 

Window functions are used to perform calculations across a set of rows that are related to the current row.

  • Commonly used window functions include ROW_NUMBER, RANK, DENSE_RANK, LAG, LEAD, FIRST_VALUE, LAST_VALUE, and NTILE.

  • Window functions are used in conjunction with the OVER clause to define the window or set of rows to perform the calculation on.

  • Window functions can be used to calculate running totals, moving av...
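As a small illustration, the query below computes a running total with SUM() OVER and the previous row's value with LAG(). It runs through Python's built-in sqlite3 module (SQLite 3.25 or newer); the sales table and the running_totals helper are invented for this sketch.

```python
import sqlite3

RUNNING_TOTAL_QUERY = """
SELECT day,
       amount,
       SUM(amount) OVER (ORDER BY day) AS running_total,
       LAG(amount) OVER (ORDER BY day) AS previous_amount
FROM sales
ORDER BY day
"""


def running_totals(sales):
    """Return (day, amount, running_total, previous_amount) per row."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sales (day INTEGER, amount INTEGER)")
    conn.executemany("INSERT INTO sales VALUES (?, ?)", sales)
    rows = conn.execute(RUNNING_TOTAL_QUERY).fetchall()
    conn.close()
    return rows
```

Unlike GROUP BY, the window functions keep one output row per input row; LAG() returns NULL (None in Python) for the first row because it has no predecessor.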

A Data Engineer was asked
Q. Merge two unsorted lists such that the output list is sorted. You are free to use inbuilt sorting functions to sort the input lists
Ans. 

Merge two unsorted lists into a sorted list using inbuilt sorting functions.

  • Use inbuilt sorting functions to sort the input lists

  • Merge the sorted lists using a merge algorithm

  • Return the merged and sorted list
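The three steps above can be sketched in Python as follows. The function names are chosen for this example; sorted() is the built-in sort the question allows, and the merge itself is a standard two-pointer pass.

```python
def merge_sorted(left, right):
    """Two-pointer merge of two lists that are already sorted."""
    merged, i, j = [], 0, 0
    while i < len(left) and j < len(right):
        if left[i] <= right[j]:
            merged.append(left[i])
            i += 1
        else:
            merged.append(right[j])
            j += 1
    # One side is exhausted; append whatever remains of the other.
    merged.extend(left[i:])
    merged.extend(right[j:])
    return merged


def merge_unsorted(a, b):
    """Sort each input with the built-in sort, then merge the results."""
    return merge_sorted(sorted(a), sorted(b))
```

In an interview it is worth mentioning that sorted(a + b) gives the same result in one line; the explicit merge mainly demonstrates the merge step of merge sort.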

A Data Engineer was asked
Q. What are skewness and skewed tables?
Ans. 

Skewness is a measure of asymmetry in a distribution. Skewed tables are tables with imbalanced data distribution.

  • Skewness is a statistical measure that describes the asymmetry of the data distribution around the mean.

  • Positive skewness indicates a longer tail on the right side of the distribution, while negative skewness indicates a longer tail on the left side.

  • Skewed tables in data engineering refer to tables with...
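Both notions can be made concrete with a short Python sketch. skewness() computes the moment-based coefficient (third central moment divided by the 1.5 power of the variance), and key_share() is a hypothetical helper that reports how much of a table falls under its most frequent key, which is one simple way to spot an imbalanced (skewed) key.

```python
from collections import Counter


def skewness(values):
    """Moment-based skewness: m3 / m2**1.5 (population moments)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


def key_share(keys):
    """Fraction of rows that belong to the most frequent key."""
    counts = Counter(keys)
    return max(counts.values()) / len(keys)
```

A symmetric sample gives a skewness of 0, a long right tail gives a positive value, and a key_share close to 1 means one key dominates the table. The skewness function is undefined (division by zero) when all values are identical.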

HSBC Group Data Engineer Interview Experiences

6 interviews found

Interview experience
4
Good
Difficulty level
Hard
Process Duration
Less than 2 weeks
Result
Not Selected

I applied via Walk-in and was interviewed in Apr 2024. There were 3 interview rounds.

Round 1 - Technical 

(2 Questions)

  • Q1. What is lazy evaluation in Spark?
  • Ans. 

    Lazy evaluation in Spark delays the execution of transformations until an action is called.

    • Lazy evaluation allows Spark to optimize the execution plan, for example by pipelining narrow transformations such as map() and filter() into a single stage.

    • Transformations are not executed immediately, but are stored as a directed acyclic graph (DAG) of operations.

    • Actions trigger the execution of the DAG and produce results.

    • Example: map() and filter() are transformat...

  • Answered by AI
  • Q2. What is MapReduce?
  • Ans. 

    MapReduce is a programming model and processing technique for parallel and distributed computing.

    • MapReduce is used to process large datasets in parallel across a distributed cluster of computers.

    • It consists of two main functions: a Map function that turns input records into intermediate key/value pairs, and a Reduce function that aggregates the values grouped under each key.

    • Popularly used in big data processing frameworks like Hadoop for tasks like data sorting, searching...

  • Answered by AI
Round 2 - One-on-one 

(1 Question)

  • Q1. What are skewness and skewed tables?
  • Ans. 

    Skewness is a measure of asymmetry in a distribution. Skewed tables are tables with imbalanced data distribution.

    • Skewness is a statistical measure that describes the asymmetry of the data distribution around the mean.

    • Positive skewness indicates a longer tail on the right side of the distribution, while negative skewness indicates a longer tail on the left side.

    • Skewed tables in data engineering refer to tables with imba...

  • Answered by AI
Round 3 - One-on-one 

(1 Question)

  • Q1. What is Spark, and how does it work?
  • Ans. 

    Spark is a distributed computing framework designed for big data processing.

    • Spark is built around the concept of Resilient Distributed Datasets (RDDs) which allow for fault-tolerant parallel processing of data.

    • It provides high-level APIs in Java, Scala, Python, and R for ease of use.

    • Spark can run on Hadoop YARN, Mesos, Kubernetes, or in standalone mode.

    • It includes modules for SQL, streaming, machine learning, and gra...

  • Answered by AI

Interview Preparation Tips

Topics to prepare for HSBC Group Data Engineer interview:
  • Big Data


Data Engineer Interview Questions & Answers

Anonymous

posted on 22 Apr 2024

Interview experience
3
Average
Difficulty level
Moderate
Process Duration
2-4 weeks
Result
Not Selected

I applied via Naukri.com and was interviewed in Mar 2024. There was 1 interview round.

Round 1 - One-on-one 

(3 Questions)

  • Q1. About past experience
  • Q2. Python Coding question about list, dataframe etc
  • Q3. RDBMS Questions and SQL theory

Interview Preparation Tips

Interview preparation tips for other job seekers - Be confident

Data Engineer Interview Questions & Answers

Anonymous

posted on 31 Aug 2024

Interview experience
5
Excellent
Difficulty level
Easy
Process Duration
Less than 2 weeks
Result
Selected

I applied via Job Portal and was interviewed in Jul 2024. There was 1 interview round.

Round 1 - Aptitude Test 

English, aptitude test, reasoning

Interview Preparation Tips

Interview preparation tips for other job seekers - It was a very good interview process

Data Engineer Interview Questions & Answers

Anonymous

posted on 22 Feb 2024

Interview experience
4
Good
Difficulty level
-
Process Duration
-
Result
Not Selected
Round 1 - Online 

(2 Questions)

  • Q1. Behavioral test
  • Q2. Questions are repeated at random; selecting the same response each time, where possible, is how the test assesses behaviour
Round 2 - One-on-one 

(2 Questions)

  • Q1. Technical discussion
  • Q2. Project scenario based

Data Engineer Interview Questions & Answers

Anonymous

posted on 30 Sep 2022

I applied via Naukri.com and was interviewed in Apr 2022. There were 4 interview rounds.

Round 1 - Resume Shortlist 
Round 2 - Technical 

(6 Questions)

  • Q1. What is the Imputer function in PySpark?
  • Q2. How do you handle null values in PySpark?
  • Ans. 

    Null values in PySpark are handled using functions such as dropna(), fillna(), and coalesce().

    • dropna() drops rows that contain null values; its how, thresh, and subset parameters control which rows qualify

    • fillna() fills null values with a specified value, optionally per column by passing a dict

    • replace() substitutes one value for another (for example, turning a sentinel such as -1 into null); nulls themselves are filled with fillna()

    • coalesce() from pyspark.sql.functions returns the first non-null value from a list of columns

  • Answered by AI
  • Q3. Write an SQL query to get the second highest salary from each department
  • Ans. 

    SQL query to retrieve the second highest salary from each department

    • Use DENSE_RANK() OVER (PARTITION BY department ORDER BY salary DESC) to rank salaries within each department; RANK() skips rank 2 when the top salary is tied

    • Filter the results to only include rows with a rank of 2

    • No GROUP BY is needed, since PARTITION BY already scopes the ranking to each department; add DISTINCT if several employees share the second highest salary

  • Answered by AI
  • Q4. How do you delete duplicate rows from a table?
  • Ans. 

    DISTINCT and GROUP BY only return unique rows in a query result; to actually delete duplicates, use a DELETE with a subquery or rebuild the table.

    • Use the DISTINCT keyword to select unique rows from the table.

    • Use the GROUP BY clause on the columns that define a duplicate, with HAVING COUNT(*) > 1, to find the duplicated rows.

    • Use the DELETE statement with a subquery (for example, one that keeps the lowest row identifier in each group) to delete the duplicate rows.

    • Create a new table with the unique rows, drop the old table, and rename the new one.

  • Answered by AI
  • Q5. What are the window functions you have used?
  • Ans. 

    Window functions are used to perform calculations across a set of rows that are related to the current row.

    • Commonly used window functions include ROW_NUMBER, RANK, DENSE_RANK, LAG, LEAD, FIRST_VALUE, LAST_VALUE, and NTILE.

    • Window functions are used in conjunction with the OVER clause to define the window or set of rows to perform the calculation on.

    • Window functions can be used to calculate running totals, moving average...

  • Answered by AI
  • Q6. Merge two unsorted lists such that the output list is sorted. You are free to use inbuilt sorting functions to sort the input lists
  • Ans. 

    Merge two unsorted lists into a sorted list using inbuilt sorting functions.

    • Use inbuilt sorting functions to sort the input lists

    • Merge the sorted lists using a merge algorithm

    • Return the merged and sorted list

  • Answered by AI
Round 3 - Technical 

(1 Question)

  • Q1. 1. What is a UDF in Spark? 2. Write PySpark code to check the validity of the mobile_number column
  • Ans. 

    UDF stands for User-Defined Function in Spark. It allows users to define their own functions to process data.

    • UDFs can be written in different programming languages like Python, Scala, and Java.

    • UDFs can be used to perform complex operations on data that are not available in built-in functions.

    • PySpark code to check the validity of mobile_number column can be written using regular expressions and the `regexp_extract` func...

  • Answered by AI
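The validation logic for the mobile_number question can be sketched as a plain Python function. The rule used here (exactly 10 digits, starting with 6 to 9, as for Indian mobile numbers) is an assumption, since the question does not define what counts as valid, and is_valid_mobile is a hypothetical name.

```python
import re

# Assumed rule: exactly 10 digits, first digit between 6 and 9.
MOBILE_PATTERN = re.compile(r"[6-9][0-9]{9}")


def is_valid_mobile(value):
    """Return True if `value` looks like a valid mobile number."""
    if value is None:
        return False  # treat nulls as invalid rather than raising
    return MOBILE_PATTERN.fullmatch(str(value).strip()) is not None
```

In PySpark, a function like this could be registered with pyspark.sql.functions.udf and a BooleanType return type. For a simple pattern check, applying the same regular expression with the built-in Column.rlike method is usually preferable, because built-in expressions avoid the serialization overhead of Python UDFs.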
Round 4 - HR 

(1 Question)

  • Q1. 1. Salary discussion 2. Why do you want to join HSBC? 3. Tell me a challenge that you faced and how did you overcome it?

Interview Preparation Tips

Interview preparation tips for other job seekers - Be thorough with PySpark and SQL if you are applying for Data Engineer role.


Data Engineer Interview Questions & Answers

Anonymous

posted on 29 Feb 2024

Interview experience
4
Good
Difficulty level
Moderate
Process Duration
2-4 weeks
Result
-

I applied via Recruitment Consultant and was interviewed before Feb 2023. There were 3 interview rounds.

Round 1 - Aptitude Test 

The round was based on behavioural and cognitive assessment.

Round 2 - One-on-one 

(2 Questions)

  • Q1. Pure technical round based on your skills
  • Q2. Basic and detailed SQL questions on topics such as joins, indexes, performance tuning, and partitioning
Round 3 - HR 

(1 Question)

  • Q1. Asked to fill in a form with basic details and current and expected CTC

Interview Preparation Tips

Interview preparation tips for other job seekers - Prepare well for your relevant skillset.

Interview questions from similar companies

I applied via Campus Placement

Interview Preparation Tips

Round: Resume Shortlist
Experience: My resume had a couple of projects and a paper publication added to it, which was a part of my second year intern at TCS Innovation Labs.

Tips: I believe that the resume shortlist to appear for the test is just a formality and not a big deal to get through. Most of the students are allowed to appear for the test which is where, the main shortlisting occurs.
Although, the company does cut off people from certain departments. For example, none of the Agriculture and food engineering students were shortlisted for appearing for the test.
Also, from what I have heard, the company runs resumes through a program to take out the shortlist. Hence, certain key words such as "Scholar" or "Conference" may get you a good chance to get through the shortlisting.

Round: Test
Experience: The test is mainly quant based and how fast you can think and rationalize. They expect excellent speed; they ask you 20 questions to be answered in 30 minutes.
Tips: Normally, if a student is asked to appear for this right after his JEE examination, he would be able to get through it easily. But since our analytical thinking goes way down once we actually get in, it has to be worked upon again. I'd suggest you to solve all the quant questions of any CAT preparatory book for the American Express test. The test questions are quite similar to those that the CAT students prepare for.

Round: Technical Interview
Experience: In general, the American Express interviews are considered to be pretty peaceful compared to the interviews of other companies. I was just asked about my work at TCS Innovation Labs and the paper I would be publishing based on that work and the courses I have done in Data Analytics and what all do I know in it. They do not test the knowledge you say you possess.
Tips: While I was preparing for my interview, I was told to read about Amex and its products. Secondly, and most importantly, I was asked to read a lot about credit card fraud and to suggest ways of preventing it. Even during my time, a lot of people were asked about this and judged severely on it. This is a must if you are appearing for its interview.

Also, you would do well if you are able to connect with the interviewer. For instance, if both of you have been a part of the same inter IIT or have the same tastes, etc.

Also, the interviews are peaceful enough that you can direct them in your favour. There are certain parts of your CV that you can easily talk about and certain parts that you would want to avoid. If you are able to think on your feet, it is easy to redirect the interview towards the things that you would like to talk about; in my case, it was my work at TCS.

Round: Puzzle Interview
Experience: The main decision whether you have to be taken in or not is decided in your first round of interview after which, the puzzle interview is just a formality and to find out whether you are able to think on your feet or not. Not a big deal, pretty easy.
Tips: Nothing to be worried about; the puzzles asked are pretty easy, and you have most probably already heard them if you have a knack for solving puzzles.

General Tips: Do not worry or stress yourself much. The interview is pretty easy if you have an edge in the skills that the company wants. I had started studying Data Analytics in my second year summer vacations, just 1 hour a day for hardly about a month. Which gave me enough of an edge. The main hurdle lies in getting through the test round where about 300 people are simply cut off.
Skill Tips: You should be able to code efficiently. Knowledge in Python and SQL is a big plus.
Also, I had done a course on Coursera called "Machine Learning" of Stanford University. Today, the company does not expect a lot of knowledge in Data Analytics from students which is why this course gave me the edge I needed to get through the interview. It's a pretty easy course and not very demanding. I would recommend this if you want to find out even whether you'd be interested in Data Analytics or not.
I was asked about the things I know and whether I have ever implemented them.
Skills: Analytics And Coding
Duration: 2
College Name: IIT Kharagpur
Motivation: American Express is known to have a good working culture and is currently the third best company to work for in India. The internship stipend is largest compared to any other company in India along with accommodation and travel. The company has a well structured internship program and also has good challenging work to offer.
Funny Moments: After the test, I was not shortlisted for the interview round.. Which broke my heart.
Just after the presentation, I walked up to the person presenting and handed him a hard copy of my resume asking him to at least give me a shot at the interview if they think my profile is good enough.
I believe that this was a turning point of this entire experience. I believe, at that moment itself, I gained their favour to take me in. Which is probably why my interview was peaceful, which is probably why I could get through even though my interview was the last among a bunch of 25 students..

Interview Questionnaire 

1 Question

  • Q1. Asked many DBMS questions. You need to be thorough with joins in databases.

Interview Preparation Tips

Round: Test
Experience: Practice questions on Indiabix.

Round: Technical Interview
Experience: Operating Systems basics, DBMS basics

General Tips: 1: Be Confident.
2: Keep Smiling.
3: Do not panic.
4: Remember they are here to hire you, not to reject you.
5: Do not lose hope.
6: Believe in yourself.
7: Start preparing early.
8: Do demo interviews with your friend.
Skill Tips: Make sure you do a course on object oriented programming
Skills: C, Sql, Java
College Name: BITS PILANI
Motivation: Everybody joins a company to build software or to earn money, but I was always motivated to MAKE MONEY. I took this job because they do very extensive work in the share market and have lots of MBAs from the IIMs and lots of CAs and CSs. That's why I chose Edelweiss. I rejected Epic System USA for this profile.

Interview Questionnaire 

8 Questions

  • Q1. Find the number that occurs an odd number of times when all other numbers occur an even number of times
  • Q2. Some discussion about my minor project
  • Q3. Print the spiral order of a binary tree and of a matrix
  • Q4. Some questions about OS and DBMS
  • Q5. Find a pair with a given sum in a given array
  • Q6. Find the total number of sets of k elements that have a given average in a given array, in minimum time complexity
  • Q7. Print all elements that are not boundary elements in a given binary tree
  • Q8. Then some questions about process synchronisation and error vs. exception, followed by 2-3 HR questions

Interview Preparation Tips

Round: Test
Experience: practice codes on paper
Tips:

Round: Technical Interview
Experience: very good
Tips: please try to explain each and every question in detail

Round: Technical Interview
Experience: They did not take an HR round for me, though some candidates faced a third round, which was HR.
Tips: please prepare all types of problem from geeksforgeeks

Skill Tips: please try to understand every problem from geeksforgeeks
Skills: ds
College Name: NIT Bhopal
Motivation: The best work culture and lots of learning opportunities at this company, with an appraisal every 6 months.


I appeared for an interview in Oct 2016.

Interview Questionnaire 

4 Questions

  • Q1. Select any five companies you would invest in and why given the required metrics.
  • Ans. 

    I cannot provide investment advice, but here are five companies that have shown strong financial performance in recent years.

    • Apple - consistently high revenue and profit margins

    • Amazon - dominant player in e-commerce and cloud computing

    • Microsoft - strong growth in cloud computing and enterprise software

    • Alphabet (Google) - diversified revenue streams and strong advertising business

    • Visa - dominant player in the payments i...

  • Answered by AI
  • Q2. Questions based on case study.
  • Q3. Detailed discussions on the Indian economy
  • Q4. The Brexit vote and how it would affect the Indian economy
  • Ans. 

    The Brexit vote could have both positive and negative effects on the Indian economy.

    • Positive effects: Increased trade opportunities with the UK, potential for attracting foreign investments from companies relocating from the UK.

    • Negative effects: Uncertainty in global markets leading to volatility in exchange rates, potential decline in exports to the UK.

    • Example: Indian IT companies may face challenges due to stricter i...

  • Answered by AI

Interview Preparation Tips

Round: Case Study Interview
Experience: We were first taught the basics of the operations involved in working with an investment bank. Based on the workshop, we were given a case study with a list of companies and asked to select the ones that were potential candidates for investment. We were divided into groups of 5 students.

Round: Test
Experience: The test was basic. The economics aptitude section requires prior knowledge of economics concepts. The review topic was the GST bill.
Duration: 1 hour
Total Questions: 41

Round: Stress Interview
Tips: It is important to be aware about the current events and economies.

Skills: Economic Affairs, Financial Economics, Knowledge On Current Affairs
College Name: Thapar University, Patiala

HSBC Group Interview FAQs

How many rounds are there in HSBC Group Data Engineer interview?
HSBC Group interview process usually has 2-3 rounds. The most common rounds in the HSBC Group interview process are One-on-one Round, Technical and HR.
How to prepare for HSBC Group Data Engineer interview?
Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at HSBC Group. The most common topics and skills that interviewers at HSBC Group expect are Financial Services, Agile Coaching, Automation Testing, CCTV Monitoring and Clinical Data Management.
What are the top questions asked in HSBC Group Data Engineer interview?

Some of the top questions asked at the HSBC Group Data Engineer interview -

  1. 1. What is udf in Spark? 2. Write PySpark code to check the validity of mobile_...read more
  2. Merge two unsorted lists such that the output list is sorted. You are free to u...read more
  3. How to delete duplicate rows from a ta...read more


Overall Interview Experience Rating

4/5

based on 5 interview experiences

Difficulty level

Easy 25%
Moderate 50%
Hard 25%

Duration

Less than 2 weeks 50%
2-4 weeks 50%
HSBC Group Data Engineer Salary
based on 72 salaries
₹7.6 L/yr - ₹18.4 L/yr
12% more than the average Data Engineer Salary in India

HSBC Group Data Engineer Reviews and Ratings

based on 8 reviews

2.5/5

Rating in categories

  • Skill development: 1.8
  • Work-life balance: 2.7
  • Salary: 1.7
  • Job security: 2.9
  • Company culture: 2.4
  • Promotions: 1.7
  • Work satisfaction: 1.8

Salaries for other roles:

  • Assistant Manager (2.8k salaries): ₹6 L/yr - ₹13.5 L/yr
  • Manager (2.2k salaries): ₹13.9 L/yr - ₹24 L/yr
  • Senior Software Engineer (1.7k salaries): ₹13 L/yr - ₹23.7 L/yr
  • Assistant Vice President (1.7k salaries): ₹25 L/yr - ₹42.5 L/yr
  • Software Engineer (1.5k salaries): ₹7.8 L/yr - ₹14 L/yr
