
HSBC Group

4.0

based on 4.5k Reviews

Proud winner of ABECA 2024 - AmbitionBox Employee Choice Awards


HSBC Group Data Engineer Interview Questions, Process, and Tips

Updated 31 Aug 2024

Top HSBC Group Data Engineer Interview Questions and Answers

  • Q1. 1. What is a UDF in Spark? 2. Write PySpark code to check the validity of the mobile_number column
  • Q2. Merge two unsorted lists such that the output list is sorted. You are free to use inbuilt sorting functions to sort the input lists
  • Q3. SQL query for getting 2nd highest salary from each department

HSBC Group Data Engineer Interview Experiences

6 interviews found

Interview experience: 4 (Good)
Difficulty level: Hard
Process Duration: Less than 2 weeks
Result: Not Selected

I applied via Walk-in and was interviewed in Apr 2024. There were 3 interview rounds.

Round 1 - Technical 

(2 Questions)

  • Q1. What is lazy evaluation in Spark?
  • Ans. 

    Lazy evaluation in Spark delays the execution of transformations until an action is called.

    • Lazy evaluation allows Spark to optimize the execution plan by combining multiple transformations into a single stage.

    • Transformations are not executed immediately, but are stored as a directed acyclic graph (DAG) of operations.

    • Actions trigger the execution of the DAG and produce results.

    • Example: map() and filter() are transformat...

  • Answered by AI
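The idea of recording transformations and running them only when an action is called can be illustrated without a cluster. The sketch below is a plain-Python analogy, not Spark code: the `LazyPipeline` class and its `calls` counter are hypothetical names invented for this illustration, and generators stand in for Spark's DAG of transformations.

```python
from typing import Callable, Iterable, Iterator, List


class LazyPipeline:
    """Toy stand-in for a lazily evaluated dataset (hypothetical, not a Spark API)."""

    def __init__(self, source: Iterable[int]) -> None:
        self._source = source
        self._steps: List[Callable[[Iterator[int]], Iterator[int]]] = []
        self.calls = 0  # number of elements the map step has actually processed

    def map(self, fn: Callable[[int], int]) -> "LazyPipeline":
        # Transformation: only recorded here, nothing is computed yet.
        def step(items: Iterator[int]) -> Iterator[int]:
            for item in items:
                self.calls += 1
                yield fn(item)

        self._steps.append(step)
        return self

    def filter(self, predicate: Callable[[int], bool]) -> "LazyPipeline":
        # Transformation: also only recorded.
        def step(items: Iterator[int]) -> Iterator[int]:
            return (item for item in items if predicate(item))

        self._steps.append(step)
        return self

    def collect(self) -> List[int]:
        # Action: chains the recorded steps and finally runs them.
        items: Iterator[int] = iter(self._source)
        for step in self._steps:
            items = step(items)
        return list(items)


pipeline = LazyPipeline(range(5)).map(lambda x: x * 10).filter(lambda x: x >= 20)
calls_before_action = pipeline.calls  # no work has been done yet
result = pipeline.collect()           # the action triggers the whole chain
```

Here `calls_before_action` stays at zero because `map` and `filter` only record work; the elements are touched once `collect()` runs.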
  • Q2. What is MapReduce?
  • Ans. 

    MapReduce is a programming model and processing technique for parallel and distributed computing.

    • MapReduce is used to process large datasets in parallel across a distributed cluster of computers.

    • It consists of two main functions - Map function for processing key/value pairs and Reduce function for aggregating the results.

    • Popularly used in big data processing frameworks like Hadoop for tasks like data sorting, searching...

  • Answered by AI
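The Map and Reduce functions described above can be sketched with a single-process word count. This is only an illustration of the programming model in plain Python, not Hadoop code; the function names `map_phase`, `shuffle_phase`, and `reduce_phase` are chosen for this example, and a real framework would run each phase in parallel across machines.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def map_phase(line: str) -> List[Tuple[str, int]]:
    # Map: emit one (key, value) pair per word.
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    # Shuffle: group all values that share a key.
    grouped: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(grouped)


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    # Reduce: aggregate the values for a single key.
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    mapped = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


counts = word_count(["spark and hadoop", "Spark and SQL"])
```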
Round 2 - One-on-one 

(1 Question)

  • Q1. What is skewness, and what are skewed tables?
  • Ans. 

    Skewness is a measure of asymmetry in a distribution. Skewed tables are tables with imbalanced data distribution.

    • Skewness is a statistical measure that describes the asymmetry of the data distribution around the mean.

    • Positive skewness indicates a longer tail on the right side of the distribution, while negative skewness indicates a longer tail on the left side.

    • Skewed tables in data engineering refer to tables with imba...

  • Answered by AI
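Both senses of skew mentioned above can be measured with a few lines of Python. The sketch below is illustrative only: `sample_skewness` computes the moment-based skewness coefficient (third central moment divided by the 1.5 power of the second), and `key_skew_ratio` is a hypothetical helper, assumed here as one simple way to quantify how imbalanced a table's key distribution is.

```python
from collections import Counter
from typing import Hashable, Sequence


def sample_skewness(values: Sequence[float]) -> float:
    # Moment-based skewness: positive for a long right tail, negative for a long left tail.
    n = len(values)
    mean = sum(values) / n
    second_moment = sum((v - mean) ** 2 for v in values) / n
    third_moment = sum((v - mean) ** 3 for v in values) / n
    return third_moment / second_moment ** 1.5


def key_skew_ratio(keys: Sequence[Hashable]) -> float:
    # Rows held by the most frequent key, relative to the average rows per key.
    # A value near 1 means balanced keys; a large value means a skewed key.
    counts = Counter(keys)
    average_rows_per_key = len(keys) / len(counts)
    return max(counts.values()) / average_rows_per_key


right_tailed = [1, 1, 1, 2, 2, 3, 10]
symmetric = [1, 2, 3, 4, 5]
country_keys = ["IN"] * 8 + ["UK", "US"]

right_skew = sample_skewness(right_tailed)
no_skew = sample_skewness(symmetric)
ratio = key_skew_ratio(country_keys)
```

In a distributed join or aggregation, a key with a high ratio tends to overload the one task that receives it, which is why skewed tables matter in data engineering.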
Round 3 - One-on-one 

(1 Question)

  • Q1. What is Spark, and how does it work?
  • Ans. 

    Spark is a distributed computing framework designed for big data processing.

    • Spark is built around the concept of Resilient Distributed Datasets (RDDs) which allow for fault-tolerant parallel processing of data.

    • It provides high-level APIs in Java, Scala, Python, and R for ease of use.

    • Spark can run on top of Hadoop, Mesos, Kubernetes, or in standalone mode.

    • It includes modules for SQL, streaming, machine learning, and gra...

  • Answered by AI

Interview Preparation Tips

Topics to prepare for HSBC Group Data Engineer interview:
  • Big Data

Data Engineer Interview Questions & Answers

Anonymous

posted on 22 Apr 2024

Interview experience: 3 (Average)
Difficulty level: Moderate
Process Duration: 2-4 weeks
Result: Not Selected

I applied via Naukri.com and was interviewed in Mar 2024. There was 1 interview round.

Round 1 - One-on-one 

(3 Questions)

  • Q1. About past experience
  • Q2. Python coding questions on lists, DataFrames, etc.
  • Q3. RDBMS Questions and SQL theory

Interview Preparation Tips

Interview preparation tips for other job seekers - Be confident

Data Engineer Interview Questions & Answers

Anonymous

posted on 31 Aug 2024

Interview experience: 5 (Excellent)
Difficulty level: Easy
Process Duration: Less than 2 weeks
Result: Selected

I applied via Job Portal and was interviewed in Jul 2024. There was 1 interview round.

Round 1 - Aptitude Test 

English, aptitude test, reasoning

Interview Preparation Tips

Interview preparation tips for other job seekers - It was a very good interview process

Data Engineer Interview Questions & Answers

Anonymous

posted on 22 Feb 2024

Interview experience: 4 (Good)
Difficulty level: -
Process Duration: -
Result: Not Selected

Round 1 - Online 

(2 Questions)

  • Q1. Behavioral test
  • Q2. Questions are repeated in random order, and candidates are expected to select the same response each time; this is used to assess behaviour
Round 2 - One-on-one 

(2 Questions)

  • Q1. Technical discussion
  • Q2. Project scenario based

Data Engineer Interview Questions & Answers

Anonymous

posted on 30 Sep 2022

I applied via Naukri.com and was interviewed in Apr 2022. There were 4 interview rounds.

Round 1 - Resume Shortlist 
Pro Tip by AmbitionBox:
Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, so make sure to leave the best impression.
Round 2 - Technical 

(6 Questions)

  • Q1. What is the Imputer function in PySpark?
  • Ans. 

    Imputer function in PySpark is used to replace missing values in a DataFrame.

    • Imputer is a transformer in PySpark ML library.

    • It replaces missing values in a DataFrame with either mean, median, or mode of the column.

    • It works on numeric columns only; categorical columns are not supported.

    • Example: imputer = Imputer(inputCols=['col1', 'col2'], outputCols=['col1_imputed', 'col2_imputed'], strategy='mean')

    • Example: imputed_df = imputer.fit

  • Answered by AI
  • Q2. How do you handle null values in PySpark?
  • Ans. 

    Null values in PySpark are handled using DataFrame methods such as dropna() and fillna(), and the coalesce() column function.

    • dropna() drops rows that contain null values; the how, thresh, and subset parameters control which rows are dropped

    • fillna() fills null values with a specified value, optionally restricted to a subset of columns

    • replace() substitutes one specified value for another (for example, a placeholder string such as 'N/A'); null values themselves are normally filled with fillna()

    • coalesce() returns the first non-null value from a list of columns

  • Answered by AI
  • Q3. SQL query for getting 2nd highest salary from each department
  • Ans. 

    SQL query to retrieve the second highest salary from each department

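    • Use DENSE_RANK() to rank salaries within each department (PARTITION BY department ORDER BY salary DESC); RANK() skips rank 2 when two employees tie for the highest salary

    • Filter the results to only include rows with a rank of 2

    • No GROUP BY is needed, because PARTITION BY already ranks within each department; use DISTINCT if several employees share the second highest salary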

  • Answered by AI
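The ranking approach above can be tried out with Python's built-in sqlite3 module. This is a minimal sketch under stated assumptions: the `employees` table, its columns, and the sample rows are invented for illustration, and the bundled SQLite is assumed to be version 3.25 or later, which is when window functions were added. It uses DENSE_RANK() rather than RANK() so that a tie for the top salary does not leave rank 2 empty.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical schema and data, for illustration only.
conn.execute("CREATE TABLE employees (name TEXT, department TEXT, salary INTEGER)")
conn.executemany(
    "INSERT INTO employees VALUES (?, ?, ?)",
    [
        ("Asha", "Data", 120),
        ("Ben", "Data", 120),   # ties with Asha for the highest salary
        ("Chen", "Data", 100),
        ("Dev", "Data", 90),
        ("Eli", "Risk", 80),
        ("Fay", "Risk", 70),
    ],
)

query = """
SELECT DISTINCT department, salary
FROM (
    SELECT department,
           salary,
           DENSE_RANK() OVER (
               PARTITION BY department ORDER BY salary DESC
           ) AS salary_rank
    FROM employees
) AS ranked
WHERE salary_rank = 2
ORDER BY department
"""
second_highest = conn.execute(query).fetchall()
```

With RANK() in place of DENSE_RANK(), the Data department would get ranks 1, 1, 3, 4 and drop out of the result entirely.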
  • Q4. How to delete duplicate rows from a table
  • Ans. 

    To delete duplicate rows from a table, identify one row to keep per duplicate group and delete the rest. DISTINCT and GROUP BY only help find duplicates; on their own they do not delete anything.

    • Use SELECT DISTINCT to read the unique rows from the table.

    • Use GROUP BY with HAVING COUNT(*) > 1 to find the column values that occur more than once.

    • Use the DELETE statement with a subquery (or ROW_NUMBER() in a CTE) to delete all but one row from each duplicate group.

    • Alternatively, create a new table with the unique rows and drop the old table.

  • Answered by AI
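The DELETE-with-subquery option can be sketched with Python's built-in sqlite3 module. The `customers` table, its columns, and the sample rows are assumptions made for this example; the query keeps the row with the lowest id in each group of identical (name, email) values and deletes the others.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical table with a surrogate key so duplicates can be told apart.
conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, email TEXT)")
conn.executemany(
    "INSERT INTO customers (name, email) VALUES (?, ?)",
    [
        ("Asha", "asha@example.com"),
        ("Asha", "asha@example.com"),  # duplicate
        ("Ben", "ben@example.com"),
        ("Asha", "asha@example.com"),  # duplicate
    ],
)

# Keep the lowest id per (name, email) group and delete everything else.
conn.execute(
    """
    DELETE FROM customers
    WHERE id NOT IN (
        SELECT MIN(id) FROM customers GROUP BY name, email
    )
    """
)
remaining = conn.execute("SELECT id, name, email FROM customers ORDER BY id").fetchall()
```

On engines where a table has no unique key, the same idea is usually expressed with ROW_NUMBER() partitioned by the duplicate columns, deleting rows numbered above 1.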
  • Q5. What are the window functions you have used?
  • Ans. 

    Window functions are used to perform calculations across a set of rows that are related to the current row.

    • Commonly used window functions include ROW_NUMBER, RANK, DENSE_RANK, LAG, LEAD, FIRST_VALUE, LAST_VALUE, and NTILE.

    • Window functions are used in conjunction with the OVER clause to define the window or set of rows to perform the calculation on.

    • Window functions can be used to calculate running totals, moving average...

  • Answered by AI
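A few of the window functions named above can be seen side by side in a small sqlite3 session. This is a sketch only: the `sales` table and its rows are invented for illustration, and SQLite 3.25 or later is assumed for window-function support. Each function is paired with its own OVER clause, which defines the rows it looks at.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical daily sales figures.
conn.execute("CREATE TABLE sales (day INTEGER, amount INTEGER)")
conn.executemany("INSERT INTO sales VALUES (?, ?)", [(1, 10), (2, 20), (3, 5)])

rows = conn.execute(
    """
    SELECT day,
           amount,
           SUM(amount)  OVER (ORDER BY day)         AS running_total,
           LAG(amount)  OVER (ORDER BY day)         AS previous_amount,
           ROW_NUMBER() OVER (ORDER BY amount DESC) AS amount_rank
    FROM sales
    ORDER BY day
    """
).fetchall()
```

Unlike GROUP BY, the window functions keep one output row per input row, so the running total and the previous day's amount appear next to the original values.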
  • Q6. Merge two unsorted lists such that the output list is sorted. You are free to use inbuilt sorting functions to sort the input lists
  • Ans. 

    Merge two unsorted lists into a sorted list using inbuilt sorting functions.

    • Use inbuilt sorting functions to sort the input lists

    • Merge the sorted lists using a merge algorithm

    • Return the merged and sorted list

  • Answered by AI
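The three steps above translate almost directly into Python. In this sketch the function name `merge_sorted_output` is chosen for illustration; it sorts each input with the built-in `sorted()` and then merges the two sorted sequences with `heapq.merge` from the standard library.

```python
import heapq
from typing import List


def merge_sorted_output(first: List[int], second: List[int]) -> List[int]:
    # Sort each input with the built-in sort, then merge the two sorted
    # sequences in a single linear pass.
    return list(heapq.merge(sorted(first), sorted(second)))


merged = merge_sorted_output([5, 1, 9], [4, 8, 2])
```

Since built-in sorting is allowed, `sorted(first + second)` gives the same result in one line; the explicit merge step is shown because interviewers often ask for it as a follow-up.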
Round 3 - Technical 

(1 Question)

  • Q1. 1. What is a UDF in Spark? 2. Write PySpark code to check the validity of the mobile_number column
  • Ans. 

    UDF stands for User-Defined Function in Spark. It allows users to define their own functions to process data.

    • UDFs can be written in different programming languages like Python, Scala, and Java.

    • UDFs can be used to perform complex operations on data that are not available in built-in functions.

    • PySpark code to check the validity of mobile_number column can be written using regular expressions and the `regexp_extract` func...

  • Answered by AI
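The validation logic itself can be sketched as an ordinary Python function, which is the kind of function a Python UDF would wrap. The validity rule is an assumption made for this example, since the question does not define one: a number counts as valid if it is exactly 10 digits and starts with 6, 7, 8, or 9. The names `MOBILE_PATTERN` and `is_valid_mobile` are likewise hypothetical.

```python
import re
from typing import Optional

# Assumed rule (not stated in the question): exactly 10 digits, first digit 6-9.
MOBILE_PATTERN = re.compile(r"[6-9][0-9]{9}")


def is_valid_mobile(value: Optional[str]) -> bool:
    # Null-safe check; surrounding whitespace is ignored.
    if value is None:
        return False
    return MOBILE_PATTERN.fullmatch(value.strip()) is not None


sample_numbers = ["9876543210", "1234567890", "98765", None]
validity = [is_valid_mobile(number) for number in sample_numbers]
```

In PySpark, a function like this could be wrapped with `pyspark.sql.functions.udf` and a `BooleanType` return type, although applying the same pattern with the built-in `Column.rlike` is usually preferable because it avoids Python UDF overhead.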
Round 4 - HR 

(1 Question)

  • Q1. 1. Salary discussion 2. Why do you want to join HSBC? 3. Tell me a challenge that you faced and how did you overcome it?

Interview Preparation Tips

Interview preparation tips for other job seekers - Be thorough with PySpark and SQL if you are applying for a Data Engineer role.

Data Engineer Interview Questions & Answers

Anonymous

posted on 29 Feb 2024

Interview experience: 4 (Good)
Difficulty level: Moderate
Process Duration: 2-4 weeks
Result: -

I applied via Recruitment Consultant and was interviewed before Mar 2023. There were 3 interview rounds.

Round 1 - Aptitude Test 

This round was a behavioural and cognitive assessment

Round 2 - One-on-one 

(2 Questions)

  • Q1. Pure technical round based on your skills
  • Q2. SQL questions at basic and detailed levels, such as joins, indexes, performance tuning, partitioning, etc.
Round 3 - HR 

(1 Question)

  • Q1. Asked to fill in a form with basic details and current and expected CTC

Interview Preparation Tips

Interview preparation tips for other job seekers - Prepare well for your relevant skillset.

Interview questions from similar companies

Interview experience: 4 (Good)
Difficulty level: Moderate
Process Duration: 2-4 weeks
Result: Selected

I applied via Naukri.com and was interviewed in Mar 2024. There were 3 interview rounds.

Round 1 - Technical 

(1 Question)

  • Q1. Explain error handling in PySpark
  • Ans. 

    Error handling in PySpark involves using try-except blocks and logging to handle exceptions and errors.

    • Use try-except blocks to catch and handle exceptions in PySpark code

    • Utilize logging to record errors and exceptions for debugging purposes

    • Consider setting the reader option .option('mode', 'PERMISSIVE') to keep corrupt records instead of failing the whole read

  • Answered by AI
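The combination of try-except, logging, and permissive handling of corrupt records can be illustrated in plain Python. This is an analogy rather than PySpark code: `parse_permissive` is a hypothetical helper that parses JSON lines one by one, and the `_corrupt_record` field name mirrors the default column Spark uses for malformed input in PERMISSIVE mode.

```python
import json
import logging
from typing import Any, Dict, Iterable, List

logger = logging.getLogger("ingest")


def parse_permissive(lines: Iterable[str]) -> List[Dict[str, Any]]:
    # Keep going on bad input: log the problem and retain the raw line
    # instead of failing the whole batch.
    records: List[Dict[str, Any]] = []
    for line in lines:
        try:
            parsed = json.loads(line)
            if not isinstance(parsed, dict):
                raise ValueError("expected a JSON object")
            records.append({**parsed, "_corrupt_record": None})
        except ValueError as exc:  # json.JSONDecodeError is a ValueError subclass
            logger.warning("Skipping malformed record %r: %s", line, exc)
            records.append({"_corrupt_record": line})
    return records


parsed_records = parse_permissive(['{"id": 1}', '{"id": '])
```

Rows with a non-null `_corrupt_record` can then be routed to a quarantine location for inspection while the clean rows continue through the pipeline.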
Round 2 - Technical 

(1 Question)

  • Q1. Data Warehousing related questions
Round 3 - Behavioral 

(1 Question)

  • Q1. Data Modelling related questions

Interview experience: 3 (Average)
Difficulty level: -
Process Duration: -
Result: -

Round 1 - One-on-one 

(2 Questions)

  • Q1. Advanced SQL on ctl
  • Q2. Python data structure

Interview Preparation Tips

Interview preparation tips for other job seekers - Interview went well

Interview experience: 4 (Good)
Difficulty level: Moderate
Process Duration: 2-4 weeks
Result: No response

I applied via LinkedIn and was interviewed in Mar 2024. There were 2 interview rounds.

Round 1 - Coding Test 

Coding questions on SQL, Python, and Spark

Round 2 - Technical 

(2 Questions)

  • Q1. HackerRank CodePair
  • Ans. 

    Implement a function to pair elements of an array based on a given sum.

    • Iterate through the array and check if the current element plus any other element equals the given sum.

    • Use a hash set to store elements already visited to avoid duplicate pairs.

    • Return an array of arrays containing the pairs that sum up to the given value.

  • Answered by AI
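The hash-set approach described above can be sketched as follows. The function name `find_pairs` and the choice to return sorted tuples rather than nested arrays are assumptions made for this example. Each element is checked against the set of values already seen, so the array is scanned only once.

```python
from typing import List, Set, Tuple


def find_pairs(numbers: List[int], target: int) -> List[Tuple[int, int]]:
    seen: Set[int] = set()               # values visited so far
    pairs: Set[Tuple[int, int]] = set()  # a set avoids reporting a pair twice
    for number in numbers:
        complement = target - number
        if complement in seen:
            pairs.add((min(number, complement), max(number, complement)))
        seen.add(number)
    return sorted(pairs)


pairs_for_ten = find_pairs([1, 5, 3, 7, 5, 9, 1], 10)
```

Because a value is added to `seen` only after its complement check, a number pairs with itself only when it occurs at least twice in the input.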
  • Q2. PySpark questions

Interview experience: 4 (Good)
Difficulty level: -
Process Duration: -
Result: -

Round 1 - HR 

(2 Questions)

  • Q1. What salary do you expect for this position?
  • Q2. Why did you choose this position and not another?

HSBC Group Interview FAQs

How many rounds are there in HSBC Group Data Engineer interview?
The HSBC Group interview process usually has 2-3 rounds. The most common rounds are One-on-one, Technical, and HR.
How to prepare for HSBC Group Data Engineer interview?
Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at HSBC Group. The most common topics and skills that interviewers at HSBC Group expect are Automation Testing, CCTV Monitoring, Clinical Data Management, Data Quality and Datastage.
HSBC Group Data Engineer Interview Process

based on 5 interviews in last 1 year

Interview experience: 4 (Good)

People are getting interviews through

based on 5 HSBC Group interviews
Job Portal: 60%
Walk-in: 20%
Other sources: 20%
Moderate Confidence: the data is based on a sufficient number of responses received from the candidates
HSBC Group Data Engineer Salary
based on 54 salaries
₹5.2 L/yr - ₹19.1 L/yr
At par with the average Data Engineer Salary in India

HSBC Group Data Engineer Reviews and Ratings

based on 7 reviews

3.4/5

Rating in categories

Skill development: 2.6
Work-Life balance: 4.5
Salary & Benefits: 2.4
Job Security: 4.9
Company culture: 3.8
Promotions/Appraisal: 2.4
Work Satisfaction: 2.6

Assistant Manager (2.6k salaries): ₹4.5 L/yr - ₹14.6 L/yr
Manager (2k salaries): ₹8 L/yr - ₹28.1 L/yr
Senior Software Engineer (1.6k salaries): ₹7.2 L/yr - ₹26 L/yr
Assistant Vice President (1.4k salaries): ₹12.5 L/yr - ₹45 L/yr
Software Engineer (1.3k salaries): ₹4.9 L/yr - ₹14.4 L/yr

Compare HSBC Group with

Standard Chartered: 3.8
Citibank: 5.0
ICICI Bank: 4.0
Axis Bank: 3.8
