Dunnhumby

Dunnhumby Data Science Engineer Interview Questions, Process, and Tips

Updated 11 Jun 2024

Dunnhumby Data Science Engineer Interview Experiences

1 interview found

Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Technical 

(2 Questions)

  • Q1. What is an RDD, and how is it different from DataFrames and Datasets?
  • Ans. 

    RDD stands for Resilient Distributed Dataset and is the fundamental data structure of Apache Spark.

    • RDD is a distributed collection of objects that can be operated on in parallel.

    • DataFrames and Datasets are higher-level abstractions built on top of RDDs.

    • RDDs are more low-level and offer more control over data processing compared to DataFrames and Datasets.

  • Answered by AI
  • Q2. What is partitioning, and how are coalesce and repartition used?
  • Ans. 

    Partitioning is the process of dividing data into smaller chunks for better organization and processing in distributed systems.

    • Partitioning helps in distributing data across multiple nodes for parallel processing.

    • Coalesce reduces the number of partitions while avoiding a full shuffle, whereas repartition can either increase or decrease the number of partitions and always performs a full shuffle.

    • Example: coalesce(5) will merge partitions into 5 pa...

  • Answered by AI
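
The contrast between merging whole partitions and redistributing every record can be illustrated with a toy model in plain Python. This is only a sketch and not Spark's API or implementation: partitions are modelled as lists, and the helper names `coalesce` and `repartition` are hypothetical stand-ins that mimic the behaviour described in the answer.

```python
from typing import List

Partitions = List[List[int]]


def coalesce(parts: Partitions, n: int) -> Partitions:
    """Toy coalesce: merge whole partitions into at most n groups.

    Records stay together with the rest of their original partition,
    which mimics avoiding a full shuffle. The count cannot increase.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    n = min(n, len(parts))
    merged: Partitions = [[] for _ in range(n)]
    for i, part in enumerate(parts):
        # Contiguous input partitions land in the same output partition.
        merged[i * n // len(parts)].extend(part)
    return merged


def repartition(parts: Partitions, n: int) -> Partitions:
    """Toy repartition: redistribute every record round-robin (a full shuffle)."""
    if n < 1:
        raise ValueError("n must be at least 1")
    out: Partitions = [[] for _ in range(n)]
    records = [record for part in parts for record in part]
    for i, record in enumerate(records):
        out[i % n].append(record)
    return out


# Hypothetical skewed input: one large partition and three small ones.
parts = [[1, 2, 3, 4, 5, 6], [7], [8], [9, 10]]
print(coalesce(parts, 2))          # partitions merged as-is, sizes stay skewed
print(repartition(parts, 2))       # records spread evenly across partitions
print(len(repartition(parts, 8)))  # the partition count can also go up
```

In this toy model, coalescing keeps the skew of the input because it never moves individual records, while repartitioning pays for touching every record but produces evenly sized partitions.
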
Round 2 - Technical 

(2 Questions)

  • Q1. What is Spark? Explain its architecture.
  • Ans. 

    Spark is a distributed computing framework that provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.

    • Spark has a master-slave architecture with a driver program that communicates with a cluster manager to distribute work across worker nodes.

    • It uses Resilient Distributed Datasets (RDDs) for fault-tolerant distributed data processing.

    • Spark supports various programming l...

  • Answered by AI
  • Q2. What is a DAG? How does a Spark job work, and how does the DAG get created?
  • Ans. 

    DAG stands for Directed Acyclic Graph. It is a finite directed graph with no cycles.

    • DAG is a collection of nodes connected by edges where each edge goes from one node to another, but no cycles are allowed.

    • In the context of Spark, a DAG represents the sequence of transformations that need to be applied to the input data to get the final output.

    • When a Spark job is submitted, Spark creates a DAG of the transformations spe...

  • Answered by AI
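
The idea of a DAG of transformations can be sketched with the standard library's `graphlib` module (Python 3.9+). This is a toy illustration, not Spark's scheduler: the step names in `lineage` are hypothetical, and each step simply lists the steps it depends on.

```python
from graphlib import CycleError, TopologicalSorter

# Hypothetical lineage: each step maps to the set of steps it depends on.
lineage = {
    "read_orders": set(),
    "read_customers": set(),
    "filter_orders": {"read_orders"},
    "join": {"filter_orders", "read_customers"},
    "aggregate": {"join"},
    "write_output": {"aggregate"},
}

# A topological order is one valid execution order: every step appears
# after all of the steps it depends on.
order = list(TopologicalSorter(lineage).static_order())
print(order)


def is_dag(graph):
    """Return True if the dependency graph has no cycles."""
    try:
        list(TopologicalSorter(graph).static_order())
        return True
    except CycleError:
        return False


print(is_dag(lineage))
print(is_dag({"a": {"b"}, "b": {"a"}}))  # a depends on b and b on a: a cycle
```

Because the graph has no cycles, a valid execution order always exists; independent steps such as the two reads may appear in either order.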

Interview questions from similar companies

Interview experience
4
Good
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Not Selected

I was interviewed in Dec 2024.

Round 1 - One-on-one 

(4 Questions)

  • Q1. What's the CI/CD workflow that's followed in your team?
  • Ans. 

    Our team follows a CI/CD workflow that includes automated testing, code reviews, and continuous integration.

    • Automated testing is run on every code change to catch bugs early.

    • Code reviews are conducted before merging changes to ensure code quality.

    • Continuous integration is used to automatically build and test code changes in a shared repository.

    • Deployment pipelines are set up to automate the release process.

    • Version cont...

  • Answered by AI
  • Q2. Has there been any security incident, and how did you handle it?
  • Ans. 

    Yes, there have been security incidents and I have handled them effectively.

    • Implemented security protocols to prevent future incidents

    • Conducted thorough investigation to identify the root cause

    • Collaborated with IT team to strengthen security measures

    • Communicated with stakeholders to ensure transparency and trust

    • Provided training to employees on cybersecurity best practices

  • Answered by AI
  • Q3. What is the difference between authorization and authentication?
  • Ans. 

    Authentication verifies the identity of a user, while authorization determines what actions a user is allowed to perform.

    • Authentication confirms the identity of a user through credentials like passwords or biometrics.

    • Authorization determines the level of access or permissions a user has once their identity is confirmed.

    • Authentication is the process of logging in, while authorization is the process of granting or denyin...

  • Answered by AI
  • Q4. LLD for an authentication and authorization system
  • Ans. 

    LLD for an authentication and authorization system

    • Separate modules for authentication and authorization

    • Use of secure hashing algorithms for storing passwords

    • Role-based access control implementation

    • Audit logging for tracking user actions

    • Integration with external identity providers

  • Answered by AI
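
A minimal sketch of how these pieces could fit together, using only the Python standard library. Everything here is an illustrative assumption rather than a reference design: the in-memory `USERS` store, the role names, the function names, and the PBKDF2 iteration count are all hypothetical, and audit logging and external identity providers are left out.

```python
import hashlib
import hmac
import os

# Hypothetical in-memory stores; a real system would use a database.
USERS = {}  # username -> (salt, password_hash, role)
ROLE_PERMISSIONS = {
    "admin": {"read", "write", "delete"},
    "viewer": {"read"},
}


def _hash_password(password: str, salt: bytes) -> bytes:
    # Salted PBKDF2-HMAC-SHA256; the iteration count is an assumption.
    return hashlib.pbkdf2_hmac("sha256", password.encode("utf-8"), salt, 200_000)


def register(username: str, password: str, role: str) -> None:
    """Store a salted hash of the password, never the password itself."""
    salt = os.urandom(16)
    USERS[username] = (salt, _hash_password(password, salt), role)


def authenticate(username: str, password: str) -> bool:
    """Authentication: is this user who they claim to be?"""
    record = USERS.get(username)
    if record is None:
        return False
    salt, stored_hash, _role = record
    # Constant-time comparison avoids leaking information through timing.
    return hmac.compare_digest(stored_hash, _hash_password(password, salt))


def authorize(username: str, action: str) -> bool:
    """Authorization: is this user's role allowed to perform the action?"""
    record = USERS.get(username)
    if record is None:
        return False
    return action in ROLE_PERMISSIONS.get(record[2], set())


register("alice", "correct horse", "viewer")
print(authenticate("alice", "correct horse"))  # True
print(authenticate("alice", "wrong password"))  # False
print(authorize("alice", "read"))               # True
print(authorize("alice", "delete"))             # False
```

Keeping `authenticate` and `authorize` as separate functions mirrors the separate modules in the list above: the first checks identity, the second checks role-based permissions.
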
Round 2 - One-on-one 

(1 Question)

  • Q1. Design a managed document storage system like Google Drive as an end-to-end (E2E) solution.
  • Ans. 

    A managed document storage system like Google Drive, designed as an E2E solution, would cover the following:

    • Implement user authentication and authorization for secure access.

    • Create a user-friendly interface for uploading, organizing, and sharing documents.

    • Include features like version control, file syncing, and search functionality.

    • Utilize cloud storage for scalability and accessibility.

    • Implement encryption for data security.

    • Integrate with third-party app

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Prepare well on the points mentioned in your resume.
Be ready to provide reasons for decisions taken for your resume.
Interview experience
3
Average
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
No response

I applied via Naukri.com and was interviewed in Dec 2024. There was 1 interview round.

Round 1 - Technical 

(2 Questions)

  • Q1. Explain any ML model.
  • Q2. Create a DataFrame from two lists.
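
One way to approach the second question, assuming it refers to a pandas DataFrame (pandas is listed in the preparation topics below). The sample lists and column names are hypothetical.

```python
import pandas as pd

# Hypothetical sample data: two lists of equal length.
names = ["Asha", "Ben", "Chen"]
ages = [29, 34, 41]

# Option 1: each list becomes a column.
df = pd.DataFrame({"name": names, "age": ages})

# Option 2: zip the lists into rows and name the columns explicitly.
df_rows = pd.DataFrame(list(zip(names, ages)), columns=["name", "age"])

print(df)
```

Both options give the same table. The dictionary form is usually easier to read when each list is already a column.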

Interview Preparation Tips

Topics to prepare for Nielsen Data Scientist interview:
  • Python
  • pandas
  • ML
Interview experience
2
Poor
Difficulty level
-
Process Duration
-
Result
-

I applied via campus placement at National Institute of Technology (NIT), Warangal

Round 1 - Aptitude Test 

1 hour aptitude test

Round 2 - One-on-one 

(1 Question)

  • Q1. What is one-hot encoding?
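
As a short illustration of the concept asked about here, one-hot encoding turns each distinct category into its own 0/1 column. The sketch below uses plain Python; the function name and sample data are hypothetical.

```python
def one_hot_encode(values):
    """Return (categories, rows) with one 0/1 column per distinct category."""
    categories = sorted(set(values))
    index = {category: i for i, category in enumerate(categories)}
    rows = []
    for value in values:
        row = [0] * len(categories)
        row[index[value]] = 1  # exactly one "hot" position per row
        rows.append(row)
    return categories, rows


colors = ["red", "green", "blue", "green"]  # hypothetical sample data
categories, encoded = one_hot_encode(colors)
print(categories)  # ['blue', 'green', 'red']
print(encoded)     # [[0, 0, 1], [0, 1, 0], [1, 0, 0], [0, 1, 0]]
```
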
Round 3 - HR 

(1 Question)

  • Q1. What is your long-term goal?
Interview experience
3
Average
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
-

I applied via Company Website and was interviewed in Aug 2024. There was 1 interview round.

Round 1 - One-on-one 

(2 Questions)

  • Q1. Explain the RAG pipeline?
  • Ans. 

    In a data science context, RAG most commonly stands for Retrieval-Augmented Generation: a pipeline that retrieves relevant documents and passes them to a large language model as context for its answer.

    • Documents are split into chunks, converted to embeddings, and stored in a vector index

    • At query time, the question is embedded and the most similar chunks are retrieved

    • The retrieved chunks are added to the prompt so the model can ground its answer in them

    • Grounding the answer in retrieved text helps reduce hallucination and allows the use of data the model was not trained on

  • Answered by AI
  • Q2. Explain the confusion matrix
  • Ans. 

    A confusion matrix is used to evaluate the performance of a classification model by comparing predicted values with actual values.

    • Confusion matrix is a table that describes the performance of a classification model.

    • For binary classification it consists of four counts: True Positive, True Negative, False Positive, and False Negative.

    • These metrics are used to calculate other evaluation metrics like accuracy, precision, recall, and F1 s...

  • Answered by AI
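
The four counts and the metrics derived from them can be worked through in plain Python. This is a minimal sketch for binary labels; the function name and the sample labels are hypothetical.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for binary labels."""
    tp = fp = tn = fn = 0
    for actual, predicted in zip(y_true, y_pred):
        if predicted == positive:
            if actual == positive:
                tp += 1
            else:
                fp += 1
        else:
            if actual == positive:
                fn += 1
            else:
                tn += 1
    return tp, fp, tn, fn


# Hypothetical labels: 1 is the positive class.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

tp, fp, tn, fn = confusion_counts(y_true, y_pred)
accuracy = (tp + tn) / (tp + fp + tn + fn)
precision = tp / (tp + fp)  # of the predicted positives, how many were right
recall = tp / (tp + fn)     # of the actual positives, how many were found
f1 = 2 * precision * recall / (precision + recall)

print(tp, fp, tn, fn)                   # 3 1 3 1
print(accuracy, precision, recall, f1)  # 0.75 0.75 0.75 0.75
```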

Interview experience
5
Excellent
Difficulty level
Easy
Process Duration
Less than 2 weeks
Result
Selected

I applied via Recruitment Consultant and was interviewed in Apr 2024. There was 1 interview round.

Round 1 - Coding Test 

SQL, Python coding …

Interview experience
5
Excellent
Difficulty level
Easy
Process Duration
Less than 2 weeks
Result
Selected

I applied via Company Website and was interviewed in Sep 2024. There were 2 interview rounds.

Round 1 - Aptitude Test 

Basic mathematical and reasoning questions.

Round 2 - Personal Interview 

(4 Questions)

  • Q1. Tell me about yourself
  • Q2. Explain your most recent project
  • Ans. 

    Developed a predictive model for customer churn in a telecom company

    • Collected and cleaned customer data including usage patterns and demographics

    • Used machine learning algorithms such as logistic regression and random forest

    • Evaluated model performance using metrics like accuracy and AUC-ROC curve

  • Answered by AI
  • Q3. What is the difference between random forest and decision tree
  • Ans. 

    Random forest is an ensemble learning method that uses multiple decision trees to make predictions, while a decision tree is a single tree-like structure that makes decisions based on features.

    • Random forest is a collection of decision trees that work together to make predictions.

    • Decision tree is a single tree-like structure that makes decisions based on features.

    • Random forest reduces overfitting by averaging the predic...

  • Answered by AI
  • Q4. What is a cost function?
  • Ans. 

    A cost function is a mathematical formula that measures the error between a model's predictions and the actual values, expressed as a single number to be minimized.

    • Cost function helps in evaluating the performance of a model by measuring how well it is able to predict the outcomes.

    • It is used in optimization problems to find the best solution that minimizes the cost.

    • Examples include mean squared error in linear regression and cross-entropy loss in logistic regression.

  • Answered by AI
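
The two examples named in the answer can be written out directly. The sketch below is a plain-Python illustration; the function names, the clipping constant `eps`, and the sample values are assumptions made for the example.

```python
import math


def mean_squared_error(y_true, y_pred):
    """Average of squared differences between actual and predicted values."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def binary_cross_entropy(y_true, p_pred, eps=1e-12):
    """Average negative log-likelihood for binary labels and predicted probabilities."""
    total = 0.0
    for t, p in zip(y_true, p_pred):
        p = min(max(p, eps), 1 - eps)  # clip to avoid log(0)
        total += -(t * math.log(p) + (1 - t) * math.log(1 - p))
    return total / len(y_true)


print(mean_squared_error([3.0, 5.0], [2.0, 7.0]))  # (1 + 4) / 2 = 2.5
print(binary_cross_entropy([1, 0], [0.9, 0.1]))    # about 0.105, confident and correct
print(binary_cross_entropy([1, 0], [0.1, 0.9]))    # about 2.303, confident and wrong
```

Lower values mean better predictions in both cases, which is why training is framed as minimizing the cost.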

Interview Preparation Tips

Interview preparation tips for other job seekers - All the above questions were based on the project I had explained.

Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
2-4 weeks
Result
Selected

I applied via Referral and was interviewed in May 2024. There were 3 interview rounds.

Round 1 - Coding Test 

I was asked to write SQL queries for the third-highest employee salary, some name filtering, and GROUP BY tasks.
I was also asked to write Python code to find the index of the maximum number without using NumPy.
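
Two of the coding tasks mentioned here can be sketched with the Python standard library alone. The table name, column names, and sample rows below are hypothetical, and the query is one common approach rather than the answer the interviewer expected.

```python
import sqlite3

# Hypothetical employees table in an in-memory SQLite database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (name TEXT, salary INTEGER)")
conn.executemany(
    "INSERT INTO employees VALUES (?, ?)",
    [("A", 50000), ("B", 70000), ("C", 70000), ("D", 60000), ("E", 40000)],
)

# DISTINCT treats tied salaries as one rank; OFFSET 2 skips the top two.
third_highest = conn.execute(
    "SELECT DISTINCT salary FROM employees ORDER BY salary DESC LIMIT 1 OFFSET 2"
).fetchone()[0]
print(third_highest)  # 50000


def index_of_max(numbers):
    """Return the index of the first maximum value without using NumPy."""
    if not numbers:
        raise ValueError("numbers must not be empty")
    best = 0
    for i in range(1, len(numbers)):
        if numbers[i] > numbers[best]:
            best = i
    return best


print(index_of_max([4, 9, 2, 9]))  # 1
```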

Round 2 - One-on-one 

(1 Question)

  • Q1. Explain the Project undertaken during the research and follow-up questions
Round 3 - Technical 

(1 Question)

  • Q1. Write pandas query to separate the names as first and last name from the full name. Drop the duplicate columns and also the missing values. Write output for the Python code. Write SQL query to retrieve t...
  • Ans. 

    Answering questions related to data science concepts and techniques.

    • Recall is the ratio of correctly predicted positive observations to the total actual positives. Precision is the ratio of correctly predicted positive observations to the total predicted positives.

    • To reduce variance in an ensemble model, techniques like bagging, boosting, and stacking can be used. Bagging involves training multiple models on different ...

  • Answered by AI

Interview Preparation Tips

Topics to prepare for Nielsen Data Scientist interview:
  • Python
  • Pandas
  • SQL
  • Machine Learning
Interview preparation tips for other job seekers - Have your basics strong.

Interview experience
3
Average
Difficulty level
Easy
Process Duration
-
Result
Not Selected

I applied via Referral and was interviewed in Aug 2024. There were 2 interview rounds.

Round 1 - Coding Test 

Topics like SQL, Python, and aptitude were covered.

Round 2 - Technical 

(1 Question)

  • Q1. Questions on statistics and machine learning were asked. Behavioral questions were also covered.

Interview Preparation Tips

Interview preparation tips for other job seekers - It was a good interview experience, but they did not respond to my emails. I waited anxiously for the result for 3 weeks before they told me I had been rejected. When I asked for the reason, they did not seem to have one. The post-interview experience was horrible for me.
Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Selected

I applied via LinkedIn and was interviewed before Mar 2022. There were 3 interview rounds.

Round 1 - Resume Shortlist 
Round 2 - Technical 

(3 Questions)

  • Q1. Questions related to python basics.
  • Q2. Basic calculus to test math skills.
  • Q3. Machine learning metrics.
Round 3 - HR 

(3 Questions)

  • Q1. Policies, behavioral round.
  • Q2. Work culture and the selection process.
  • Q3. Discussion about previous employers and educational background.

Interview Preparation Tips

Interview preparation tips for other job seekers - Keep it simple and be confident. It's good to know the reasoning behind your data science projects.

Dunnhumby Interview FAQs

How many rounds are there in Dunnhumby Data Science Engineer interview?
The Dunnhumby interview process usually has 2 rounds, most commonly Technical rounds.
How to prepare for Dunnhumby Data Science Engineer interview?
Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at Dunnhumby. The most common topics and skills that interviewers at Dunnhumby expect are Python, Agile Coaching, Algorithms, Azure and Backend.
What are the top questions asked in Dunnhumby Data Science Engineer interview?

Some of the top questions asked at the Dunnhumby Data Science Engineer interview -

  1. What is DAG? how a spark job works how the dag gets crea...
  2. What is RDD and how its different from DF and Datas...
  3. what is Partitioning and how to use coalesce and repartit...

Dunnhumby Data Science Engineer Interview Process

based on 1 interview

Interview experience

5
  
Excellent

Interview Questions from Similar Companies

EXL Service Interview Questions
3.7
 • 734 Interviews
S&P Global Interview Questions
4.1
 • 274 Interviews
Mu Sigma Interview Questions
2.6
 • 228 Interviews
Tiger Analytics Interview Questions
3.7
 • 220 Interviews
Fractal Analytics Interview Questions
4.0
 • 206 Interviews
Access Healthcare Interview Questions
3.9
 • 205 Interviews
Nielsen Interview Questions
3.7
 • 116 Interviews
Netscribes Interview Questions
2.8
 • 82 Interviews
Dunnhumby Data Science Engineer Salary
based on 6 salaries
₹12 L/yr - ₹19 L/yr
49% more than the average Data Science Engineer Salary in India

Dunnhumby Data Science Engineer Reviews and Ratings

based on 1 review

3.0/5

Rating in categories

3.0

Skill development

4.0

Work-life balance

3.0

Salary

4.0

Job security

4.0

Company culture

3.0

Promotions

3.0

Work satisfaction

Senior Applied Data Scientist
128 salaries
₹10.9 L/yr - ₹20 L/yr

Lead Applied Data Scientist
85 salaries
₹17 L/yr - ₹28.5 L/yr

Applied Data Scientist
79 salaries
₹10 L/yr - ₹16.5 L/yr

Senior Engineer
61 salaries
₹10 L/yr - ₹30 L/yr

Senior Data Scientist
49 salaries
₹9 L/yr - ₹28 L/yr

Compare Dunnhumby with

Fractal Analytics
4.0
Mu Sigma
2.6
AbsolutData
3.6
Algonomy
4.0