Cognizant

3.7

based on 54.6k Reviews

Cognizant Associate Data Engineer Interview Questions and Answers

Updated 19 Oct 2021

11 Interview questions

An Associate Data Engineer was asked

Q. How do you delete duplicate rows in SQL?

Ans.

Deleting duplicate rows in SQL

DISTINCT in a SELECT statement returns unique rows but does not delete anything; it is useful for copying de-duplicated rows into a new table.
Use GROUP BY on the columns that define a duplicate, keep one row per group (for example the one with the minimum ID), and delete the rest.
Use ROW_NUMBER() with PARTITION BY on the columns that define a duplicate, then delete every row whose row number is greater than 1.
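
The GROUP BY approach can be sketched with Python's built-in sqlite3 module. This is a minimal illustration, not a production script: the employees table, its columns, and the sample rows are all hypothetical, and a duplicate is assumed to mean "same name and email".

```python
import sqlite3


def build_demo():
    """Create an in-memory table with made-up rows, two of them duplicated."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE employees (id INTEGER PRIMARY KEY, name TEXT, email TEXT)"
    )
    conn.executemany(
        "INSERT INTO employees (name, email) VALUES (?, ?)",
        [
            ("Asha", "asha@example.com"),
            ("Ravi", "ravi@example.com"),
            ("Asha", "asha@example.com"),   # duplicate of id 1
            ("Ravi", "ravi@example.com"),   # duplicate of id 2
            ("Meena", "meena@example.com"),
        ],
    )
    return conn


def delete_duplicates(conn):
    """Keep the lowest-id row of each (name, email) group; delete the rest."""
    cur = conn.execute(
        """
        DELETE FROM employees
        WHERE id NOT IN (
            SELECT MIN(id) FROM employees GROUP BY name, email
        )
        """
    )
    conn.commit()
    return cur.rowcount  # number of rows removed
```

Keeping MIN(id) is an arbitrary choice; MAX(id) would keep the most recently inserted copy instead.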

An Associate Data Engineer was asked

Q. How do you find a process ID in Linux?

Ans.

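To find a process ID in Linux, use the command 'ps aux | grep ' followed by the process name.

Open the terminal.
Type 'ps aux' to list all running processes.
Pipe the output to 'grep ' followed by the process name to filter for the process you are looking for.
The process ID (PID) is listed in the second column.
Alternatively, 'pgrep ' followed by the process name prints only the matching PIDs.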
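
To make the "second column" point concrete, here is a small Python sketch that parses text in the usual 'ps aux' layout. The sample output and the process name load_job are invented for illustration; the function works on any captured 'ps aux' text and skips the grep process itself, which would otherwise match its own search term.

```python
# Made-up sample in the usual `ps aux` column layout.
SAMPLE_PS_OUTPUT = """\
USER         PID %CPU %MEM    VSZ   RSS TTY      STAT START   TIME COMMAND
root           1  0.0  0.1 168000 11000 ?        Ss   09:00   0:01 /sbin/init
etl         2314  1.2  3.4 900000 70000 ?        Sl   09:05   0:42 python3 load_job.py
etl         2401  0.0  0.0   6000   800 pts/0    S+   09:30   0:00 grep load_job
"""


def pids_from_ps(ps_output, name):
    """Return the PIDs (second column) of processes whose command contains name."""
    pids = []
    for line in ps_output.splitlines()[1:]:  # skip the header row
        fields = line.split(None, 10)        # the 11th field is the full command
        if len(fields) < 11:
            continue
        pid, command = fields[1], fields[10]
        if command.startswith("grep "):      # ignore the grep process itself
            continue
        if name in command:
            pids.append(int(pid))
    return pids
```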

An Associate Data Engineer was asked

Q. In DataStage, how would you remove the header and trailer from a sequential data file?

Ans.

Removing the header and trailer from a sequential data file in DataStage.

Read the file with a Sequential File stage.
If the header is a single line of column names, set the 'First Line is Column Names' property to True so that it is skipped.
To drop the first and last lines before the data is parsed, set the stage's 'Filter' property to a Unix command such as sed '1d;$d'.
Alternatively, drop any remaining header or trailer rows in a Transformer stage with a constraint, for example on a record-type indicator column.

An Associate Data Engineer was asked

Q. How do you handle Out Of Memory (OOM) issues in Spark?

Ans.

Spark OutOfMemory (OOM) errors occur when the driver or an executor needs more memory than it has been allocated, which causes tasks or the whole application to fail.

Increase executor memory: Use the configuration 'spark.executor.memory' to allocate more memory to executors.
Optimize data partitioning: Use 'repartition()' to increase the number of partitions so that each task handles less data; 'coalesce()' only reduces the partition count.
Use broadcast variables: For small lookup tables, use 'sc.broadcast()' to reduce memo...

An Associate Data Engineer was asked

Q. How would you read data from a .log file and extract specific columns using regular expressions?

Ans.

Reading data from a .log file and extracting columns with a specific regex.

Use Python's built-in 're' module to define the regex pattern.
Open the .log file using Python's 'open' function.
Iterate through each line of the file and extract the desired columns using the regex pattern.
Store the extracted data in a data structure such as a list or dictionary.
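
The steps above can be sketched as follows. The log layout (date, time, level, source, message) is an assumption made for this example, since the original question does not say what the file looks like; the pattern would need to be adapted to the real format.

```python
import re

# Hypothetical line layout: "2021-06-24 10:15:32 ERROR loader - File not found"
LOG_PATTERN = re.compile(
    r"^(?P<date>\d{4}-\d{2}-\d{2}) "
    r"(?P<time>\d{2}:\d{2}:\d{2}) "
    r"(?P<level>[A-Z]+) "
    r"(?P<source>\S+) - "
    r"(?P<message>.*)$"
)


def parse_log_lines(lines):
    """Return one dict of named columns per line that matches the pattern."""
    rows = []
    for line in lines:
        match = LOG_PATTERN.match(line.rstrip("\n"))
        if match:  # lines that do not fit the layout are skipped
            rows.append(match.groupdict())
    return rows


def parse_log_file(path):
    """Open a .log file and extract the columns line by line."""
    with open(path, encoding="utf-8") as handle:
        return parse_log_lines(handle)
```

Named groups keep the extracted columns self-describing, so the resulting list of dictionaries can be passed straight to csv.DictWriter or a DataFrame constructor.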

An Associate Data Engineer was asked

Q. How would you find the count and profit from the data for the last 4 years?

Ans.

Calculate the count and profit from data over the last four years using SQL or data analysis tools.

Use SQL queries like 'SELECT COUNT(*)' to get the total count of records.
To calculate profit, use 'SUM(profit_column)' grouped by year.
Example SQL (MySQL syntax): 'SELECT YEAR(date_column), COUNT(*), SUM(profit_column) FROM sales WHERE date_column >= DATE_SUB(CURDATE(), INTERVAL 4 YEAR) GROUP BY YEAR(date_column);'
Consider filtering ...
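
For a runnable version of the same idea, the sketch below uses Python's built-in sqlite3 module. It keeps the sales, date_column, and profit_column names from the example query, but the sample rows are invented, dates are assumed to be stored as ISO-8601 text, and SQLite's strftime replaces MySQL's YEAR(). The reference date is passed in explicitly so the result does not depend on when the code runs.

```python
import sqlite3
from datetime import date


def cutoff_iso(today, years=4):
    """ISO date string 'years' before today; ISO strings compare correctly as text."""
    return f"{today.year - years:04d}-{today.month:02d}-{today.day:02d}"


def yearly_count_and_profit(conn, today):
    """Return (year, row count, total profit) for rows in the last four years."""
    query = """
        SELECT strftime('%Y', date_column) AS year,
               COUNT(*) AS sale_count,
               SUM(profit_column) AS total_profit
        FROM sales
        WHERE date_column >= ?
        GROUP BY strftime('%Y', date_column)
        ORDER BY year
    """
    return conn.execute(query, (cutoff_iso(today),)).fetchall()


def build_demo():
    """In-memory table with made-up rows, one of them older than four years."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sales (date_column TEXT, profit_column REAL)")
    conn.executemany(
        "INSERT INTO sales VALUES (?, ?)",
        [
            ("2016-03-01", 100.0),  # outside the four-year window
            ("2018-07-15", 50.0),
            ("2018-11-02", 25.0),
            ("2020-01-10", 40.0),
            ("2021-05-05", 10.0),
        ],
    )
    return conn
```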

An Associate Data Engineer was asked

Q. SORT BY ORDER BY CLUSTER BY DISTRIBUTE BY

Ans.

SORT BY, ORDER BY, CLUSTER BY, and DISTRIBUTE BY are Hive (and Spark SQL) clauses that control how rows are sorted and distributed across reducers.

SORT BY sorts rows within each reducer, so the overall output is only partially ordered when more than one reducer is used.
ORDER BY guarantees a total ordering of the result set; in Hive this sends all rows through a single reducer, which can be slow on large data.
DISTRIBUTE BY sends rows with the same column values to the same reducer, without sorting them.
CLUSTER BY is used to group...
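
The difference between these clauses is easier to see in a toy simulation. The Python sketch below is not Hive code: it models reducers as plain lists, uses a modulo on an integer key in place of Hive's hash partitioning, and treats CLUSTER BY as DISTRIBUTE BY followed by SORT BY on the same key. The dept_id column and the rows are made up.

```python
def distribute_by(rows, key, num_reducers):
    """Send rows with the same key value to the same 'reducer' (a list)."""
    reducers = [[] for _ in range(num_reducers)]
    for row in rows:
        reducers[row[key] % num_reducers].append(row)  # stand-in for hashing
    return reducers


def sort_by(reducers, key):
    """Sort within each reducer only; there is no ordering across reducers."""
    return [sorted(rows, key=lambda r: r[key]) for rows in reducers]


def cluster_by(rows, key, num_reducers):
    """Distribute on the key, then sort each reducer on the same key."""
    return sort_by(distribute_by(rows, key, num_reducers), key)


def order_by(rows, key):
    """Total ordering: every row passes through a single sort."""
    return sorted(rows, key=lambda r: r[key])


ROWS = [
    {"dept_id": 3, "name": "c"},
    {"dept_id": 1, "name": "a"},
    {"dept_id": 4, "name": "d"},
    {"dept_id": 2, "name": "b"},
    {"dept_id": 1, "name": "e"},
]
```

With two reducers, cluster_by(ROWS, "dept_id", 2) yields two independently sorted groups, while order_by(ROWS, "dept_id") yields one globally sorted list.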

An Associate Data Engineer was asked

Q. RDS VA DF VS DS

Ans.

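Taken literally, RDS, VA, DF, VS, and DS can be expanded as the acronyms below. The question as posted appears garbled; in a Spark interview it most likely refers to comparing RDD vs. DataFrame (DF) vs. Dataset (DS).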

RDS stands for Relational Database Service, a managed database service by AWS.
VA stands for Virtual Assistant, a software program that can assist with tasks.
DF stands for Dataflow, a managed service by Google Cloud for data processing.
VS stands for Virtual Server, a server that runs on a virtual machine.
DS stands for Datastore, a NoSQL document d...

An Associate Data Engineer was asked

Q. SMALL FILE PROBLEM

Ans.

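The small file problem is the overhead caused by storing a very large number of files that are much smaller than the storage system's block size.

In HDFS, every file, directory, and block is tracked in NameNode memory, and each small file is typically processed by its own task, so many small files strain both storage metadata and processing.
Solutions include compacting small files into larger ones or using a container format that packs many records into one file.
Examples include Hadoop's SequenceFile format and Hadoop Archive (HAR) files; on object stores such as Amazon S3, writing fewer, larger objects reduces per-request overhead.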
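
Compaction, the first solution listed, can be illustrated on a local file system. The sketch below is only an analogy for what a compaction job does on HDFS or S3: the function name, the part-NNNNN output naming, and the byte threshold are assumptions for the example, and it simply concatenates files, which is appropriate for line-oriented text but not for formats with headers.

```python
import tempfile
from pathlib import Path


def compact_files(src_dir, dest_dir, target_bytes):
    """Concatenate small files into output parts of roughly target_bytes each."""
    src, dest = Path(src_dir), Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    outputs = []
    out, written = None, 0
    for path in sorted(p for p in src.iterdir() if p.is_file()):
        data = path.read_bytes()
        # Start a new part when the current one would exceed the target size.
        if out is None or (written > 0 and written + len(data) > target_bytes):
            if out is not None:
                out.close()
            out_path = dest / f"part-{len(outputs):05d}"
            out = out_path.open("wb")
            outputs.append(out_path)
            written = 0
        out.write(data)
        written += len(data)
    if out is not None:
        out.close()
    return outputs


def demo():
    """Compact five made-up 4-byte files with a 10-byte target; return part sizes."""
    with tempfile.TemporaryDirectory() as tmp:
        src = Path(tmp) / "small"
        src.mkdir()
        for i in range(5):
            (src / f"file-{i}.txt").write_bytes(b"row\n")
        parts = compact_files(src, Path(tmp) / "compacted", target_bytes=10)
        return [p.stat().st_size for p in parts]
```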

An Associate Data Engineer was asked

Q. SQL QUERIES WITH WINDOW FUNCTION

Ans.

SQL queries with window functions

Window functions perform calculations across a set of rows that are related to the current row
Common window functions include ROW_NUMBER, RANK, DENSE_RANK, and NTILE
Window functions are used with the OVER() clause to define the window or subset of rows to perform the calculation on
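
A short runnable illustration, using Python's built-in sqlite3 module, shows how ROW_NUMBER, RANK, and DENSE_RANK differ on tied values. The employees table and its rows are hypothetical, and the bundled SQLite library is assumed to be version 3.25 or later, which is when window functions were added. The same DENSE_RANK pattern is one common way to answer the "Nth highest salary" question.

```python
import sqlite3


def build_demo():
    """In-memory table with made-up rows, including a salary tie."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE employees (name TEXT, dept TEXT, salary INTEGER)")
    conn.executemany(
        "INSERT INTO employees VALUES (?, ?, ?)",
        [
            ("A", "eng", 100),
            ("B", "eng", 100),  # ties with A
            ("C", "eng", 90),
            ("D", "ops", 80),
            ("E", "ops", 70),
        ],
    )
    return conn


def ranked_salaries(conn):
    """Rank employees within each department by salary, highest first."""
    return conn.execute(
        """
        SELECT name, dept, salary,
               ROW_NUMBER() OVER (PARTITION BY dept ORDER BY salary DESC, name) AS row_num,
               RANK()       OVER (PARTITION BY dept ORDER BY salary DESC) AS rnk,
               DENSE_RANK() OVER (PARTITION BY dept ORDER BY salary DESC) AS dense_rnk
        FROM employees
        ORDER BY dept, row_num
        """
    ).fetchall()


def nth_highest_salary(conn, n):
    """Return the n-th highest distinct salary, or None if there are fewer."""
    row = conn.execute(
        """
        SELECT salary FROM (
            SELECT DISTINCT salary,
                   DENSE_RANK() OVER (ORDER BY salary DESC) AS rnk
            FROM employees
        )
        WHERE rnk = ?
        """,
        (n,),
    ).fetchone()
    return row[0] if row else None
```

On the tied salaries, RANK leaves a gap after the tie (1, 1, 3) while DENSE_RANK does not (1, 1, 2); ROW_NUMBER always produces distinct numbers.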

Cognizant Associate Data Engineer Interview Experiences

4 interviews found

Associate Data Engineer Interview Questions & Answers

Anonymous

posted on 11 Aug 2021

Interview Questionnaire

5 Questions

Q1. How to find the 3rd highest salary in SQL.

Q2. How to delete duplicate rows in SQL

Q3. DataStage - How will you remove the header and trailer from a sequential data file

Q4. How would you kill a job in DataStage

Ans.

To kill a job in DataStage

Stop the job manually from the Director client.
Stop the job from the command line with the dsjob command and its -stop option.
If the job does not respond, kill its process at the operating system level.
Deleting the job from the DataStage repository is not a way to stop it; that removes the job design itself.

Answered by AI

Q5. How to find a process ID in Linux

Associate Data Engineer Interview Questions & Answers

Anonymous

posted on 10 Jul 2021

Interview Questionnaire

6 Questions

Q1. Basics of Hive and Spark

Q2. SQL QUERIES WITH WINDOW FUNCTION

Q3. SORT BY ORDER BY CLUSTER BY DISTRIBUTE BY

Q4. SPARK OOM ISSUE

Q5. SMALL FILE PROBLEM

Q6. RDS VA DF VS DS

Associate Data Engineer Interview Questions & Answers

akshaya rani

posted on 19 Oct 2021

I applied via Recruitment Consultant and was interviewed in Sep 2021. There were 3 interview rounds.

Interview Questionnaire

2 Questions

Q1. Azure activities

Q2. Project experience

Interview Preparation Tips

Interview preparation tips for other job seekers - Azure basics must be very clear; also expect scenario-based questions.

Associate Data Engineer Interview Questions & Answers

Anonymous

posted on 24 Jun 2021

I applied via Recruitment Consultant and was interviewed before Jun 2020. There were 4 interview rounds.

Interview Questionnaire

6 Questions

Q1. Reading Data from a .log file and finding out each column with a specific regex.

Q2. Asked to find count and profit from the data for last 4 years

Q3. Optimizations I can use

Ans.

Optimizations for data engineering

Use indexing to speed up queries
Partition data to improve query performance
Use caching to reduce data retrieval time
Optimize data storage format for faster processing
Use parallel processing to speed up data processing
Optimize network bandwidth usage
Use compression to reduce storage and network usage

Answered by AI

Q4. Python JSON reading

Ans.

Reading JSON in Python.

Use the json module to load and parse JSON data
Use the json.loads() method to load JSON data from a string
Use the json.load() method to load JSON data from a file
Access JSON data using keys or indexes
Use the json.dumps() method to convert Python objects to JSON strings
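
The json calls listed above fit together as in this minimal sketch. The JSON document and its keys are invented for illustration; only the standard-library json module is used.

```python
import json

# Made-up JSON text for the example.
SAMPLE_TEXT = '{"name": "Asha", "skills": ["sql", "spark"], "experience": {"years": 2}}'


def load_from_string(text):
    """Parse JSON held in a string into Python dicts, lists, and scalars."""
    return json.loads(text)


def load_from_file(path):
    """Parse JSON read from a file on disk."""
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)


def to_json_string(obj):
    """Serialize a Python object back to a JSON string."""
    return json.dumps(obj)


record = load_from_string(SAMPLE_TEXT)
second_skill = record["skills"][1]           # lists are accessed by index
years = record["experience"]["years"]        # nested objects by key
```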

Answered by AI

Q5. Array, List - Python scenarios

Q6. Pyspark configs

Interview Preparation Tips

Interview preparation tips for other job seekers - Very easy if you have previous work experience.

Interview questions from similar companies

Software Engineer Interview Questions & Answers

HCLTech

Anonymous

posted on 17 Aug 2021

I applied via Naukri.com and was interviewed before Aug 2020. There were 4 interview rounds.

Interview Questionnaire

1 Question

Q1. Technical questions: 1) OOP concepts 2) PL/SQL cursors, triggers, procedures 3) Quick sort algorithm

Interview Preparation Tips

Interview preparation tips for other job seekers - Be prepared with your resume. No questions were asked from outside the resume.

Software Engineer Interview Questions & Answers

HCLTech

Pugazharasan S

posted on 7 Sep 2021

Interview Questionnaire

2 Questions

Q1. Tell me about yourself

Q2. Reverse string

Ans.

Reversing a string involves rearranging its characters in the opposite order, which can be done using various methods.

Use built-in features: In Python, you can reverse a string with slicing: `reversed_string = original_string[::-1]`.
Iterative approach: Loop through the string from the end to the beginning and build a new string.
Using recursion: Define a function that calls itself with a smaller substring until it reac...
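
The three approaches can be sketched side by side in Python. The function names are chosen for this example only.

```python
def reverse_slice(text):
    """Slicing with a step of -1 walks the string backwards."""
    return text[::-1]


def reverse_iterative(text):
    """Loop from the last index to the first and collect the characters."""
    chars = []
    for index in range(len(text) - 1, -1, -1):
        chars.append(text[index])
    return "".join(chars)


def reverse_recursive(text):
    """Reverse the tail, then append the first character."""
    if len(text) <= 1:  # base case: empty or single-character string
        return text
    return reverse_recursive(text[1:]) + text[0]
```

Slicing is the idiomatic choice in Python; the recursive version is mainly of interview interest, since it creates a new substring per call and is limited by the recursion depth for long inputs.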

Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Average level interview

Software Engineer Interview Questions & Answers

HCLTech

Anonymous

posted on 5 Nov 2022

I applied via Recruitment Consultant and was interviewed before Nov 2021. There were 4 interview rounds.

Round 1 - Resume Shortlist

Round 2 - Assignment

MCQs and 1 coding problem related to Java, RDBMS, and JS

Round 3 - Technical

(1 Question)

Q1. Basic questions on Java and project-related questions

Round 4 - HR

(1 Question)

Q1. Introduction, discussion of the role, and salary negotiation.

Interview Preparation Tips

Interview preparation tips for other job seekers - Prepare with basics and be confident with the answers.

Software Engineer Interview Questions & Answers

HCLTech

Bhaskar Mishra

posted on 5 Sep 2021

I applied via Recruitment Consultant and was interviewed in Aug 2021. There was 1 interview round.

Interview Questionnaire

1 Question

Q1. StringBuilder and StringBuffer; when we create arrays and when we create linked lists; TreeSet in Java; Java 8 features; static keyword; dynamic method dispatch; how to make an immutable cla...

Ans.

Questions related to Java programming language concepts and features.

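StringBuilder and StringBuffer are both mutable classes for efficient string manipulation; StringBuffer is synchronized (thread-safe), while StringBuilder is not and is usually faster.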
Arrays are fixed in size while linked lists can grow dynamically.
TreeSet is a sorted set implementation in Java.
Java 8 introduced lambda expressions, streams, and functional interfaces.
Static keyword is used to create class-level variables and methods.
Dynamic method dispatch is ...

Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Just be confident and rock!

Software Engineer Interview Questions & Answers

Infosys

Nitin Sridhar

posted on 15 May 2021

Interview Questionnaire

2 Questions

Q1. Apigee

Q2. Internal architecture

Software Engineer Interview Questions & Answers

Infosys

Anonymous

posted on 20 May 2021

Interview Questionnaire

1 Question

Q1. Where do you see yourself in 5 years?

Cognizant Associate Data Engineer Salary

based on 127 salaries

₹4.5 L/yr - ₹11.8 L/yr

14% less than the average Associate Data Engineer Salary in India

Cognizant Salaries in India

Designation                    Salaries reported    Salary range
Associate                      73k                  ₹5.3 L/yr - ₹12.5 L/yr
Programmer Analyst             56.1k                ₹3.5 L/yr - ₹7.3 L/yr
Senior Associate               53k                  ₹10.6 L/yr - ₹23.4 L/yr
Senior Processing Executive    29.8k                ₹2.2 L/yr - ₹6.5 L/yr
Technical Lead                 18.1k                ₹6 L/yr - ₹21.4 L/yr