Upload Button Icon Add office photos
Engaged Employer

i

This company page is being actively managed by Cognizant Team. If you also belong to the team, you can get access from here

Cognizant Verified Tick

Compare button icon Compare button icon Compare

Proud winner of ABECA 2024 - AmbitionBox Employee Choice Awards

zig zag pattern zig zag pattern

Filter interviews by

Cognizant Associate Data Engineer Interview Questions, Process, and Tips

Updated 19 Oct 2021

Top Cognizant Associate Data Engineer Interview Questions and Answers

View all 10 questions

Cognizant Associate Data Engineer Interview Experiences

4 interviews found

Interview Questionnaire 

5 Questions

  • Q1. How to find 3rd Highest salary in Sql.
  • Q2. How to delete duplicate rows in Sql
  • Ans. 

    Deleting duplicate rows in SQL

    • Use the DISTINCT keyword in SELECT statement to retrieve unique rows

    • Use GROUP BY clause to group rows with same values and then use aggregate functions to select one row

    • Use the ROW_NUMBER() function to assign a unique number to each row and then delete the rows with duplicate numbers

  • Answered by AI
  • Q3. Datastge - How will you remove Header and trailer from Sequential data file
  • Ans. 

    To remove header and trailer from a sequential data file in Datastage.

    • Use Sequential File stage in Datastage.

    • Set the 'Skip Rows' property to the number of header rows to be skipped.

    • Set the 'Trailer Rows' property to the number of trailer rows to be skipped.

    • Use a Transformer stage to remove any remaining header or trailer rows.

    • Use the 'Remove' function in the Transformer stage to remove the rows.

  • Answered by AI
  • Q4. How would you kill any job in Datastage
  • Ans. 

    To kill a job in Datastage

    • Stop the job manually from the Director client

    • Terminate the job from the command line using the dsjob command

    • Kill the job process from the operating system level

    • Delete the job from the Datastage repository

  • Answered by AI
  • Q5. How to find process id in linux
  • Ans. 

    To find process id in Linux, use the command 'ps -aux | grep '

    • Open the terminal

    • Type 'ps -aux' to list all running processes

    • Use 'grep ' to filter the process you are looking for

    • The process id (PID) will be listed in the second column

Answered by AI

Skills evaluated in this interview

Interview Questionnaire 

6 Questions

  • Q1. Basics of HIVE AND SPARK
  • Q2. SQL QUERIES WITH WINDOW FUNCTION
  • Ans. 

    SQL queries with window functions

    • Window functions perform calculations across a set of rows that are related to the current row

    • Common window functions include ROW_NUMBER, RANK, DENSE_RANK, and NTILE

    • Window functions are used with the OVER() clause to define the window or subset of rows to perform the calculation on

  • Answered by AI
  • Q3. SORT BY ORDER BY CLUSTER BY DISTRIBUTE BY
  • Ans. 

    SORT BY, ORDER BY, CLUSTER BY, and DISTRIBUTE BY are SQL clauses used for data sorting and partitioning.

    • SORT BY is used to sort the result set in ascending or descending order based on one or more columns.

    • ORDER BY is used to sort the result set in ascending or descending order based on one or more columns. It is similar to SORT BY but can be used with other clauses like LIMIT and OFFSET.

    • CLUSTER BY is used to group data...

  • Answered by AI
  • Q4. SPARK OOM ISSUE
  • Q5. SMALL FILE PROBLEM
  • Ans. 

    Small file problem refers to the issue of having a large number of small files in a storage system.

    • Small files can cause inefficiencies in storage and processing.

    • Solutions include consolidating small files into larger ones or using a different storage system.

    • Examples include Hadoop's SequenceFile format and Amazon S3's object size optimization.

  • Answered by AI
  • Q6. RDS VA DF VS DS
  • Ans. 

    RDS, VA, DF, VS, and DS are all acronyms related to data engineering.

    • RDS stands for Relational Database Service, a managed database service by AWS.

    • VA stands for Virtual Assistant, a software program that can assist with tasks.

    • DF stands for Dataflow, a managed service by Google Cloud for data processing.

    • VS stands for Virtual Server, a server that runs on a virtual machine.

    • DS stands for Datastore, a NoSQL document databa

  • Answered by AI

Skills evaluated in this interview

Associate Data Engineer Interview Questions Asked at Other Companies

asked in Cognizant
Q1. Datastge - How will you remove Header and trailer from Sequential ... read more
asked in Cognizant
Q2. How to delete duplicate rows in Sql
asked in Capgemini
Q3. How would you recommend customer to migrate on cloud
asked in Cognizant
Q4. How to find process id in linux
asked in Cognizant
Q5. Reading Data from a .log file and finding out each column with a ... read more

I applied via Recruitment Consultant and was interviewed in Sep 2021. There were 3 interview rounds.

Interview Questionnaire 

2 Questions

  • Q1. Azure activities
  • Q2. Project experience

Interview Preparation Tips

Interview preparation tips for other job seekers - Basics of azure must be very clear also scenario based questions.

I applied via Recruitment Consultant and was interviewed before Jun 2020. There were 4 interview rounds.

Interview Questionnaire 

6 Questions

  • Q1. Reading Data from a .log file and finding out each column with a specific regex.
  • Ans. 

    Reading data from a .log file and extracting columns with a specific regex.

    • Use Python's built-in 're' module to define the regex pattern.

    • Open the .log file using Python's 'open' function.

    • Iterate through each line of the file and extract the desired columns using the regex pattern.

    • Store the extracted data in a data structure such as a list or dictionary.

  • Answered by AI
  • Q2. Asked to find count and profit from the data for last 4 years
  • Q3. Optimizations I can use
  • Ans. 

    Optimizations for data engineering

    • Use indexing to speed up queries

    • Partition data to improve query performance

    • Use caching to reduce data retrieval time

    • Optimize data storage format for faster processing

    • Use parallel processing to speed up data processing

    • Optimize network bandwidth usage

    • Use compression to reduce storage and network usage

  • Answered by AI
  • Q4. Python JSON reading
  • Ans. 

    Answering how to read JSON in Python.

    • Use the json module to load and parse JSON data

    • Use the json.loads() method to load JSON data from a string

    • Use the json.load() method to load JSON data from a file

    • Access JSON data using keys or indexes

    • Use the json.dumps() method to convert Python objects to JSON strings

  • Answered by AI
  • Q5. Array, List - Python scenarios
  • Q6. Pyspark configs
  • Ans. 

    Pyspark configs

    • Pyspark configs are used to configure the behavior of a Pyspark application.

    • They can be set using SparkConf object or spark-submit command.

    • Examples include setting the number of executors, memory allocation, and log level.

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Very easy if you have prev work exp.

Skills evaluated in this interview

Cognizant interview questions for designations

 Associate Data Scientist

 (1)

 Data Engineer

 (30)

 Associate Engineer

 (52)

 Azure Data Engineer

 (7)

 Gcp Data Engineer

 (3)

 Senior Data Engineer

 (2)

 AWS Data Engineer

 (2)

 Data Center Engineer

 (2)

Interview questions from similar companies

Interview experience
5
Excellent
Difficulty level
Easy
Process Duration
2-4 weeks
Result
Selected Selected

I applied via campus placement at RC Patel College of Education, Shirpur and was interviewed in Oct 2024. There were 3 interview rounds.

Round 1 - Aptitude Test 

There are some general aptitude questions.

Round 2 - Coding Test 

There were two simple codes from which we need to pass the test case for at least one code

Round 3 - HR 

(3 Questions)

  • Q1. Tell me about your project
  • Q2. What was the difficut subject in college
  • Ans. 

    The most difficult subject in college was Advanced Calculus.

    • Advanced Calculus involved complex mathematical concepts and required a deep understanding of calculus principles.

    • The subject required a lot of practice and problem-solving skills to master the concepts.

    • Topics such as multivariable calculus, differential equations, and vector calculus were particularly challenging.

    • The abstract nature of the subject made it dif...

  • Answered by AI
  • Q3. Tell me about yourself
  • Ans. 

    I am a recent graduate with a degree in Computer Science and a passion for data engineering.

    • Graduated with a degree in Computer Science

    • Strong interest in data engineering

    • Completed internships in data analysis and database management

  • Answered by AI

I applied via Recruitment Consultant and was interviewed in Feb 2021. There were 4 interview rounds.

Interview Questionnaire 

3 Questions

  • Q1. #3 interviews were there 1. Basic Telephonic all basic questions were there then 2. zoom video for detaled code and sql queries 3. advanced for algo designing and case analysis questions.All related to had...
  • Q2. Basic question and coding is answered as per knowledge and experience. Advance questions is totally based on aptitude . reasonng and decision making skills
  • Q3. Why TechMahindra guys conduct interviews if after selecting all technical round they dnt have any offer to give. And make it as hold.

Interview Preparation Tips

Interview preparation tips for other job seekers - Please discuss frankly to HR after selecting in all interview round will i get the offer with expected salary or just it will hold to infinite time.

Interview Questionnaire 

1 Question

  • Q1. How would you recommend customer to migrate on cloud
  • Ans. 

    To recommend customers to migrate to the cloud, assess their current infrastructure, plan the migration strategy, choose the right cloud provider, and ensure data security.

    • Assess the customer's current infrastructure and identify the applications and data that can be migrated to the cloud.

    • Plan the migration strategy by considering factors like cost, time, and resource requirements.

    • Choose the right cloud provider based ...

  • Answered by AI

Skills evaluated in this interview

Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
4-6 weeks
Result
Selected Selected

I applied via Walk-in and was interviewed in Dec 2024. There were 5 interview rounds.

Round 1 - HR 

(1 Question)

  • Q1. Resume screening Self introduction Roles and responsibilities
Round 2 - Assignment 

Given task Statics standard deviations Attrition Average of given table values and Given graph economi graph and poverty graph base on that need to gave answers 30 qustion and 60 min time duration

Round 3 - Technical 

(2 Questions)

  • Q1. Versant test should be need to attend Hirepro will have interview prosess in YouTube will check and prepare mid level hard
  • Q2. We can goo for online prosess how it is
Round 4 - One-on-one 

(1 Question)

  • Q1. Manger Level round they will ask KPI NPS AHT standard deviations Service Level handling Root cause Analysis Weighted average Same product how will calculate Totall Swer % how will get Many of like bpi <20 ...
Round 5 - HR 

(1 Question)

  • Q1. Salary discussion and update structure of our role and responsibilities
Interview experience
4
Good
Difficulty level
Easy
Process Duration
Less than 2 weeks
Result
No response

I applied via Naukri.com and was interviewed in Nov 2024. There was 1 interview round.

Round 1 - Technical 

(6 Questions)

  • Q1. Can you introduce yourself and describe your current project experience?
  • Ans. 

    I am a Senior Data Engineer with experience in building scalable data pipelines and optimizing data processing workflows.

    • Experience in designing and implementing ETL processes using tools like Apache Spark and Airflow

    • Proficient in working with large datasets and optimizing query performance

    • Strong background in data modeling and database design

    • Worked on projects involving real-time data processing and streaming analytic

  • Answered by AI
  • Q2. Decorators in python
  • Ans. 

    Decorators in Python are functions that modify the behavior of other functions or methods.

    • Decorators are defined using the @decorator_name syntax before a function definition.

    • They can be used to add functionality to existing functions without modifying their code.

    • Decorators can be used for logging, timing, authentication, and more.

    • Example: @staticmethod decorator in Python is used to define a static method in a class.

  • Answered by AI
  • Q3. What is the SQL query to group by employee ID in order to combine the first name and last name with a space?
  • Ans. 

    SQL query to group by employee ID and combine first name and last name with a space

    • Use the GROUP BY clause to group by employee ID

    • Use the CONCAT function to combine first name and last name with a space

    • Select employee ID, CONCAT(first_name, ' ', last_name) AS full_name

  • Answered by AI
  • Q4. What are constructors in Python?
  • Ans. 

    Constructors in Python are special methods used for initializing objects. They are called automatically when a new instance of a class is created.

    • Constructors are defined using the __init__() method in a class.

    • They are used to initialize instance variables of a class.

    • Example: class Person: def __init__(self, name, age): self.name = name self.age = age person1 = Person('Alice', 30)

  • Answered by AI
  • Q5. Indexing in sql
  • Ans. 

    Indexing in SQL is a technique used to improve the performance of queries by creating a data structure that allows for faster retrieval of data.

    • Indexes are created on columns in a database table to speed up the retrieval of rows that match a certain condition in a WHERE clause.

    • Indexes can be created using CREATE INDEX statement in SQL.

    • Types of indexes include clustered indexes, non-clustered indexes, unique indexes, an...

  • Answered by AI
  • Q6. Why spark works well with parquet files?
  • Ans. 

    Spark works well with Parquet files due to its columnar storage format, efficient compression, and ability to push down filters.

    • Parquet files are columnar storage format, which aligns well with Spark's processing model of working on columns rather than rows.

    • Parquet files support efficient compression, reducing storage space and improving read performance in Spark.

    • Spark can push down filters to Parquet files, allowing f...

  • Answered by AI

Skills evaluated in this interview

Data Engineer Interview Questions & Answers

Genpact user image Sashikanta Parida

posted on 17 Dec 2024

Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Not Selected

I applied via Recruitment Consulltant and was interviewed in Nov 2024. There were 2 interview rounds.

Round 1 - Technical 

(3 Questions)

  • Q1. What are different type of joins available in Databricks?
  • Ans. 

    Different types of joins available in Databricks include inner join, outer join, left join, right join, and cross join.

    • Inner join: Returns only the rows that have matching values in both tables.

    • Outer join: Returns all rows when there is a match in either table.

    • Left join: Returns all rows from the left table and the matched rows from the right table.

    • Right join: Returns all rows from the right table and the matched rows ...

  • Answered by AI
  • Q2. How do you make your data pipeline fault tolerant?
  • Ans. 

    Implementing fault tolerance in a data pipeline involves redundancy, monitoring, and error handling.

    • Use redundant components to ensure continuous data flow

    • Implement monitoring tools to detect failures and bottlenecks

    • Set up automated alerts for immediate response to issues

    • Design error handling mechanisms to gracefully handle failures

    • Use checkpoints and retries to ensure data integrity

  • Answered by AI
  • Q3. What is AutoLoader?
  • Ans. 

    AutoLoader is a feature in data engineering that automatically loads data from various sources into a data warehouse or database.

    • Automates the process of loading data from different sources

    • Reduces manual effort and human error

    • Can be scheduled to run at specific intervals

    • Examples: Apache Nifi, AWS Glue

  • Answered by AI
Round 2 - Technical 

(2 Questions)

  • Q1. How do you connect to different services in Azure?
  • Ans. 

    To connect to different services in Azure, you can use Azure SDKs, REST APIs, Azure Portal, Azure CLI, and Azure PowerShell.

    • Use Azure SDKs for programming languages like Python, Java, C#, etc.

    • Utilize REST APIs to interact with Azure services programmatically.

    • Access and manage services through the Azure Portal.

    • Leverage Azure CLI for command-line interface interactions.

    • Automate tasks using Azure PowerShell scripts.

  • Answered by AI
  • Q2. What are linked Services?
  • Ans. 

    Linked Services are connections to external data sources or destinations in Azure Data Factory.

    • Linked Services define the connection information needed to connect to external data sources or destinations.

    • They can be used in Data Factory pipelines to read from or write to external systems.

    • Examples of Linked Services include Azure Blob Storage, Azure SQL Database, and Amazon S3.

  • Answered by AI

Cognizant Interview FAQs

What are the top questions asked in Cognizant Associate Data Engineer interview?

Some of the top questions asked at the Cognizant Associate Data Engineer interview -

  1. Datastge - How will you remove Header and trailer from Sequential data f...read more
  2. How to delete duplicate rows in ...read more
  3. How to find process id in lin...read more

Tell us how to improve this page.

Interview Questions from Similar Companies

TCS Interview Questions
3.7
 • 10.4k Interviews
Accenture Interview Questions
3.8
 • 8.2k Interviews
Infosys Interview Questions
3.6
 • 7.6k Interviews
Wipro Interview Questions
3.7
 • 5.6k Interviews
Capgemini Interview Questions
3.7
 • 4.8k Interviews
Tech Mahindra Interview Questions
3.5
 • 3.8k Interviews
HCLTech Interview Questions
3.5
 • 3.8k Interviews
Genpact Interview Questions
3.8
 • 3.1k Interviews
LTIMindtree Interview Questions
3.8
 • 3k Interviews
IBM Interview Questions
4.0
 • 2.4k Interviews
View all
Cognizant Associate Data Engineer Salary
based on 91 salaries
₹5 L/yr - ₹11.5 L/yr
9% less than the average Associate Data Engineer Salary in India
View more details

Cognizant Associate Data Engineer Reviews and Ratings

based on 3 reviews

3.1/5

Rating in categories

3.4

Skill development

3.1

Work-life balance

2.5

Salary

2.8

Job security

3.1

Company culture

2.7

Promotions

3.1

Work satisfaction

Explore 3 Reviews and Ratings
Associate
72.2k salaries
unlock blur

₹5.1 L/yr - ₹16 L/yr

Programmer Analyst
55.5k salaries
unlock blur

₹2.4 L/yr - ₹9.7 L/yr

Senior Associate
48.6k salaries
unlock blur

₹9 L/yr - ₹27.3 L/yr

Senior Processing Executive
28.9k salaries
unlock blur

₹1.8 L/yr - ₹9 L/yr

Technical Lead
17.6k salaries
unlock blur

₹5.9 L/yr - ₹24.8 L/yr

Explore more salaries
Compare Cognizant with

TCS

3.7
Compare

Infosys

3.6
Compare

Wipro

3.7
Compare

Accenture

3.8
Compare
Did you find this page helpful?
Yes No
write
Share an Interview