Dr. Reddy's Interview Questions and Answers

Question 1

Asked in

Lead Engineer Interview

Q1. How will you create a end to end pipelines in Data Factory if my source and target are SQL and ASQL respectively?

Add your answer

Answer

Create end to end pipelines in Data Factory with SQL and ASQL as source and target respectively.

Use Copy Data activity to move data from SQL source to ASQL target
Define linked services for SQL and ASQL in Data Factory
Create datasets for SQL and ASQL tables
Map columns between source and target datasets
Configure data flow activities for any transformations needed

Question 2

Asked in

Lead Engineer Interview

Q2. How Node.js works under the hood?

Add your answer

Answer

Node.js is a JavaScript runtime built on Chrome's V8 JavaScript engine.

Node.js uses an event-driven, non-blocking I/O model.
It has a single-threaded event loop that handles all I/O operations.
Node.js modules are cached to improve performance.
It supports both synchronous and asynchronous programming.
Node.js has a built-in HTTP server module for creating web servers.
It can be used for building scalable network applications.

Question 3

Asked in

Lead Engineer Interview

Q3. What are python libraries used as a data engineer?

Add your answer

Answer

Python libraries commonly used by data engineers include Pandas, NumPy, Matplotlib, and Scikit-learn.

Pandas: Used for data manipulation and analysis.
NumPy: Provides support for large, multi-dimensional arrays and matrices.
Matplotlib: Used for creating visualizations and plots.
Scikit-learn: Offers machine learning algorithms and tools for data analysis.

Question 4

Asked in

Lead Engineer Interview

Q4. What is Delta Lake and its benefits

Add your answer

Answer

Delta Lake is an open-source storage layer that brings ACID transactions to Apache Spark and big data workloads.

Provides ACID transactions for big data workloads
Ensures data reliability and quality by enabling schema enforcement and data versioning
Supports batch and streaming data processing
Improves data quality and reliability by enabling schema enforcement and data versioning

Question 5

Asked in

Lead Engineer Interview

Q5. Worker threads vs child process?

Add your answer

Answer

Worker threads are lightweight and share memory, while child processes are heavier and have separate memory spaces.

Worker threads are useful for tasks that require parallel processing and sharing of data.
Child processes are useful for tasks that require isolation and fault tolerance.
Worker threads are faster to create and destroy than child processes.
Examples of worker thread libraries include pthreads and Java's Executor framework.
Examples of child process libraries include ...read more

Question 6

Asked in

Lead Engineer Interview

Q6. Call va bind vs apply?

Add your answer

Answer

Call, apply and bind are methods used to set the value of 'this' in a function.

Call and apply are used to invoke a function with a specific 'this' value and arguments passed as an array or list respectively.
Bind is used to create a new function with a specific 'this' value and arguments passed as an array or list.
Call and apply are similar, but apply is used when the number of arguments is not known beforehand.
Bind returns a new function that can be called later with the spec...read more

Question 7

Asked in

Lead Engineer Interview

Q7. What is PySpark

Add your answer

Answer

PySpark is a Python API for Apache Spark, a powerful open-source distributed computing system.

PySpark allows users to write Spark applications using Python programming language.
It provides high-level APIs in Python for Spark's functionality, making it easier to work with large datasets.
PySpark can be used for data processing, machine learning, graph processing, and more.
Example: PySpark can be used to perform data analysis on large datasets stored in distributed systems like ...read more

Dr. Reddy's Interview Questions and Answers

Q1. How will you create a end to end pipelines in Data Factory if my source and target are SQL and ASQL respectively?

Q2. How Node.js works under the hood?

Q3. What are python libraries used as a data engineer?

Q4. What is Delta Lake and its benefits

Q5. Worker threads vs child process?

Q6. Call va bind vs apply?

Q7. What is PySpark

More about working at ITC Infotech

Interview Process at Dr. Reddy's

Top Lead Engineer Interview Questions from Similar Companies