OSI Digital rating: 3.7, based on 128 reviews

OSI Digital Data Engineer Interview Questions and Answers

Updated 28 May 2024

OSI Digital Data Engineer Interview Experiences

1 interview found

Data Engineer Interview Questions & Answers

Anonymous

posted on 28 May 2024

Interview experience: 3 (Average)
Difficulty level: Moderate
Process Duration: Less than 2 weeks
Result: Not Selected

I was approached by the company and was interviewed in Nov 2023. There was 1 interview round.

Round 1 - Technical 

(3 Questions)

  • Q1. Lead and lag based questions
  • Q2. Coalesce and repartition in Spark
  • Ans. 

    Coalesce reduces the number of partitions in a DataFrame, while repartition reshuffles the data across a specified number of partitions in Spark.

    • Coalesce is used to reduce the number of partitions in a DataFrame without shuffling the data

    • Repartition is used to increase or decrease the number of partitions in a DataFrame by shuffling the data across the specified number of partitions

    • Coalesce is more efficient than repar...

  • Answered by AI
  • Q3. Some SQL questions
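The lead and lag question above usually refers to the SQL window functions LAG and LEAD. The sketch below is only an illustration, not the interviewer's actual question: the `sales` table and its rows are hypothetical, and it uses Python's built-in sqlite3 module, assuming the bundled SQLite is version 3.25 or newer (the first release with window functions).

```python
import sqlite3

# Hypothetical sample data: one sale amount per day.
rows = [
    ("2024-01-01", 100),
    ("2024-01-02", 120),
    ("2024-01-03", 90),
]

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (sale_date TEXT, amount INTEGER)")
conn.executemany("INSERT INTO sales VALUES (?, ?)", rows)

# LAG reads the previous row and LEAD the next row, in sale_date order.
# Both return NULL (None in Python) when no such row exists.
query = """
    SELECT sale_date,
           amount,
           LAG(amount)  OVER (ORDER BY sale_date) AS prev_amount,
           LEAD(amount) OVER (ORDER BY sale_date) AS next_amount
    FROM sales
    ORDER BY sale_date
"""
result = conn.execute(query).fetchall()
for row in result:
    print(row)
```

The same LAG and LEAD syntax is available in Spark SQL, so the query carries over with little change; typical follow-ups ask for day-over-day differences, which is `amount - LAG(amount) OVER (...)`.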

Interview Preparation Tips

Topics to prepare for OSI Digital Data Engineer interview:
  • SQL
  • Spark

Interview questions from similar companies

Interview experience: 5 (Excellent)
Difficulty level: Moderate
Process Duration: Less than 2 weeks
Result: Not Selected

I applied via Naukri.com and was interviewed in Oct 2024. There were 2 interview rounds.

Round 1 - One-on-one 

(2 Questions)

  • Q1. Azure Scenario based questions
  • Q2. Pyspark Coding based questions
Round 2 - One-on-one 

(2 Questions)

  • Q1. ADF, Databricks related question
  • Q2. Spark Performance problem and scenarios
Interview experience: 3 (Average)
Difficulty level: -
Process Duration: -
Result: -

I applied via Campus Placement

Round 1 - Aptitude Test 

Based on SQL, statistics, Python, and cognitive ability

Round 2 - Technical 

(2 Questions)

  • Q1. Questions on AI/ML and on my CV
  • Q2. Questions on my projects
Round 3 - HR 

(2 Questions)

  • Q1. How to handle toxic work culture?
  • Ans. 

    Address toxic work culture by open communication, setting boundaries, seeking support, and considering leaving if necessary.

    • Open communication with colleagues and management about issues

    • Set boundaries to protect your mental and emotional well-being

    • Seek support from HR, a mentor, or a therapist if needed

    • Consider leaving the toxic work environment if the situation does not improve

  • Answered by AI
  • Q2. 5 strengths and weaknesses

Interview Preparation Tips

Interview preparation tips for other job seekers - Be confident in interviews and try to calm your mind!
Interview experience: 4 (Good)
Difficulty level: -
Process Duration: -
Result: -
Round 1 - One-on-one 

(1 Question)

  • Q1. Basic Spark questions and Hive-related questions.

Interview Preparation Tips

Interview preparation tips for other job seekers - Good questions were asked, covering SQL, Spark, and Python.
Interview experience: 4 (Good)
Difficulty level: Moderate
Process Duration: Less than 2 weeks
Result: No response

I was interviewed in Aug 2024.

Round 1 - Technical 

(5 Questions)

  • Q1. Questions on Pyspark
  • Q2. Questions on SQL
  • Q3. Transformations
  • Q4. Questions on SQL optimizations
  • Q5. Questions about my current project
Interview experience: 5 (Excellent)
Difficulty level: Easy
Process Duration: Less than 2 weeks
Result: No response

I applied via Naukri.com and was interviewed in Oct 2024. There was 1 interview round.

Round 1 - One-on-one 

(2 Questions)

  • Q1. Incremental load in pyspark
  • Ans. 

    Incremental load in pyspark refers to loading only new or updated data into a dataset without reloading the entire dataset.

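    • Write to a Delta Lake table (format 'delta') and apply changes with a MERGE (upsert); the 'mergeSchema' write option only controls schema evolution, not which rows are loaded.

    • Use 'partitionBy' on the DataFrame writer to partition the output by specific columns (for example, a load date), so that each run touches only the relevant partitions.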

    • Implement a logic to identify new or updated records based on timestamps or uni...

  • Answered by AI
  • Q2. Drop duplicates
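The two questions above (incremental load and dropping duplicates) rest on logic that is independent of PySpark. The sketch below shows that logic in plain Python rather than the PySpark API: `incremental_load`, `drop_duplicates`, the `updated_at` column, and the sample rows are all hypothetical names chosen for illustration, and timestamps are assumed to be ISO date strings so that string comparison matches chronological order.

```python
def incremental_load(target, source, watermark):
    """Upsert source rows newer than `watermark` into `target`, keyed by id.

    Returns the new watermark (the latest `updated_at` seen).
    """
    new_watermark = watermark
    for row in source:
        if row["updated_at"] <= watermark:
            continue  # already loaded by an earlier run
        current = target.get(row["id"])
        # Insert new ids; overwrite existing ids only with strictly newer data.
        if current is None or row["updated_at"] > current["updated_at"]:
            target[row["id"]] = row
        new_watermark = max(new_watermark, row["updated_at"])
    return new_watermark


def drop_duplicates(rows, keys):
    """Keep the first row for each distinct combination of `keys`."""
    seen = set()
    unique = []
    for row in rows:
        key = tuple(row[column] for column in keys)
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


# Hypothetical data: the target already holds id 1 as of 2024-01-01.
target = {1: {"id": 1, "name": "a", "updated_at": "2024-01-01"}}
source = [
    {"id": 1, "name": "a", "updated_at": "2024-01-01"},   # unchanged
    {"id": 1, "name": "a2", "updated_at": "2024-01-03"},  # updated
    {"id": 2, "name": "b", "updated_at": "2024-01-02"},   # new
    {"id": 2, "name": "b", "updated_at": "2024-01-02"},   # exact duplicate
]

deduped = drop_duplicates(source, keys=["id", "updated_at"])
new_watermark = incremental_load(target, deduped, watermark="2024-01-01")
print(new_watermark, sorted(target))
```

In PySpark the same idea is typically expressed as a filter on the watermark column followed by a merge into the target table; the stored watermark is what lets each run skip rows it has already processed.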

Interview experience: 3 (Average)
Difficulty level: Moderate
Process Duration: Less than 2 weeks
Result: No response

I applied via Naukri.com and was interviewed in Mar 2024. There were 2 interview rounds.

Round 1 - Coding Test 

Technical Assessment Test (MCQs) - 30 mins.

Round 2 - Technical 

(4 Questions)

  • Q1. Basic questions on: 1. Python 2. SQL 3. Spark
  • Q2. Python: Replace all occurrences of the word "apple" with "froot" and count the number of occurrences of the word "apple" in a given multi-line string. SQL: 1. Retrieve employee details, drawing the maximum salar...
  • Q3. Convert string (multiple lines) to list
  • Ans. 

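    Use the splitlines() method to convert a string with multiple lines into a list of strings.

    • splitlines() splits on line boundaries and does not produce a trailing empty string; split('\n') also works, but leaves an empty last element when the string ends with a newline.

    • Example: 'Hello\nWorld\n'.splitlines() -> ['Hello', 'World'], whereas 'Hello\nWorld\n'.split('\n') -> ['Hello', 'World', '']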

  • Answered by AI
  • Q4. Convert a string of multiple lines with 'n' words to multiple arrays of fixed size k, with no overlap of elements across arrays.
  • Ans. 

    Convert a string of multiple lines with 'n' words to multiple arrays of fixed size without overlap.

    • Split the string into individual words

    • Slice the word list into consecutive, non-overlapping arrays of size 'k'

    • Handle cases where the number of words is not divisible by 'k': the last array then holds the remaining words and is shorter than 'k'

  • Answered by AI
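The three Python tasks in this round (replace and count a word, split a multi-line string, and chunk words into arrays of size k) can be sketched as follows. This is one possible reading of the questions, not the expected answer: the sample `text` is made up, `chunk_words` is a hypothetical helper name, and the sketch assumes "word" means a whole word, so "pineapple" is left alone.

```python
import re

# Hypothetical multi-line input.
text = "apple pie\nGreen apple and pineapple\napple"

# Replace whole-word "apple" with "froot" and count the replacements.
# re.subn returns a (new_string, number_of_substitutions) tuple.
replaced, count = re.subn(r"\bapple\b", "froot", text)

# Convert the multi-line string to a list of lines.
lines = text.splitlines()


def chunk_words(s, k):
    """Split `s` into words, then into consecutive chunks of at most k words."""
    words = s.split()  # splits on any whitespace, including newlines
    return [words[i:i + k] for i in range(0, len(words), k)]


chunks = chunk_words("a b c\nd e", 2)
print(count, lines, chunks)
```

If the question instead intends substring matching, `text.count("apple")` and `text.replace("apple", "froot")` are the simpler choice, and they would also match inside "pineapple".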

Interview Preparation Tips

Topics to prepare for Altimetrik Data Engineer interview:
  • Python
  • SQL
  • PySpark

Interview experience: 5 (Excellent)
Difficulty level: Moderate
Process Duration: -
Result: Selected

I applied via Campus Placement and was interviewed in Aug 2024. There were 2 interview rounds.

Round 1 - Aptitude Test 

Java and SQL questions

Round 2 - Coding Test 

Simple Java programs to find a factorial and to check for a prime number
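
The round asked for Java, but the logic for both programs is language-independent. A minimal sketch in Python, with `factorial` and `is_prime` as illustrative function names that translate line for line into Java:

```python
def factorial(n):
    """Return n! for a non-negative integer n, computed iteratively."""
    if n < 0:
        raise ValueError("factorial is undefined for negative numbers")
    result = 1
    for i in range(2, n + 1):
        result *= i
    return result


def is_prime(n):
    """Return True if n is prime, using trial division up to sqrt(n)."""
    if n < 2:
        return False
    i = 2
    while i * i <= n:
        if n % i == 0:
            return False
        i += 1
    return True


print(factorial(5), is_prime(97))
```

In Java the factorial result overflows a `long` beyond 20!, so an interviewer may expect a mention of `BigInteger`; Python integers do not have that limit.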

Interview experience: 4 (Good)
Difficulty level: Moderate
Process Duration: Less than 2 weeks
Result: Selected

I applied via Referral and was interviewed in Apr 2024. There were 2 interview rounds.

Round 1 - One-on-one 

(1 Question)

  • Q1. Different types of licenses in Power BI. Data modelling.
  • Ans. 

    Power BI offers several license types, including Power BI Pro and Power BI Premium.

    • Power BI Pro license allows users to create and share reports and dashboards with others.

    • Power BI Premium license offers additional features such as larger data capacity and advanced AI capabilities.

    • Power BI Embedded license is designed for embedding reports and dashboards into custom applications.

    • Power BI Report ...

  • Answered by AI
Round 2 - HR 

(2 Questions)

  • Q1. What do you know about Tiger?
  • Q2. General HR questions

Interview Preparation Tips

Topics to prepare for Tiger Analytics Data Engineer interview:
  • Power BI
  • Data Modeling
  • SQL

Interview experience: 3 (Average)
Difficulty level: Moderate
Process Duration: -
Result: No response

I applied via LinkedIn and was interviewed in Jan 2024. There was 1 interview round.

Round 1 - Technical 

(4 Questions)

  • Q1. What is Pyspark?
  • Ans. 

    Pyspark is a Python API for Apache Spark, a powerful open-source distributed computing system.

    • Pyspark is used for processing large datasets in parallel across a cluster of computers.

    • It provides high-level APIs in Python for Spark programming.

    • Pyspark allows seamless integration with other Python libraries like Pandas and NumPy.

    • Example: Using Pyspark to perform data analysis and machine learning tasks on big data sets.

  • Answered by AI
  • Q2. What is Pyspark SQL?
  • Ans. 

    Pyspark SQL is a module in Apache Spark that provides a SQL interface for working with structured data.

    • Pyspark SQL allows users to run SQL queries on Spark dataframes.

    • It provides a more concise and user-friendly way to interact with data compared to traditional Spark RDDs.

    • Users can leverage the power of SQL for data manipulation and analysis within the Spark ecosystem.

  • Answered by AI
  • Q3. How to merge 2 dataframes of different schema?
  • Ans. 

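    To merge 2 dataframes of different schema, first align the schemas, then combine them with a join or a union, depending on the requirement.

    • Use join operations like inner join, outer join, left join, or right join when the dataframes share a key and the rows should be combined side by side.

    • Perform data transformation to align the schemas before merging, for example by adding missing columns as nulls and casting mismatched types.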

    • Use tools like Apache Spark, Pandas, or SQL to merge dataframes with different schemas.

  • Answered by AI
  • Q4. What is Pyspark streaming?
  • Ans. 

    Pyspark streaming is a scalable and fault-tolerant stream processing engine built on top of Apache Spark.

    • Pyspark streaming allows for real-time processing of streaming data.

    • It provides high-level APIs in Python for creating streaming applications.

    • Pyspark streaming supports various data sources like Kafka, Flume, Kinesis, etc.

    • It enables windowed computations and stateful processing for handling streaming data.

    • Example: C...

  • Answered by AI
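
For the question on merging 2 dataframes of different schema, the schema-alignment step can be illustrated without Spark. The sketch below models each dataframe as a list of dicts; `union_by_name` and the sample rows are hypothetical. In PySpark itself, `DataFrame.unionByName(other, allowMissingColumns=True)` (Spark 3.1 and later) fills missing columns with nulls in a similar way.

```python
def union_by_name(left, right):
    """Stack two row lists, filling columns missing from either side with None."""
    # Collect the union of column names, preserving first-seen order.
    columns = []
    for row in left + right:
        for column in row:
            if column not in columns:
                columns.append(column)
    # Rebuild every row against the full column list.
    return [{column: row.get(column) for column in columns} for row in left + right]


# Hypothetical inputs with only the "id" column in common.
left = [{"id": 1, "name": "a"}]
right = [{"id": 2, "city": "x"}]

merged = union_by_name(left, right)
for row in merged:
    print(row)
```

A union like this stacks rows; when the goal is to place columns from both dataframes on the same row, a join on the shared key is the appropriate operation instead.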

Interview Preparation Tips

Topics to prepare for Luxoft Data Engineer interview:
  • Pyspark

OSI Digital Interview FAQs

How many rounds are there in OSI Digital Data Engineer interview?
The OSI Digital interview process usually has 1 round. The most common round in the OSI Digital interview process is Technical.
What are the top questions asked in OSI Digital Data Engineer interview?

Some of the top questions asked at the OSI Digital Data Engineer interview:

  1. Coalesce and repartition in sp...
  2. Lead and lag based questi...
  3. Some SQL questi...

OSI Digital Data Engineer Salary
based on 4 salaries
₹3.4 L/yr - ₹7.5 L/yr
51% less than the average Data Engineer Salary in India

Senior Software Engineer (155 salaries): ₹6.2 L/yr - ₹19.4 L/yr
Software Engineer (154 salaries): ₹3.2 L/yr - ₹12 L/yr
Associate Software Engineer (124 salaries): ₹3 L/yr - ₹7.5 L/yr
Associate Technical Leader (61 salaries): ₹10 L/yr - ₹23.2 L/yr
Technical Lead (58 salaries): ₹12.4 L/yr - ₹26 L/yr