Home
Communities
Companies
- Companies
  
  Discover best places to work
- Compare Companies
  
  Compare & find best workplace
- Add Office Photos
  
  Bring your workplace to life
- Add Company Benefits
  
  Highlight your company's perks
Reviews
- Company reviews
  
  Read reviews for 6L+ companies
- Write a review
  
  Rate your former or current company
Salaries
- Browse salaries
  
  Discover salaries for 6L+ companies
- Salary calculator
  
  Calculate your take home salary
- Are you paid fairly?
  
  Check your market value
- Share your salary
  
  Help other jobseekers
- Gratuity calculator
  
  Check your gratuity amount
- HRA calculator
  
  Check how much of your HRA is tax-free
- Salary hike calculator
  
  Check your salary hike
Interviews
- Company interviews
  
  Read interviews for 40K+ companies
- Campus placements
  
  Interviews questions for 2K+ colleges
- Share interview questions
  
  Contribute your interview questions
Jobs
Awards

WINNERS AWAITED!
- ABECA 2025
  
  WINNERS AWAITED!
  
  AmbitionBox Employee Choice Awards - 4th Edition
- ABECA 2024
  
  AmbitionBox Employee Choice Awards - 3rd Edition
- AmbitionBox Best Places to Work 2022
  
  2nd Edition
- AmbitionBox Best Places to Work 2021
  
  1st Edition

Add office photos

Employer? Claim Account for FREE

Innova Solutions

Compare

3.4

based on 821 Reviews

Filter interviews by

Innova Solutions Big Data Developer Interview Questions and Answers

Updated 9 Nov 2021

Innova Solutions Big Data Developer Interview Experiences

1 interview found

Big Data Developer Interview Questions & Answers

Anonymous

posted on 9 Nov 2021

I applied via Naukri.com and was interviewed in Oct 2021. There were 3 interview rounds.

Interview Questionnaire

1 Question

Q1. Difference between DF and DS/ Big data scenario-based questions and SQL queries

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - Brush up Scala/Python coding skills

Big Data Developer Jobs at Innova Solutions

View all

Big Data Developer

Chennai, Bangalore / Bengaluru

8-13 Yrs

₹ 7-16 LPA

Top trending discussions

View All

Salary Discussions, Hike & Promotions

fathersahaab

works at

AmbitionBox

New Job, Higher Pay, Now I’m Feeling Awkward

I’ve been at my new job for about six months now, and everything’s been going great! I’m getting positive feedback from my manager, and I get along well with the team. The thing is, when I started, my salary offer ended up being much higher than the initial number discussed during my interview. I didn’t negotiate and just accepted the offer. So, fast forward to happy hour with the team, and the topic of salary comes up. I, unfortunately, shared what I’m making, and let’s just say... it didn’t sit well with the others who have been on the team for years and make less than me. They weren’t mad at me, but now I’m feeling a bit uncomfortable and unsure how to handle this situation. Has anyone had something like this happen? How did you deal with it? Let’s chat!

Got a question about Innova Solutions?

Ask anonymously on communities.

Interview questions from similar companies

Big Data Developer Interview Questions & Answers

Wipro

Dhanendra Dwivedi

posted on 27 May 2022

I applied via Naukri.com and was interviewed before May 2021. There were 3 interview rounds.

Round 1 - Technical

(1 Question)

Q1. Complete technical evaluations/questions related to the Tools i.e. Hadoop, Hive, Impala, Spark, Sqoop, SQL, Scala, Python.

Add your answer

Round 2 - Group Discussion

2nd round is technical + Manager both discussions

Round 3 - HR

(1 Question)

Q1. HR discussion related Role, Salary, location etc.

Add your answer

Interview Preparation Tips

Topics to prepare for Wipro Big Data Developer interview:

Hadoop
Spark
Agile Methodology
SCALA
Python
SQL
impala
Cloud
Hive

Interview preparation tips for other job seekers - Prepare well before attending the interview and your technical skills and knowledge should be enough.

Data Engineer Interview Questions & Answers

Accenture

Anonymous

posted on 3 Jan 2025

Interview experience

Excellent

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

Not Selected

I applied via Naukri.com and was interviewed in Dec 2024. There was 1 interview round.

Round 1 - Technical

(5 Questions)

Q1. Scenario based questions on Azure data factory and pipelines

Add your answer

Q2. Optimisation technic to improve the performance of databricks

Add your answer

Q3. What is Autoloader

Add your answer

Q4. What is unity catalog

Add your answer

Q5. How you do the alerting mechanism in adf for failed pipelines

Add your answer

Data Engineer Interview Questions & Answers

LTIMindtree

Anonymous

posted on 7 Nov 2024

Interview experience

Average

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

No response

I applied via Naukri.com and was interviewed in Oct 2024. There were 2 interview rounds.

Round 1 - Technical

(7 Questions)

Q1. How do you optimize SQL queries?

Ans.

Optimizing SQL queries involves using indexes, avoiding unnecessary joins, and optimizing the query structure.

Use indexes on columns frequently used in WHERE clauses
Avoid using SELECT * and only retrieve necessary columns
Optimize joins by using INNER JOIN instead of OUTER JOIN when possible
Use EXPLAIN to analyze query performance and make necessary adjustments

Answered by AI

Add your answer

Q2. How do you do performance optimization in Spark. Tell how you did it in you project.

Ans.

Performance optimization in Spark involves tuning configurations, optimizing code, and utilizing caching.

Tune Spark configurations such as executor memory, number of executors, and shuffle partitions.
Optimize code by reducing unnecessary shuffles, using efficient transformations, and avoiding unnecessary data movements.
Utilize caching to store intermediate results in memory and avoid recomputation.
Example: In my projec...

Answered by AI

Add your answer

Q3. What is SparkContext and SparkSession?

Ans.

SparkContext is the main entry point for Spark functionality, while SparkSession is the entry point for Spark SQL.

SparkContext is the entry point for low-level API functionality in Spark.
SparkSession is the entry point for Spark SQL functionality.
SparkContext is used to create RDDs (Resilient Distributed Datasets) in Spark.
SparkSession provides a unified entry point for reading data from various sources and performing

Answered by AI

Add your answer

Q4. When a spark job is submitted, what happens at backend. Explain the flow.

Ans.

When a spark job is submitted, various steps are executed at the backend to process the job.

The job is submitted to the Spark driver program.
The driver program communicates with the cluster manager to request resources.
The cluster manager allocates resources (CPU, memory) to the job.
The driver program creates DAG (Directed Acyclic Graph) of the job stages and tasks.
Tasks are then scheduled and executed on worker nodes ...

Answered by AI

View 1 more answer

Q5. Calculate second highest salary using SQL as well as pyspark.

Ans.

Calculate second highest salary using SQL and pyspark

Use SQL query with ORDER BY and LIMIT to get the second highest salary
In pyspark, use orderBy() and take() functions to achieve the same result

Answered by AI

Add your answer

Q6. 2 types of modes for Spark architecture ?

Ans.

The two types of modes for Spark architecture are standalone mode and cluster mode.

Standalone mode: Spark runs on a single machine with a single JVM and is suitable for development and testing.
Cluster mode: Spark runs on a cluster of machines managed by a cluster manager like YARN or Mesos for production workloads.

Answered by AI

Add your answer

Q7. If you want very less latency - which is better standalone or client mode?

Ans.

Client mode is better for very less latency due to direct communication with the cluster.

Client mode allows direct communication with the cluster, reducing latency.
Standalone mode requires an additional layer of communication, increasing latency.
Client mode is preferred for real-time applications where low latency is crucial.

Answered by AI

Add your answer

Round 2 - Technical

(2 Questions)

Q1. Scenario based. Write SQL and pyspark code for a dataset.

Add your answer

Q2. If you have to find latest record based on latest timestamp in a table for a particular customer(table is having history) , how will you do it. Self join and nested query will be expensive. Optimized query...

Add your answer

Interview Preparation Tips

Topics to prepare for LTIMindtree Data Engineer interview:

SQL
pyspark
ETL

Interview preparation tips for other job seekers - L2 was scheduled next day to L1 so the process is fast. Brush up your practical knowledge more.

Skills evaluated in this interview

Data Engineer Interview Questions & Answers

Genpact

Sashikanta Parida

posted on 17 Dec 2024

Interview experience

Excellent

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

Not Selected

I applied via Recruitment Consulltant and was interviewed in Nov 2024. There were 2 interview rounds.

Round 1 - Technical

(3 Questions)

Q1. What are different type of joins available in Databricks?

Ans.

Different types of joins available in Databricks include inner join, outer join, left join, right join, and cross join.

Inner join: Returns only the rows that have matching values in both tables.
Outer join: Returns all rows when there is a match in either table.
Left join: Returns all rows from the left table and the matched rows from the right table.
Right join: Returns all rows from the right table and the matched rows ...

Answered by AI

Add your answer

Q2. How do you make your data pipeline fault tolerant?

Ans.

Implementing fault tolerance in a data pipeline involves redundancy, monitoring, and error handling.

Use redundant components to ensure continuous data flow
Implement monitoring tools to detect failures and bottlenecks
Set up automated alerts for immediate response to issues
Design error handling mechanisms to gracefully handle failures
Use checkpoints and retries to ensure data integrity

Answered by AI

Add your answer

Q3. What is AutoLoader?

Ans.

AutoLoader is a feature in data engineering that automatically loads data from various sources into a data warehouse or database.

Automates the process of loading data from different sources
Reduces manual effort and human error
Can be scheduled to run at specific intervals
Examples: Apache Nifi, AWS Glue

Answered by AI

Add your answer

Round 2 - Technical

(2 Questions)

Q1. How do you connect to different services in Azure?

Ans.

To connect to different services in Azure, you can use Azure SDKs, REST APIs, Azure Portal, Azure CLI, and Azure PowerShell.

Use Azure SDKs for programming languages like Python, Java, C#, etc.
Utilize REST APIs to interact with Azure services programmatically.
Access and manage services through the Azure Portal.
Leverage Azure CLI for command-line interface interactions.
Automate tasks using Azure PowerShell scripts.

Answered by AI

Add your answer

Q2. What are linked Services?

Ans.

Linked Services are connections to external data sources or destinations in Azure Data Factory.

Linked Services define the connection information needed to connect to external data sources or destinations.
They can be used in Data Factory pipelines to read from or write to external systems.
Examples of Linked Services include Azure Blob Storage, Azure SQL Database, and Amazon S3.

Answered by AI

Add your answer

Data Engineer Interview Questions & Answers

Capgemini

Brijesh yadav

posted on 9 Jan 2025

Interview experience

Good

Difficulty level

Process Duration

Result

Round 1 - Technical

(3 Questions)

Q1. What are the optimization techniques used in Apache Spark?

Add your answer

Q2. 2 SQL queries , 1 PySpark code and 1 Python Code .

Add your answer

Q3. 2-3 Scenario Based questions from ADF and databricks .

Add your answer

Data Analyst Interview Questions & Answers

Atos

Pranita Wagh

posted on 27 Jan 2025

Interview experience

Bad

Difficulty level

Easy

Process Duration

Less than 2 weeks

Result

Selected

I was interviewed in Dec 2024.

Round 1 - HR

(2 Questions)

Q1. Can you tell me about yourself? Why do you want to work as a data analyst? What do you know about our company?

Add your answer

Q2. What are your short-term and long-term career goals in the field of data analysis? Why do you wish to pursue a career in data analysis? What factors motivate you to work in a data-driven environment?

Add your answer

Data Analyst Interview Questions & Answers

NTT Data

Anonymous

posted on 29 Jan 2025

Interview experience

Excellent

Difficulty level

Process Duration

Result

Round 1 - Technical

(2 Questions)

Q1. What is the importance of data analysis?

Add your answer

Q2. What steps should be taken to become a data analyst?

Add your answer

Senior Data Engineer Interview Questions & Answers

CGI Group

Anonymous

posted on 18 Dec 2024

Interview experience

Average

Difficulty level

Moderate

Process Duration

Less than 2 weeks

Result

Not Selected

I applied via Naukri.com and was interviewed in Nov 2024. There was 1 interview round.

Round 1 - Technical

(2 Questions)

Q1. How do you utilize the enhanced optimization option in AWS Glue?

Ans.

Enhanced optimization in AWS Glue improves job performance by automatically adjusting resources based on workload

Enhanced optimization in AWS Glue automatically adjusts resources like DPUs based on workload
It helps improve job performance by optimizing resource allocation
Users can enable enhanced optimization in AWS Glue job settings

Answered by AI

Add your answer

Q2. What are the best practices for optimizing querying in Amazon Redshift?

Ans.

Optimizing querying in Amazon Redshift involves proper table design, distribution keys, sort keys, and query optimization techniques.

Use appropriate distribution keys to evenly distribute data across nodes for parallel processing.
Utilize sort keys to physically order data on disk, reducing the need for sorting during queries.
Avoid using SELECT * and instead specify only the columns needed to reduce data transfer.
Use AN...

Answered by AI

Add your answer

Data Engineer Interview Questions & Answers

Cognizant

Abhishek Paithankar

posted on 16 Nov 2024

Interview experience

Excellent

Difficulty level

Process Duration

Result

Round 1 - Aptitude Test

Aptitude test involved with quantative aptitude, logical reasoning and reading comprehensions.

Round 2 - Technical

(2 Questions)

Q1. Tell me your introduction.

Add your answer

Q2. Tell me about your skills.

Ans.

I have strong skills in data processing, ETL, data modeling, and programming languages like Python and SQL.

Proficient in data processing and ETL techniques
Strong knowledge of data modeling and database design
Experience with programming languages like Python and SQL
Familiarity with big data technologies such as Hadoop and Spark

Answered by AI

Add your answer

Round 3 - HR

(2 Questions)

Q1. Are you ready relocate,?

Ans.

Yes, I am open to relocating for the right opportunity.

I am willing to relocate for the right job opportunity.
I have experience moving for previous roles.
I am flexible and adaptable to new locations.
I am excited about the possibility of exploring a new city or country.

Answered by AI

Add your answer

Q2. Document verification

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - If you are fresher first prepare for aptitude, because once aptitude get cleared you will get selected from the large compitition and then focus on your technical knowledge and managerial skills about the company.

Innova Solutions Interview FAQs

How to prepare for Innova Solutions Big Data Developer interview?

Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at Innova Solutions. The most common topics and skills that interviewers at Innova Solutions expect are Big Data, Spark, Python, SCALA and Hive.

Tell us how to improve this page.

Innova Solutions Interviews By Designations

Interview Questions for Popular Designations

TCS Interview Questions

3.7

• 10.4k Interviews

Accenture Interview Questions

3.9

• 8.1k Interviews

Infosys Interview Questions

3.7

• 7.6k Interviews

Wipro Interview Questions

3.7

• 5.6k Interviews

Cognizant Interview Questions

3.8

• 5.6k Interviews

Capgemini Interview Questions

3.8

• 4.8k Interviews

Tech Mahindra Interview Questions

3.5

• 3.8k Interviews

HCLTech Interview Questions

3.5

• 3.8k Interviews

Genpact Interview Questions

3.9

• 3k Interviews

LTIMindtree Interview Questions

3.8

• 3k Interviews

View all

Institute of Management Technology (IMT), Ghaziabad Placement Questions

1 Interview

Gokaraju Rangaraju Institute of Engineering and Technology, Hyderabad Placement Questions

1 Interview

Indian Institute of Management (IIM), Ranchi Placement Questions

1 Interview

Nitte Meenakshi Institute of Technology, Bangalore Placement Questions

1 Interview

Ramaiah Institute of Technology, Bengaluru Placement Questions

1 Interview

View all

Innova Solutions Big Data Developer Salary

based on 5 salaries

₹6 L/yr - ₹18 L/yr

24% more than the average Big Data Developer Salary in India

View more details

Big Data Developer Jobs at Innova Solutions

Big Data Developer

Chennai,

Bangalore / Bengaluru

8-13 Yrs

₹ 7-16 LPA

Explore more jobs

Innova Solutions Salaries in India

Senior Software Engineer 632 salaries	₹7 L/yr - ₹28 L/yr
Software Engineer 522 salaries	₹4.8 L/yr - ₹17.5 L/yr
Associate Software Engineer 337 salaries	₹5 L/yr - ₹9.1 L/yr
Principal Software Engineer 161 salaries	₹12.5 L/yr - ₹35.2 L/yr
Senior Associate 149 salaries	₹4 L/yr - ₹10 L/yr