Upload Button Icon Add office photos

Filter interviews by

Virtusa Consulting Services Big Data Engineer Interview Questions, Process, and Tips

Updated 14 Dec 2024

Top Virtusa Consulting Services Big Data Engineer Interview Questions and Answers

View all 6 questions

Virtusa Consulting Services Big Data Engineer Interview Experiences

2 interviews found

Interview experience
4
Good
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
No response

I was interviewed in Nov 2024.

Round 1 - One-on-one 

(7 Questions)

  • Q1. Command to check disk utilisation and health in Hadoop
  • Ans. 

    Use 'hdfs diskbalancer' command to check disk utilisation and health in Hadoop

    • Run 'hdfs diskbalancer -report' to get a report on disk utilisation

    • Use 'hdfs diskbalancer -plan <path>' to generate a plan for balancing disk usage

    • Check the Hadoop logs for any disk health issues

  • Answered by AI
  • Q2. Spark Architecture & the significance of each member of spark Architecture
  • Ans. 

    Spark Architecture consists of Driver, Cluster Manager, and Executors. Driver manages the execution of Spark jobs.

    • Driver: Manages the execution of Spark jobs, converts user code into tasks, and coordinates with Cluster Manager.

    • Cluster Manager: Manages resources across the cluster and allocates resources to Spark applications.

    • Executors: Execute tasks assigned by the Driver and store data in memory or disk for further pr...

  • Answered by AI
  • Q3. Partitioning and bucketing
  • Q4. Spark optimization techniques
  • Ans. 

    Optimization techniques in Spark improve performance and efficiency of data processing.

    • Partitioning data to distribute workload evenly

    • Caching frequently accessed data in memory

    • Using broadcast variables for small lookup tables

    • Avoiding shuffling operations whenever possible

    • Tuning memory settings and garbage collection parameters

  • Answered by AI
  • Q5. Second highest salary
  • Ans. 

    I am unable to provide this information as it is confidential.

    • Confidential information about salaries in previous organizations should not be disclosed.

    • It is important to respect the privacy and confidentiality of past employers.

    • Discussing specific salary details may not be appropriate in a professional setting.

  • Answered by AI
  • Q6. Pivot table creation in SQL from not pivot one
  • Ans. 

    To create a pivot table in SQL from a non-pivot table, you can use the CASE statement with aggregate functions.

    • Use the CASE statement to categorize data into columns

    • Apply aggregate functions like SUM, COUNT, AVG, etc. to calculate values for each category

    • Group the data by the columns you want to pivot on

  • Answered by AI
  • Q7. How to create triggers
  • Ans. 

    Creating triggers in a database involves defining the trigger, specifying the event that will activate it, and writing the code to be executed.

    • Define the trigger using the CREATE TRIGGER statement

    • Specify the event that will activate the trigger (e.g. INSERT, UPDATE, DELETE)

    • Write the code or actions to be executed when the trigger is activated

    • Test the trigger to ensure it functions as intended

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Easy to medium questions were asked.
They are focusing on concept basically

Skills evaluated in this interview

I applied via Referral and was interviewed in Mar 2022. There was 1 interview round.

Round 1 - Technical 

(3 Questions)

  • Q1. Spark optimization techniques
  • Ans. 

    Spark optimization techniques improve performance and efficiency of Spark applications.

    • Partitioning data to reduce shuffling

    • Caching frequently used data

    • Using broadcast variables for small data

    • Using efficient data formats like Parquet

    • Tuning memory and CPU usage

    • Using appropriate cluster size

    • Avoiding unnecessary data shuffling

    • Using appropriate serialization formats

    • Using appropriate join strategies

  • Answered by AI
  • Q2. What type of filesystem used in ur project
  • Ans. 

    We use Hadoop Distributed File System (HDFS) for our project.

    • HDFS is a distributed file system designed to run on commodity hardware.

    • It provides high-throughput access to application data and is fault-tolerant.

    • HDFS is used by many big data processing frameworks like Hadoop, Spark, etc.

    • It stores data in a distributed manner across multiple nodes in a cluster.

    • HDFS is optimized for large files and sequential reads and wri

  • Answered by AI
  • Q3. Spark + SQL basic coding

Interview Preparation Tips

Interview preparation tips for other job seekers - Just concentrate on the basics

Skills evaluated in this interview

Big Data Engineer Interview Questions Asked at Other Companies

Q1. Difference between partitioning and bucketing. Types of joins in ... read more
asked in TCS
Q2. What optimization techniques have you utilized in your projects? ... read more
Q3. Checking whether a fibonacci number is present between a particuk ... read more
asked in TCS
Q4. What is the difference between lineage and directed acyclic graph ... read more
asked in TCS
Q5. What is the difference between cache and persistence?

Interview questions from similar companies

Interview experience
4
Good
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
-

I applied via Referral and was interviewed in Dec 2024. There were 2 interview rounds.

Round 1 - Aptitude Test 

30 Questions in 20 Minutes

Round 2 - Technical 

(1 Question)

  • Q1. Baiscs of SQL,Python,AWS and spark in depth question
Interview experience
1
Bad
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Not Selected

I applied via Naukri.com and was interviewed in Jul 2023. There were 2 interview rounds.

Round 1 - Resume Shortlist 
Pro Tip by AmbitionBox:
Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, make sure to leave the best impression.
View all tips
Round 2 - Technical 

(2 Questions)

  • Q1. Basic Questions of Scala Functional Programming concepts.
  • Q2. Spark internal working and optimization techniques
  • Ans. 

    Spark internal working and optimization techniques

    • Spark uses Directed Acyclic Graph (DAG) for optimizing workflows

    • Lazy evaluation helps in optimizing transformations by combining them into a single stage

    • Caching and persistence of intermediate results can improve performance

    • Partitioning data can help in parallel processing and reducing shuffle operations

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - The interview call was abruptly terminated into 20 mins of call duration, as the HR had another conflicting call. The HR called me over cellphone and told that if the interview panel requested she will let me know and the call can be extended, but the HR did not call, The interview did not extended. Finally they rejected me just after the panel spoke 20 mins with me in the 1st round interview. This shows how unprofessional are they in scheduling an interview call and how could any panel can decide within 20 mins of a discussion. Definitely not recommending anyone to attend Bigdata Engineering interviews here.

Skills evaluated in this interview

Interview experience
3
Average
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Aptitude Test 

The aptitude test lasts 30 minutes and focuses on topics relevant to data engineering, including Spark, SQL, Azure, and PySpark.

Round 2 - Coding Test 

The coding test is a one-hour examination on PySpark.

Round 3 - Technical 

(3 Questions)

  • Q1. What is the difference between Cache() and Persist()?
  • Q2. What does the purpose of the Spark Submit command in Apache Spark?
  • Q3. What are window functions in SQL?
Round 4 - HR 

(2 Questions)

  • Q1. Could you provide more details about the daily responsibilities associated with this role?
  • Q2. How would you describe your work culture?
Interview experience
1
Bad
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
No response

I applied via Job Fair and was interviewed in Nov 2024. There were 2 interview rounds.

Round 1 - Technical 

(2 Questions)

  • Q1. DAX Related Syntax and Codes
  • Q2. Data Modelling, SQL, Python
Round 2 - Technical 

(1 Question)

  • Q1. No Response from HR after calling of selection after Round 1
Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-

I was interviewed in Dec 2024.

Round 1 - Technical 

(2 Questions)

  • Q1. Windows function-related questions.
  • Q2. Join Related Questions.
Round 2 - Technical 

(2 Questions)

  • Q1. Join related Questions
  • Q2. Subqueries related queations.
Interview experience
3
Average
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Not Selected

I applied via Naukri.com and was interviewed in Nov 2024. There was 1 interview round.

Round 1 - Technical 

(2 Questions)

  • Q1. How do you utilize the enhanced optimization option in AWS Glue?
  • Ans. 

    Enhanced optimization in AWS Glue improves job performance by automatically adjusting resources based on workload

    • Enhanced optimization in AWS Glue automatically adjusts resources like DPUs based on workload

    • It helps improve job performance by optimizing resource allocation

    • Users can enable enhanced optimization in AWS Glue job settings

  • Answered by AI
  • Q2. What are the best practices for optimizing querying in Amazon Redshift?
  • Ans. 

    Optimizing querying in Amazon Redshift involves proper table design, distribution keys, sort keys, and query optimization techniques.

    • Use appropriate distribution keys to evenly distribute data across nodes for parallel processing.

    • Utilize sort keys to physically order data on disk, reducing the need for sorting during queries.

    • Avoid using SELECT * and instead specify only the columns needed to reduce data transfer.

    • Use AN...

  • Answered by AI
Interview experience
5
Excellent
Difficulty level
Hard
Process Duration
2-4 weeks
Result
-

I applied via Naukri.com and was interviewed in Sep 2024. There was 1 interview round.

Round 1 - One-on-one 

(2 Questions)

  • Q1. What is scd type 2?
  • Ans. 

    SCD type 2 is a method used in data warehousing to track historical changes by creating a new record for each change.

    • SCD type 2 stands for Slowly Changing Dimension type 2

    • It involves creating a new record in the dimension table whenever there is a change in the data

    • The old record is marked as inactive and the new record is marked as current

    • It allows for historical tracking of changes in data over time

    • Example: If a cust...

  • Answered by AI
  • Q2. Pyspark question read CSV from folder and add column in each csv file and write it to different location.

Skills evaluated in this interview

Interview experience
1
Bad
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Technical 

(1 Question)

  • Q1. Data engineer roles and resposibilities

Interview Preparation Tips

Interview preparation tips for other job seekers - Don't Go, worst management service and lots of office politics.

Virtusa Consulting Services Interview FAQs

How many rounds are there in Virtusa Consulting Services Big Data Engineer interview?
Virtusa Consulting Services interview process usually has 1 rounds. The most common rounds in the Virtusa Consulting Services interview process are Technical and One-on-one Round.
How to prepare for Virtusa Consulting Services Big Data Engineer interview?
Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at Virtusa Consulting Services. The most common topics and skills that interviewers at Virtusa Consulting Services expect are Big Data, Spark, Hadoop, Hive and Python.
What are the top questions asked in Virtusa Consulting Services Big Data Engineer interview?

Some of the top questions asked at the Virtusa Consulting Services Big Data Engineer interview -

  1. what type of filesystem used in ur proj...read more
  2. Command to check disk utilisation and health in Had...read more
  3. Pivot table creation in SQL from not pivot ...read more

Tell us how to improve this page.

Virtusa Consulting Services Big Data Engineer Interview Process

based on 1 interview

Interview experience

4
  
Good
View more
Virtusa Consulting Services Big Data Engineer Salary
based on 40 salaries
₹4 L/yr - ₹13 L/yr
30% less than the average Big Data Engineer Salary in India
View more details

Virtusa Consulting Services Big Data Engineer Reviews and Ratings

based on 3 reviews

4.3/5

Rating in categories

3.1

Skill development

4.2

Work-life balance

3.0

Salary

3.3

Job security

3.8

Company culture

3.1

Promotions

3.1

Work satisfaction

Explore 3 Reviews and Ratings
Senior Consultant
4k salaries
unlock blur

₹8 L/yr - ₹25.5 L/yr

Consultant
3.3k salaries
unlock blur

₹6 L/yr - ₹20.9 L/yr

Lead Consultant
3.3k salaries
unlock blur

₹10.5 L/yr - ₹36 L/yr

Software Engineer
3.2k salaries
unlock blur

₹3.5 L/yr - ₹13 L/yr

Associate Consultant
2.8k salaries
unlock blur

₹4.6 L/yr - ₹15 L/yr

Explore more salaries
Compare Virtusa Consulting Services with

Cognizant

3.8
Compare

TCS

3.7
Compare

Infosys

3.6
Compare

Accenture

3.9
Compare
Did you find this page helpful?
Yes No
write
Share an Interview