Partition in Spark is a way to divide data into smaller chunks for parallel processing.
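Partitions are the basic units of parallelism in Spark.
Data in an RDD is divided into partitions, which are processed in parallel.
The number of partitions can be controlled with repartition(), which performs a full shuffle and can increase or decrease the count, or coalesce(), which avoids a full shuffle and is typically used to decrease it.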
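To make the idea of partitions as units of parallelism concrete, here is a minimal plain-Python sketch. It is not Spark code: the helper names (split_into_partitions, sum_partition, parallel_sum) are hypothetical, and a thread pool stands in for Spark's executors. It only shows the pattern of splitting data into chunks, working on each chunk independently, and combining the partial results.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_partitions(data, num_partitions):
    """Split a list into num_partitions contiguous chunks of near-equal size."""
    size, extra = divmod(len(data), num_partitions)
    partitions, start = [], 0
    for i in range(num_partitions):
        # The first `extra` partitions each take one additional element.
        end = start + size + (1 if i < extra else 0)
        partitions.append(data[start:end])
        start = end
    return partitions


def sum_partition(partition):
    # Work done independently on one partition (comparable to one task).
    return sum(partition)


def parallel_sum(data, num_partitions):
    partitions = split_into_partitions(data, num_partitions)
    # Each partition is handled by its own worker; results are combined at the end.
    with ThreadPoolExecutor(max_workers=num_partitions) as pool:
        partial_sums = list(pool.map(sum_partition, partitions))
    return sum(partial_sums)
```

Changing num_partitions here changes how many chunks can be worked on at once, which is roughly the knob that repartition() and coalesce() expose in Spark.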
Bucketing splits data into a fixed number of files (buckets) using a hash of a column's values, while partitioning divides data into separate directories based on column values.
Bucketing is used for evenly distributing data across files for better query performance.
Partitioning is used for organizing data based on specific column values for easier data retrieval.
Example: Bucketing can be used to evenly distribute sa...
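The difference between the two layouts can be sketched in plain Python. This is an illustration under assumptions, not how Spark or Hive implement it: the function names and the sample column names are hypothetical, and zlib.crc32 is only a stand-in for the engines' own hash functions.

```python
import zlib
from collections import defaultdict


def partition_path(row, partition_col):
    # Partitioning: one directory per distinct column value.
    return f"{partition_col}={row[partition_col]}"


def bucket_id(row, bucket_col, num_buckets):
    # Bucketing: hash of the column value modulo a fixed bucket count.
    # crc32 is a stand-in here; Spark and Hive use their own hash functions.
    return zlib.crc32(str(row[bucket_col]).encode("utf-8")) % num_buckets


def layout(rows, partition_col, bucket_col, num_buckets):
    """Group rows by (partition directory, bucket number)."""
    files = defaultdict(list)
    for row in rows:
        key = (partition_path(row, partition_col),
               bucket_id(row, bucket_col, num_buckets))
        files[key].append(row)
    return dict(files)
```

The number of partition directories grows with the number of distinct values in the partition column, whereas the number of buckets stays fixed at num_buckets regardless of the data.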
Hive tables are used to store structured data in Hive, similar to tables in a traditional database.
Hive tables are created using the CREATE TABLE statement.
Tables can be partitioned based on one or more columns.
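External tables in Hive reference data at a user-specified location; Hive manages only the metadata, so dropping the table leaves the data in place.
Managed tables store data in the Hive warehouse directory in HDFS; dropping the table deletes the data as well.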
Tables can be queried using SQL-like syntax in HiveQL.
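Spark supports three read modes for handling malformed input: permissive (the default), dropMalformed, and failFast.
Permissive mode - keeps corrupted records: malformed fields are set to null and the raw record can be captured in a corrupt-record column (`_corrupt_record` by default)
DropMalformed mode - drops corrupted records during reading
FailFast mode - throws an exception as soon as a corrupted record is encountered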
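The behaviour of the three modes can be imitated in a few lines of plain Python. This is a simplified simulation, not Spark's reader: read_rows and the two-field name,age format are assumptions for illustration, and a corrupt row here has all of its fields set to null, whereas Spark nulls only the malformed fields.

```python
def read_rows(lines, mode="PERMISSIVE"):
    """Parse 'name,age' lines; a wrong field count or non-integer age is corrupt."""
    if mode not in ("PERMISSIVE", "DROPMALFORMED", "FAILFAST"):
        raise ValueError(f"unknown mode: {mode}")
    rows = []
    for line in lines:
        parts = line.split(",")
        try:
            if len(parts) != 2:
                raise ValueError("wrong number of fields")
            rows.append({"name": parts[0], "age": int(parts[1]),
                         "_corrupt_record": None})
        except ValueError:
            if mode == "FAILFAST":
                # Stop at the first bad record.
                raise ValueError(f"malformed record: {line!r}")
            if mode == "DROPMALFORMED":
                # Skip the bad record silently.
                continue
            # PERMISSIVE: keep the row, null the fields, keep the raw text.
            rows.append({"name": None, "age": None, "_corrupt_record": line})
    return rows
```

With the input ["alice,30", "bob,abc", "carol,41"], permissive mode returns three rows with the raw text of the second kept in _corrupt_record, dropMalformed returns two rows, and failFast raises on the second line.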
I applied via Company Website and was interviewed before Feb 2020. There was 1 interview round.
I applied via Job Portal and was interviewed before Dec 2019. There was 1 interview round.
I applied via Naukri.com and was interviewed in Aug 2018. There were 0 interview rounds.
I applied via Company Website and was interviewed before Jun 2020. There was 1 interview round.
I applied via Campus Placement and was interviewed before May 2020. There were 3 interview rounds.
I applied via more than one job portal (I can't remember which now) and was interviewed before Jan 2020. There was 1 interview round.
I applied via Naukri.com and was interviewed in Mar 2018. There were 5 interview rounds.
I appeared for an interview before Sep 2016.
I appeared for an interview before Sep 2020.
Round duration - 60 minutes
Round difficulty - Easy
It was in the morning. We were given 60 minutes.
We were provided with desktops, as it was an online test.
About 3-4 interviewers were there.
Determine if a given year, represented as an integer 'N', is a leap year.
A leap year is defined as a year with 366 days, unlike a normal year which has 365 days.
The initial i...
The task is to determine if a given year is a leap year or not.
Check if the year is divisible by 4. If it is not, it is not a leap year; if it is, proceed to the next step.
If the year is also divisible by 100, then it must be divisible by 400 as well to be a leap year.
If the year satisfies the above conditions, output 'Yes', else output 'No'.
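The steps above can be sketched as a small function. The name is_leap_year is hypothetical, and returning the strings 'Yes' and 'No' follows the output format described in the problem statement.

```python
def is_leap_year(n: int) -> str:
    """Return 'Yes' if year n is a leap year, otherwise 'No'."""
    # Step 1: a leap year must be divisible by 4.
    if n % 4 != 0:
        return "No"
    # Step 2: century years must also be divisible by 400.
    if n % 100 == 0 and n % 400 != 0:
        return "No"
    return "Yes"
```

For example, 2000 and 2024 are leap years, while 1900 and 2023 are not.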
Round duration - 30 minutes
Round difficulty - Easy
It was in the early morning.
I was the first one selected for the interview.
There were 4 interviewers, and they were all specialists in their domains.
Round duration - 15 minutes
Round difficulty - Medium
It was in the middle of the afternoon. They called me, said I was shortlisted, and asked me to come to campus again.
There were 2 interviewers who talked to me and asked different questions about my company, my experience in college, etc.
It was a kind of chit-chat, but basically they were testing my conversation skills.
Tip 1 : Try to do as many mock interviews as you can
Tip 2 : Be good with technical skills
Tip 1 : Mention good projects
Tip 2 : Any internship will be an add-on