Filter interviews by
Clear (1)
RDD stands for Resilient Distributed Dataset and is the fundamental data structure of Apache Spark.
RDD is a distributed collection of objects that can be operated on in parallel.
DataFrames and Datasets are higher-level abstractions built on top of RDDs.
RDDs are more low-level and offer more control over data processing compared to DataFrames and Datasets.
Partitioning is the process of dividing data into smaller chunks for better organization and processing in distributed systems.
Partitioning helps in distributing data across multiple nodes for parallel processing.
Coalesce is used to reduce the number of partitions without shuffling data, while repartition is used to increase the number of partitions by shuffling data.
Example: coalesce(5) will merge partitions into 5 pa...
Spark is a distributed computing framework that provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
Spark has a master-slave architecture with a driver program that communicates with a cluster manager to distribute work across worker nodes.
It uses Resilient Distributed Datasets (RDDs) for fault-tolerant distributed data processing.
Spark supports various programming l...
DAG stands for Directed Acyclic Graph. It is a finite directed graph with no cycles.
DAG is a collection of nodes connected by edges where each edge goes from one node to another, but no cycles are allowed.
In the context of Spark, a DAG represents the sequence of transformations that need to be applied to the input data to get the final output.
When a Spark job is submitted, Spark creates a DAG of the transformations spe...
I applied via Job Portal and was interviewed in Apr 2021. There was 1 interview round.
posted on 22 Sep 2023
Coding testing with 15 questions MCQ based
I was interviewed before Dec 2020.
Round duration - 120 minutes
Round difficulty - Easy
After round 1 there was a break of 15 mins and the coding round started.
We had a choice to attempt the exam in R language & SQL or python & sql
I chose python and sql
It had 8 questions 4 python and 4 sql
Given two strings, S
and T
with respective lengths M
and N
, your task is to determine the length of their longest common subsequence.
A subsequence is a seque...
The task is to find the length of the longest common subsequence between two given strings.
Use dynamic programming to solve this problem efficiently.
Create a 2D array to store the lengths of longest common subsequences of substrings.
Iterate through the strings to fill the array and find the length of the longest common subsequence.
Example: For strings 'abcde' and 'ace', the longest common subsequence is 'ace' with leng
Round duration - 15 minutes
Round difficulty - Medium
The interview was mostly analytical based and how you approach the round a few technical questions in dbms and oops were also asked.
Round duration - 20 Minutes
Round difficulty - Medium
Similar to first round
For Shortlists after elimination from first round.
You are a cab driver with a car that initially has 'C' empty seats. The car moves in a straight line towards the forward direction only. Your job is to determine if it is poss...
Determine if it is possible to accommodate all passenger trips within a car's capacity without exceeding it at any point.
Iterate through each trip and keep track of the total number of passengers in the car at each point.
Check if the total number of passengers exceeds the car capacity at any point.
Return 'True' if all trips can be accommodated within the car capacity, otherwise return 'False'.
Round duration - 20 minutes
Round difficulty - Easy
After two technical round interviews, the last round was a hr interview. The whole process can take up one whole day or more and your interview can be at any time
My process started at 5 pm and ended at 10:30 pm
All the interviewers were very friendly no pressure.
Calculate the Nth term in the Fibonacci sequence, where the sequence is defined as follows: F(n) = F(n-1) + F(n-2)
, with initial conditions F(1) = F(2) = 1
.
Calculate the Nth Fibonacci number efficiently using dynamic programming.
Use dynamic programming to store previously calculated Fibonacci numbers to avoid redundant calculations.
Start with base cases F(1) and F(2) as 1, then iterate to calculate subsequent Fibonacci numbers.
Time complexity can be optimized to O(N) using dynamic programming.
Example: For N = 5, the 5th Fibonacci number is 5.
Tip 1 : Be fast ans accurate on your aptitude amd verbal tests.
Tip 2 : In coding round, depending on the difficulty of the questions the time in which you complete solving them should decrease .If almost everyone can solve them timming comes in to the equation for shortlisting
Tip 3 : While submitting your resume, please be honest about the projects you have done and do not bluff about it because in the interview when you ate drilled abt it, they can catch you easily.
Tip 4 : be confident and answer every question to the best of your ability and if u don't know something, tell them you don't know abt it .
Tip 1 : Research the company and put the most revalent projects and skills that the company os looking for specifically
Tip 2 : Do not bluff or lie on your resume. It show's during the interview.
posted on 29 Oct 2021
I applied via Naukri.com and was interviewed in Sep 2021. There were 3 interview rounds.
I applied via Referral and was interviewed in Dec 2021. There were 4 interview rounds.
I applied via Job Portal and was interviewed in Apr 2021. There was 1 interview round.
posted on 23 Feb 2022
I applied via LinkedIn and was interviewed in Sep 2021. There were 4 interview rounds.
There were 4 sections-
1.Big data-some multiple choice questions
2.Python-2 basic programming questions
3.SQL-4 case questions on SQL queries
4.Pyspark-Basic multiple choice questions
Same as round 1 but there were no multiple choice questions,it was a subjective paper where they check your thought process and your approach.
based on 1 interview
Interview experience
based on 1 review
Rating in categories
Senior Applied Data Scientist
129
salaries
| ₹0 L/yr - ₹0 L/yr |
Lead Applied Data Scientist
86
salaries
| ₹0 L/yr - ₹0 L/yr |
Applied Data Scientist
84
salaries
| ₹0 L/yr - ₹0 L/yr |
Senior Engineer
59
salaries
| ₹0 L/yr - ₹0 L/yr |
Senior Data Scientist
49
salaries
| ₹0 L/yr - ₹0 L/yr |
Fractal Analytics
Mu Sigma
AbsolutData
Algonomy