RDD stands for Resilient Distributed Dataset and is the fundamental data structure of Apache Spark.
An RDD is an immutable, partitioned collection of objects that can be operated on in parallel.
DataFrames and Datasets are higher-level abstractions built on top of RDDs.
RDDs are more low-level and offer more control over data processing compared to DataFrames and Datasets.
Partitioning is the process of dividing data into smaller chunks for better organization and processing in distributed systems.
Partitioning helps in distributing data across multiple nodes for parallel processing.
Coalesce reduces the number of partitions by merging existing ones and avoids a full shuffle, while repartition can either increase or decrease the number of partitions and always performs a full shuffle.
Example: coalesce(5) will merge partitions into 5 pa...
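The difference between the two operations can be sketched with a toy model in plain Python. This is not Spark's implementation: partitions are modelled as lists of integers, and the functions `coalesce` and `repartition` below are hypothetical stand-ins that only mimic the behaviour described above (merging whole partitions versus redistributing every record).

```python
from typing import List

Partition = List[int]


def coalesce(partitions: List[Partition], n: int) -> List[Partition]:
    """Toy model: merge whole partitions into at most n groups."""
    if n >= len(partitions):
        # Without a shuffle, the partition count cannot grow.
        return [list(p) for p in partitions]
    merged: List[Partition] = [[] for _ in range(n)]
    for i, part in enumerate(partitions):
        # Each source partition moves as a unit; records are never split up.
        merged[i % n].extend(part)
    return merged


def repartition(partitions: List[Partition], n: int) -> List[Partition]:
    """Toy model: redistribute every record individually (stand-in for a full shuffle)."""
    result: List[Partition] = [[] for _ in range(n)]
    records = [x for part in partitions for x in part]
    for i, record in enumerate(records):
        result[i % n].append(record)  # round-robin assignment
    return result


parts = [[1, 2, 3, 4], [5], [6, 7], [8]]
print(coalesce(parts, 2))     # [[1, 2, 3, 4, 6, 7], [5, 8]]
print(repartition(parts, 2))  # [[1, 3, 5, 7], [2, 4, 6, 8]]
```

In this model the coalesced partitions stay uneven because records are never split from their original partition, while the repartitioned ones come out balanced at the cost of moving every record.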
Spark is a distributed computing framework that provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
Spark has a master-slave architecture with a driver program that communicates with a cluster manager to distribute work across worker nodes.
It uses Resilient Distributed Datasets (RDDs) for fault-tolerant distributed data processing.
Spark supports various programming l...
DAG stands for Directed Acyclic Graph: a finite set of nodes connected by directed edges, where no path leads from a node back to itself.
In the context of Spark, a DAG represents the sequence of transformations that need to be applied to the input data to get the final output.
When a Spark job is submitted, Spark creates a DAG of the transformations spe...
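The idea of recording transformations first and running them only when a result is requested can be sketched in plain Python. `LazyDataset` is a hypothetical class invented for this illustration, not a Spark API, and its lineage is a simple linear chain (the simplest possible DAG) rather than a general graph.

```python
from typing import Any, Callable, Iterable, List, Tuple

Step = Tuple[str, Callable[[Any], Any]]


class LazyDataset:
    """Hypothetical stand-in for an RDD: records steps, runs them on collect()."""

    def __init__(self, source: Iterable[Any], lineage: Tuple[Step, ...] = ()):
        self._source = list(source)
        self._lineage = lineage

    def map(self, fn: Callable[[Any], Any]) -> "LazyDataset":
        # Nothing is computed here; the step is only appended to the lineage.
        return LazyDataset(self._source, self._lineage + (("map", fn),))

    def filter(self, pred: Callable[[Any], bool]) -> "LazyDataset":
        return LazyDataset(self._source, self._lineage + (("filter", pred),))

    def plan(self) -> List[str]:
        """Names of the recorded transformations, in execution order."""
        return [name for name, _ in self._lineage]

    def collect(self) -> List[Any]:
        """The 'action': walk the recorded steps and produce the result."""
        data = self._source
        for name, fn in self._lineage:
            if name == "map":
                data = [fn(x) for x in data]
            else:
                data = [x for x in data if fn(x)]
        return data


ds = LazyDataset(range(1, 6)).map(lambda x: x * 10).filter(lambda x: x > 20)
print(ds.plan())     # ['map', 'filter']
print(ds.collect())  # [30, 40, 50]
```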
posted on 3 Nov 2022
I applied via Campus Placement and was interviewed before Nov 2021. There were 3 interview rounds.
There were 30 basic aptitude questions to be answered in 30 minutes.
There were 3 coding questions: 2 were easy and 1 was of medium level.
posted on 15 Apr 2025
I appeared for an interview in Mar 2025, where I was asked the following questions.
I applied via Referral and was interviewed in Aug 2021. There was 1 interview round.
Find k min elements in given array.
Sort the array and return the first k elements.
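Use a max heap of size k: keep the k smallest elements seen so far and evict the largest whenever a smaller element arrives.
Use the quickselect algorithm to place the kth smallest element at index k - 1, then return the first k elements of the partitioned array.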
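The three approaches can be sketched as follows. The function names are illustrative, and all three return the result in ascending order for easy comparison. For the heap variant, a max-heap of size k is the usual choice, since the element to evict is the largest of the k values kept so far; Python's `heapq` is a min-heap, so the sketch stores negated values.

```python
import heapq
import random
from typing import List


def k_min_sort(arr: List[int], k: int) -> List[int]:
    """O(n log n): sort and slice."""
    return sorted(arr)[:max(k, 0)]


def k_min_heap(arr: List[int], k: int) -> List[int]:
    """O(n log k): max-heap of size k, simulated by negating values."""
    if k <= 0:
        return []
    heap: List[int] = []
    for x in arr:
        if len(heap) < k:
            heapq.heappush(heap, -x)
        elif x < -heap[0]:  # x is smaller than the largest value kept
            heapq.heapreplace(heap, -x)
    return sorted(-v for v in heap)


def k_min_quickselect(arr: List[int], k: int) -> List[int]:
    """Average O(n): partition until index k - 1 holds the kth smallest."""
    if k <= 0:
        return []
    if k >= len(arr):
        return sorted(arr)
    items = list(arr)  # work on a copy
    lo, hi, target = 0, len(items) - 1, k - 1
    while lo < hi:
        pivot = items[random.randint(lo, hi)]
        i, j = lo, hi
        while i <= j:  # Hoare-style partition around the pivot value
            while items[i] < pivot:
                i += 1
            while items[j] > pivot:
                j -= 1
            if i <= j:
                items[i], items[j] = items[j], items[i]
                i += 1
                j -= 1
        if target <= j:
            hi = j
        elif target >= i:
            lo = i
        else:
            break  # target sits between j and i, already in place
    return sorted(items[:k])


data = [7, 2, 9, 4, 1, 8, 3]
print(k_min_sort(data, 3))         # [1, 2, 3]
print(k_min_heap(data, 3))         # [1, 2, 3]
print(k_min_quickselect(data, 3))  # [1, 2, 3]
```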
Check if a given tree is a Binary Search Tree (BST) or not.
Traverse the tree in-order and check if the elements are in ascending order.
Check if the maximum value in the left subtree is less than the root and the minimum value in the right subtree is greater than the root.
Use recursion to check if all subtrees are BSTs.
Time complexity: O(n), Space complexity: O(h) where h is the height of the tree.
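A minimal recursive sketch of the check is shown below. Instead of recomputing the maximum and minimum of each subtree, it passes the allowed value range down the tree, which enforces the same condition in a single O(n) pass. The `Node` class is a hypothetical helper, and the sketch assumes duplicate values are not allowed.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Node:
    value: int
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def is_bst(node: Optional[Node],
           low: Optional[int] = None,
           high: Optional[int] = None) -> bool:
    """Return True if every value in the subtree lies strictly between low and high."""
    if node is None:
        return True
    if low is not None and node.value <= low:
        return False
    if high is not None and node.value >= high:
        return False
    # The left subtree must stay below this value, the right subtree above it.
    return (is_bst(node.left, low, node.value)
            and is_bst(node.right, node.value, high))


valid = Node(8, Node(3, Node(1), Node(6)), Node(10))
# 9 is greater than its parent 3, but it sits in the left subtree of 8.
invalid = Node(8, Node(3, Node(1), Node(9)), Node(10))
print(is_bst(valid))    # True
print(is_bst(invalid))  # False
```

The second tree shows why comparing each node only with its direct children is not enough: every parent-child pair looks fine, yet the tree is not a BST.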
Given a BST and a sum, find a pair of nodes whose values add up to the given sum.
Traverse the BST in-order and store the node values in a list, which will be sorted in ascending order.
Use the two-pointer approach, with one pointer at each end of the list, to find the pair with the given sum.
If the given sum is less than the sum of the current pair, move the right pointer to the left.
If the given sum is greater than the sum of the current pair, move the left pointer to the right.
If the given sum is equal to the sum of the current pair, return the pair.
Time complexit...
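The steps above can be sketched as follows. The `Node` class and the function name `find_pair` are illustrative, and the sketch returns the pair of values rather than the node objects.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Node:
    value: int
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def find_pair(root: Optional[Node], target: int) -> Optional[Tuple[int, int]]:
    """Return two values from the BST that add up to target, or None."""
    values: List[int] = []

    def in_order(node: Optional[Node]) -> None:
        if node is None:
            return
        in_order(node.left)
        values.append(node.value)  # visited in ascending order for a BST
        in_order(node.right)

    in_order(root)

    left, right = 0, len(values) - 1
    while left < right:
        current = values[left] + values[right]
        if current == target:
            return values[left], values[right]
        if current < target:
            left += 1   # need a larger sum
        else:
            right -= 1  # need a smaller sum
    return None


tree = Node(8, Node(3, Node(1), Node(6)), Node(10, None, Node(14)))
print(find_pair(tree, 16))   # (6, 10)
print(find_pair(tree, 100))  # None
```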
1 coding question, and basic MCQ
I appeared for an interview in Mar 2025, where I was asked the following questions.
This code finds the nth prime number using a simple algorithm to check for prime candidates.
A prime number is a natural number greater than 1 that has no positive divisors other than 1 and itself.
To find the nth prime, we can use a loop to check each number for primality until we reach the nth prime.
We can optimize the primality test by checking divisibility only up to the square root of the number.
Example: To find the...
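A minimal sketch of this approach, assuming trial division up to the square root as the primality test (the function names are illustrative):

```python
def is_prime(n: int) -> bool:
    """Trial division by 2 and by odd numbers up to sqrt(n)."""
    if n < 2:
        return False
    if n < 4:
        return True  # 2 and 3
    if n % 2 == 0:
        return False
    d = 3
    while d * d <= n:  # same as d <= sqrt(n), without floating point
        if n % d == 0:
            return False
        d += 2
    return True


def nth_prime(n: int) -> int:
    """Return the nth prime, counting from nth_prime(1) == 2."""
    if n < 1:
        raise ValueError("n must be at least 1")
    count, candidate = 0, 1
    while count < n:
        candidate += 1
        if is_prime(candidate):
            count += 1
    return candidate


print(nth_prime(1))   # 2
print(nth_prime(5))   # 11
print(nth_prime(10))  # 29
```

For large n, a sieve would usually be faster, but the loop above matches the simple check-each-candidate approach described here.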
Two easy questions.
| Designation | Salaries reported | Salary range |
| --- | --- | --- |
| Senior Applied Data Scientist | 127 | ₹10.9 L/yr - ₹22.1 L/yr |
| Applied Data Scientist | 89 | ₹10 L/yr - ₹16.5 L/yr |
| Lead Applied Data Scientist | 87 | ₹18 L/yr - ₹30 L/yr |
| Senior Engineer | 70 | ₹10 L/yr - ₹34 L/yr |
| Lead Engineer | 52 | ₹16 L/yr - ₹54.8 L/yr |
Access Healthcare
S&P Global
IKS Health
Acuity Knowledge Partners