I applied via Referral and was interviewed before May 2023. There were 2 interview rounds.
Transformations in Databricks manipulate data with functions such as map, filter, and flatMap.
Transformations are operations applied to Spark RDDs (and DataFrames) in Databricks
Common transformations include map, filter, flatMap, and reduceByKey; reduce itself is an action, not a transformation
Transformations are lazily evaluated: each one returns a new RDD, and nothing runs until an action is called
Example: a map transformation that converts each element in an RDD to uppercase
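The lazy-evaluation point can be illustrated without a cluster. What follows is a plain-Python analogy, not the Spark API: `LazyDataset` and its methods are hypothetical names made up for this sketch. Transformations only record a step and return a new object, while actions (here `collect` and `reduce`, which are also actions in Spark) trigger the work.

```python
import functools


class LazyDataset:
    """Hypothetical stand-in for an RDD: records steps, runs them only on an action."""

    def __init__(self, source, steps=()):
        self._source = source   # underlying data, never mutated
        self._steps = steps     # tuple of (kind, function) pairs, applied in order

    # Transformations: return a NEW dataset and do no work yet.
    def map(self, fn):
        return LazyDataset(self._source, self._steps + (("map", fn),))

    def filter(self, pred):
        return LazyDataset(self._source, self._steps + (("filter", pred),))

    def _iterate(self):
        # Chain lazy iterators; nothing is computed until they are consumed.
        items = iter(self._source)
        for kind, fn in self._steps:
            items = map(fn, items) if kind == "map" else filter(fn, items)
        return items

    # Actions: consume the pipeline and return a concrete result.
    def collect(self):
        return list(self._iterate())

    def reduce(self, fn):
        return functools.reduce(fn, self._iterate())


calls = []


def to_upper(word):
    calls.append(word)  # record when the function actually runs
    return word.upper()


words = LazyDataset(["spark", "rdd", "databricks"])
upper = words.map(to_upper).filter(lambda w: len(w) > 3)
ran_before_action = len(calls)  # still 0: only the plan exists so far
result = upper.collect()        # the action executes the recorded steps
```

The original `words` object is left unchanged by `map` and `filter`, which mirrors how an RDD transformation produces a new RDD instead of modifying the existing one.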
Spark is a distributed computing framework for processing big data.
Spark is built around the concept of Resilient Distributed Datasets (RDDs)
It supports various programming languages like Scala, Java, Python, and R
Spark provides high-level APIs like DataFrames and Datasets for structured data processing
It includes libraries for SQL, streaming, machine learning, and graph processing
Spark can run on various cluster manag
Performance enhancements in PySpark involve optimizing code, tuning configurations, and utilizing efficient data structures.
Use partitioning to distribute data evenly across nodes
Cache intermediate results to avoid recomputation
Optimize joins by broadcasting smaller tables
Use efficient data formats like Parquet or ORC
Tune Spark configurations for memory and parallelism
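The idea behind broadcasting a smaller table can be sketched in plain Python. This is an illustration of the technique only, under the assumption that the small table fits in memory; `broadcast_hash_join` and the sample rows are hypothetical. In PySpark the corresponding hint is `pyspark.sql.functions.broadcast`, applied to the smaller DataFrame in a join.

```python
from collections import defaultdict


def broadcast_hash_join(large_partitions, small_rows, key):
    """Inner-join every partition of a large table against one small lookup table."""
    # Build the lookup once; conceptually this is the copy sent to every worker,
    # so the large table never has to be shuffled across the network.
    lookup = defaultdict(list)
    for row in small_rows:
        lookup[row[key]].append(row)

    joined = []
    for partition in large_partitions:  # each partition is joined locally
        for row in partition:
            for match in lookup.get(row[key], []):
                joined.append({**row, **match})
    return joined


# Hypothetical data: a partitioned "orders" table and a small "countries" table.
orders = [
    [{"country": "IN", "amount": 10}, {"country": "US", "amount": 20}],
    [{"country": "IN", "amount": 5}, {"country": "FR", "amount": 7}],
]
countries = [
    {"country": "IN", "name": "India"},
    {"country": "US", "name": "United States"},
]
joined_rows = broadcast_hash_join(orders, countries, "country")
```

The order for `FR` is dropped because this sketch performs an inner join and no matching country row exists.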
I applied via Approached by Company and was interviewed before Feb 2023. There were 3 interview rounds.
It was a HackerEarth test with questions ranging from simple to intermediate
I applied via Naukri.com and was interviewed before Nov 2021. There were 4 interview rounds.
A language-specific coding test that may focus heavily on proficiency in data technologies
What people are saying about Fractal Analytics
posted on 8 Sep 2022
I applied via Recruitment Consultant and was interviewed in Mar 2022. There were 3 interview rounds.
Overfitting in decision trees occurs when the model learns noise in the training data rather than the underlying pattern.
Overfitting happens when the decision tree is too complex and captures noise in the training data.
It leads to poor generalization on unseen data, as the model is too specific to the training set.
To prevent overfitting, techniques like pruning, setting a minimum number of samples per leaf, or using en
Bagging is a machine learning ensemble technique in which multiple models are trained on different bootstrap samples of the training data and their predictions are combined.
Bagging stands for Bootstrap Aggregating.
It reduces variance, and therefore overfitting, by averaging or voting over the predictions of multiple models.
Random Forest is a popular algorithm that uses bagging: it trains multiple decision trees on bootstrap samples of the data and also considers a random subset of features at each split.
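A minimal sketch of bagging, assuming a binary classification task on a single numeric feature. The base learner here is a one-split "stump" chosen for brevity, not a full decision tree, and all function names and the toy data are hypothetical.

```python
import random
from collections import Counter


def train_stump(sample):
    """Pick the threshold t (from the sample) that best fits: predict 1 when x >= t."""
    best_threshold, best_correct = None, -1
    for threshold, _ in sample:
        correct = sum((x >= threshold) == bool(label) for x, label in sample)
        if correct > best_correct:
            best_threshold, best_correct = threshold, correct
    return best_threshold


def bagging_fit(data, n_models, rng):
    """Train one stump per bootstrap sample (drawn with replacement, same size as data)."""
    models = []
    for _ in range(n_models):
        bootstrap = [rng.choice(data) for _ in data]
        models.append(train_stump(bootstrap))
    return models


def bagging_predict(models, x):
    """Aggregate by majority vote over all stumps."""
    votes = Counter(int(x >= threshold) for threshold in models)
    return votes.most_common(1)[0][0]


# Toy data: label is 0 for x < 5 and 1 otherwise.
data = [(x, int(x >= 5)) for x in range(10)]
models = bagging_fit(data, n_models=25, rng=random.Random(42))
low_prediction = bagging_predict(models, 1)
high_prediction = bagging_predict(models, 8)
```

Using an odd number of models avoids tied votes in the binary case. For regression the vote would be replaced by an average of the individual predictions.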
A neuron is a basic unit of a neural network that receives input, processes it, and produces an output.
Neurons are inspired by biological neurons in the human brain.
They receive input signals, apply weights to them, sum them up, and pass the result through an activation function.
Neurons are organized in layers in a neural network, with each layer transforming the outputs of the previous layer.
In deep learning, multiple layers of neurons ar...
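The weighted-sum-plus-activation description above can be written out in a few lines. This is a minimal sketch: the bias term and the choice of a sigmoid activation are assumptions not stated in the answer, and the function names are illustrative.

```python
import math


def sigmoid(z):
    """Squash any real number into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def neuron(inputs, weights, bias, activation=sigmoid):
    """Weight each input, sum the results with the bias, then apply the activation."""
    if len(inputs) != len(weights):
        raise ValueError("inputs and weights must have the same length")
    z = sum(x * w for x, w in zip(inputs, weights)) + bias
    return activation(z)


def layer(inputs, weight_rows, biases, activation=sigmoid):
    """A layer is several neurons that all read the same inputs."""
    return [neuron(inputs, w, b, activation) for w, b in zip(weight_rows, biases)]


# The weighted sum is 1.0 * 0.5 + 2.0 * (-0.25) + 0.0 = 0.0, and sigmoid(0.0) is 0.5.
single_output = neuron([1.0, 2.0], [0.5, -0.25], bias=0.0)
layer_output = layer([1.0, 2.0], [[0.5, -0.25], [0.0, 0.0]], [0.0, 0.0])
```

Stacking such layers, with the output list of one layer used as the input list of the next, gives the multi-layer arrangement the answer refers to.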
I applied via Approached by Company and was interviewed in Sep 2024. There was 1 interview round.
Random forest is an ensemble learning method using multiple decision trees to improve accuracy and reduce overfitting.
Random forest uses multiple decision trees to make predictions.
A decision tree is a single tree structure used for classification and regression.
Random forest reduces overfitting by averaging the predictions of multiple trees.
A single decision tree is prone to overfitting if it is not pruned or depth-limited.
Random forest i...
I applied via Referral and was interviewed in Feb 2024. There was 1 interview round.
posted on 17 Sep 2024
I was asked to implement an LRU cache, which I did, but I still didn't make it to the next round because apparently I hadn't completed the working code, though in my opinion I had.
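For reference, a common formulation of this question is a fixed-capacity cache with `get` and `put` that evicts the least recently used key. The review does not state the interviewer's exact requirements, so the interface below is an assumption; the sketch relies on `collections.OrderedDict` to track recency.

```python
from collections import OrderedDict


class LRUCache:
    """Fixed-capacity cache that evicts the least recently used key."""

    def __init__(self, capacity):
        if capacity <= 0:
            raise ValueError("capacity must be positive")
        self._capacity = capacity
        self._items = OrderedDict()  # oldest entry first, newest last

    def get(self, key, default=None):
        if key not in self._items:
            return default
        self._items.move_to_end(key)  # a read counts as a use
        return self._items[key]

    def put(self, key, value):
        if key in self._items:
            self._items.move_to_end(key)  # an update counts as a use
        self._items[key] = value
        if len(self._items) > self._capacity:
            self._items.popitem(last=False)  # drop the least recently used entry


cache = LRUCache(2)
cache.put("a", 1)
cache.put("b", 2)
first_read = cache.get("a")  # "a" becomes the most recently used key
cache.put("c", 3)            # capacity exceeded, so "b" is evicted
evicted_read = cache.get("b")
```

Both operations run in constant time on average. Interviewers sometimes ask for the same behavior built from a hash map plus a hand-written doubly linked list, which is the structure `OrderedDict` provides here.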
Interview experience: based on 14 reviews
| Designation | Salaries reported | Salary range |
| --- | --- | --- |
| Consultant | 1.1k | ₹6 L/yr - ₹24 L/yr |
| Data Engineer | 686 | ₹7.5 L/yr - ₹28 L/yr |
| Senior Consultant | 569 | ₹12.4 L/yr - ₹38 L/yr |
| Data Scientist | 442 | ₹8.3 L/yr - ₹33 L/yr |
| Engineer | 218 | ₹7.5 L/yr - ₹26 L/yr |
Mu Sigma
AbsolutData
LatentView Analytics
Tiger Analytics