Deutsche Telekom Digital Labs
I applied via Recruitment Consultant and was interviewed in Mar 2022. There were 3 interview rounds.
posted on 23 May 2024
I applied via Referral and was interviewed before May 2023. There were 2 interview rounds.
Transformations in Databricks manipulate data using functions such as map, filter, and flatMap.
Transformations are operations applied to Spark RDDs (or DataFrames) in Databricks
Common transformations include map, filter, flatMap, and reduceByKey; reduce, by contrast, is an action
Transformations are lazily evaluated and return a new RDD; nothing executes until an action such as collect or count is called
Example: map transformation to convert each element in an RDD to uppercase
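The lazy-evaluation behaviour described above can be illustrated without a cluster. The following is a plain-Python analogy built on generators, not the Spark API: `lazy_map`, `lazy_filter`, and `to_upper` are hypothetical helper names, and the final `list(...)` call stands in for an action.

```python
calls = []  # records every element the mapping function actually touches

def to_upper(word):
    calls.append(word)
    return word.upper()

def lazy_map(func, items):
    # Generator: describes a new sequence but runs nothing yet.
    for item in items:
        yield func(item)

def lazy_filter(pred, items):
    # Generator: keeps only the items for which pred(item) is true.
    for item in items:
        if pred(item):
            yield item

words = ["spark", "rdd", "databricks"]

# "Transformations": only a pipeline description is built here.
pipeline = lazy_filter(lambda w: len(w) > 3, lazy_map(to_upper, words))
calls_before_action = len(calls)  # still 0, no element has been processed

# "Action": consuming the pipeline triggers the actual work.
result = list(pipeline)
```

In PySpark the same shape would typically be written with `rdd.map(...)` and `rdd.filter(...)`, followed by an action such as `collect()`.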
Spark is a distributed computing framework for processing big data.
Spark is built around the concept of Resilient Distributed Datasets (RDDs)
It supports various programming languages like Scala, Java, Python, and R
Spark provides high-level APIs like DataFrames and Datasets for structured data processing
It includes libraries for SQL, streaming, machine learning, and graph processing
Spark can run on various cluster managers...
Performance enhancements in PySpark involve optimizing code, tuning configurations, and utilizing efficient data structures.
Use partitioning to distribute data evenly across nodes
Cache intermediate results to avoid recomputation
Optimize joins by broadcasting smaller tables
Use efficient data formats like Parquet or ORC
Tune Spark configurations for memory and parallelism
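The idea behind broadcasting a smaller table can be sketched in plain Python. This is a conceptual model under stated assumptions, not PySpark code: the partitions are ordinary lists, the small table is a dictionary, and `join_partition`, `order_partitions`, and `country_names` are hypothetical names with made-up sample rows.

```python
# Large "fact" table, already split into partitions (one list per worker).
order_partitions = [
    [(1, "IN", 250), (2, "DE", 100)],
    [(3, "IN", 75), (4, "US", 300)],
]

# Small "dimension" table: cheap enough to copy to every worker.
country_names = {"IN": "India", "DE": "Germany", "US": "United States"}

def join_partition(rows, lookup):
    # Each worker joins its own rows against its local copy of the lookup,
    # so the large table does not have to be shuffled between workers.
    return [(order_id, lookup[code], amount)
            for order_id, code, amount in rows
            if code in lookup]

# Run the join partition by partition and concatenate the results.
joined = [row
          for partition in order_partitions
          for row in join_partition(partition, country_names)]
```

In PySpark itself, the usual way to request this behaviour is the `broadcast` hint from `pyspark.sql.functions` applied to the smaller DataFrame in a join.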
Overfitting in decision trees occurs when the model learns noise in the training data rather than the underlying pattern.
Overfitting happens when the decision tree is too complex and captures noise in the training data.
It leads to poor generalization on unseen data, as the model is too specific to the training set.
To prevent overfitting, techniques like pruning, setting a minimum number of samples per leaf, or using en...
Bagging is a machine learning ensemble technique where multiple models are trained on different subsets of the training data and their predictions are combined.
Bagging stands for Bootstrap Aggregating.
It helps reduce overfitting by combining the predictions of multiple models.
Random Forest is a popular algorithm that uses bagging by training multiple decision trees on random subsets of the data.
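The bootstrap-then-aggregate idea above can be sketched with the standard library alone. This is a minimal illustration, assuming a one-dimensional toy dataset and a decision stump (a single threshold) as the base model; `train_stump`, `bagging_fit`, and `bagging_predict` are hypothetical names, not part of any library.

```python
import random
from collections import Counter

def train_stump(samples):
    # Pick the threshold that misclassifies the fewest training samples.
    # A stump predicts 1 when x > threshold and 0 otherwise.
    best_threshold, best_errors = None, None
    for threshold, _ in samples:
        errors = sum((x > threshold) != bool(label) for x, label in samples)
        if best_errors is None or errors < best_errors:
            best_threshold, best_errors = threshold, errors
    return best_threshold

def bagging_fit(samples, n_models, seed=0):
    rng = random.Random(seed)
    models = []
    for _ in range(n_models):
        # Bootstrap: sample with replacement, same size as the original set.
        bootstrap = rng.choices(samples, k=len(samples))
        models.append(train_stump(bootstrap))
    return models

def bagging_predict(models, x):
    # Aggregate: majority vote over the individual stump predictions.
    votes = Counter(int(x > threshold) for threshold in models)
    return votes.most_common(1)[0][0]

# Toy data: label is mostly 1 for x >= 5, with one noisy point at x = 2.
data = [(0, 0), (1, 0), (2, 1), (3, 0), (4, 0),
        (5, 1), (6, 1), (7, 1), (8, 1), (9, 1)]

models = bagging_fit(data, n_models=25)
```

Each stump sees a slightly different resample, so a single noisy point influences only some of them; the vote smooths that out. A random forest adds random feature selection on top of this scheme.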
A neuron is a basic unit of a neural network that receives input, processes it, and produces an output.
Neurons are inspired by biological neurons in the human brain.
They receive input signals, apply weights to them, sum them up, and pass the result through an activation function.
Neurons are organized in layers in a neural network, with each layer performing specific tasks.
In deep learning, multiple layers of neurons ar...
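The weighted-sum-plus-activation step described above can be written in a few lines. This is a minimal sketch, assuming a sigmoid activation and hand-picked weights; `neuron` and `layer` are hypothetical names chosen for illustration.

```python
import math

def sigmoid(z):
    # Squashes any real number into the range (0, 1).
    return 1.0 / (1.0 + math.exp(-z))

def neuron(inputs, weights, bias):
    # Weighted sum of the inputs plus a bias term...
    z = sum(x * w for x, w in zip(inputs, weights)) + bias
    # ...passed through a non-linear activation function.
    return sigmoid(z)

def layer(inputs, weight_rows, biases):
    # A layer is several neurons that all read the same inputs.
    return [neuron(inputs, w, b) for w, b in zip(weight_rows, biases)]

# Weighted sum is 1.0*0.5 + 2.0*(-0.25) + 0.0 = 0.0, and sigmoid(0.0) = 0.5.
output = neuron([1.0, 2.0], [0.5, -0.25], 0.0)

# Two neurons sharing the same two inputs.
hidden = layer([1.0, 2.0], [[0.5, -0.25], [1.0, 1.0]], [0.0, -3.0])
```

Stacking several such layers, with each layer's outputs feeding the next, gives a simple feed-forward network.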
I applied via Campus Placement
Quant questions; you should be able to clear them easily.
Task 1: create a website with 3 web pages
Task 2: create custom Linux commands
I applied via Approached by Company and was interviewed in Sep 2024. There was 1 interview round.
Random forest is an ensemble learning method using multiple decision trees to improve accuracy and reduce overfitting.
Random forest uses multiple decision trees to make predictions.
A decision tree is a single tree structure used for classification and regression.
Random forest reduces overfitting by averaging the predictions of multiple trees.
A decision tree can be prone to overfitting if not pruned properly.
Random forest i...
posted on 28 Feb 2024
I applied via Naukri.com and was interviewed in Jan 2024. There were 3 interview rounds.
I applied via Referral and was interviewed in Feb 2024. There was 1 interview round.
posted on 17 Sep 2024
I was asked to implement an LRU cache, which I did, but I still didn't make it to the next round because apparently I didn't complete the working code, though in my opinion I did.
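For readers preparing for the same question, here is one common way to implement an LRU cache in Python. It is a minimal sketch, assuming a fixed capacity and a simple `get`/`put` interface; the review does not state the interviewer's exact requirements, so the class shape below is an assumption.

```python
from collections import OrderedDict

class LRUCache:
    def __init__(self, capacity):
        if capacity <= 0:
            raise ValueError("capacity must be positive")
        self.capacity = capacity
        self._items = OrderedDict()  # oldest entry first, newest last

    def get(self, key, default=None):
        if key not in self._items:
            return default
        self._items.move_to_end(key)  # mark as most recently used
        return self._items[key]

    def put(self, key, value):
        if key in self._items:
            self._items.move_to_end(key)
        self._items[key] = value
        if len(self._items) > self.capacity:
            self._items.popitem(last=False)  # evict least recently used

cache = LRUCache(2)
cache.put("a", 1)
cache.put("b", 2)
cache.get("a")      # "a" becomes most recently used
cache.put("c", 3)   # capacity exceeded, so "b" is evicted
```

`OrderedDict` keeps both operations at constant time on average; an interviewer may instead ask for the underlying hash map plus doubly linked list written out by hand.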
| Designation | Salaries reported | Salary range |
| --- | --- | --- |
| Software Engineer | 221 | ₹10 L/yr - ₹26 L/yr |
| Senior Software Engineer | 105 | ₹20 L/yr - ₹40 L/yr |
| Devops Engineer | 54 | ₹6.6 L/yr - ₹21.6 L/yr |
| Sdet Automation Test Engineer | 41 | ₹10 L/yr - ₹24 L/yr |
| Software Developer | 34 | ₹8.1 L/yr - ₹27.5 L/yr |
TCS
Wipro
Infosys
HCLTech