i
TCS
Filter interviews by
Decision tree algorithm is a tree-like model used for classification and regression. Cross entropy is a measure of the difference between two probability distributions.
Decision tree algorithm recursively splits the data into subsets based on the most significant attribute until a stopping criterion is met.
It is a popular algorithm for both classification and regression tasks.
Cross entropy is used as a loss function in ...
I was interviewed in Oct 2024.
Transfer learning involves using pre-trained models on a different task, while fine-tuning involves further training a pre-trained model on a specific task.
Transfer learning uses knowledge gained from one task to improve learning on a different task.
Fine-tuning involves adjusting the parameters of a pre-trained model to better fit a specific task.
Transfer learning is faster and requires less data compared to training a...
Supervised learning algorithms are used in machine learning to predict outcomes based on labeled training data.
Supervised learning algorithms require labeled training data to learn the relationship between input and output variables.
Common supervised learning algorithms include linear regression, logistic regression, decision trees, support vector machines, and neural networks.
These algorithms are used for tasks such a...
Unsupervised learning algorithms are used to find patterns in data without labeled outcomes.
Unsupervised learning algorithms do not require labeled data for training.
They are used for clustering, dimensionality reduction, and anomaly detection.
Examples include K-means clustering, hierarchical clustering, and principal component analysis.
What people are saying about TCS
Cosine similarity measures the similarity between two non-zero vectors in an inner product space.
Cosine similarity ranges from -1 to 1, with 1 indicating identical vectors and -1 indicating opposite vectors.
It is commonly used in information retrieval, text mining, and recommendation systems.
Formula: cos(theta) = (A . B) / (||A|| * ||B||)
Example: Calculating similarity between two documents based on their word frequenc
Recall is the ratio of correctly predicted positive observations to the all observations in actual class, while precision is the ratio of correctly predicted positive observations to the total predicted positive observations.
Recall is about the actual positive instances that were correctly identified by the model.
Precision is about the predicted positive instances and how many of them were actually positive.
Recall = Tr...
Stop words are common words like 'the', 'is', 'and' that are removed from text data to improve analysis.
Stop words are commonly removed from text data to improve the accuracy of natural language processing tasks.
They are typically removed before tokenization and can be done using libraries like NLTK or spaCy.
Examples of stop words include 'the', 'is', 'and', 'in', 'on', etc.
TCS interview questions for designations
Confusion matrix is a table used to evaluate the performance of a classification model.
It is a 2x2 matrix that shows the counts of true positive, true negative, false positive, and false negative predictions.
It is used to calculate metrics like accuracy, precision, recall, and F1 score.
Example: TP=100, TN=50, FP=10, FN=5.
Similarity matrix algo is a method to quantify the similarity between data points in a dataset.
It calculates the similarity between each pair of data points in a dataset and represents it in a matrix form.
Common similarity measures used include cosine similarity, Euclidean distance, and Jaccard similarity.
The diagonal of the matrix usually contains 1s as each data point is perfectly similar to itself.
The values in the ...
Get interview-ready with Top TCS Interview Questions
Steps involved in Machine Learning Problem Statement
Define the problem statement and goals
Collect and preprocess data
Select a machine learning model
Train the model on the data
Evaluate the model's performance
Fine-tune the model if necessary
Deploy the model for predictions
I applied via Naukri.com and was interviewed in Jan 2024. There was 1 interview round.
Retraining GEN AI model involves updating the model with new data to improve its accuracy and performance.
Retraining is necessary to keep the model up-to-date with new information.
New data is used to fine-tune the model's parameters and improve its predictions.
Retraining may involve adjusting hyperparameters, adding more layers, or changing the architecture.
Examples: retraining a language model with new text data, retr...
MLFlow allows for easy deployment of machine learning models.
MLFlow provides a simple way to deploy models using the mlflow models serve command.
Models can be deployed locally or to a cloud-based server for production use.
MLFlow also supports model versioning and tracking for easy management of deployed models.
Duration 1 Hr. Difficulty- Medium.
Basic coding questions
F1-score is a measure of a model's accuracy that considers both precision and recall.
F1-score is the harmonic mean of precision and recall.
It ranges from 0 to 1, where 1 is the best possible F1-score.
F1-score is useful when you want to balance precision and recall in your model evaluation.
Different ML algorithms include linear regression, decision trees, random forests, support vector machines, and neural networks.
Linear regression: used for predicting continuous values based on input features.
Decision trees: used for classification and regression tasks by splitting data into branches based on feature values.
Random forests: ensemble method using multiple decision trees for improved accuracy.
Support vect...
I applied via LinkedIn and was interviewed in Jan 2024. There was 1 interview round.
Overfitting occurs when a model learns the training data too well, while underfitting occurs when a model fails to capture the underlying patterns in the data.
Overfitting: Model is too complex and learns noise in the training data.
Underfitting: Model is too simple and fails to capture the underlying patterns.
Overfitting can lead to poor generalization and high variance.
Underfitting can lead to high bias and poor perfor...
ML models should be included in a Data Scientist's resume.
Include a section in your resume highlighting the ML models you have worked with.
Mention the specific ML algorithms and techniques you have used.
Provide examples of projects where you have successfully applied ML models.
Highlight any notable achievements or results obtained using ML models.
Demonstrate your understanding of model evaluation and validation techniq
Recall and Precision are evaluation metrics used in classification tasks to measure the performance of a model.
Recall measures the ability of a model to find all the relevant instances in a dataset.
Precision measures the ability of a model to correctly identify only the relevant instances.
Recall and Precision are often used together to evaluate the trade-off between completeness and correctness in a model's predictions...
The duration of TCS Data Scientist interview process can vary, but typically it takes about less than 2 weeks to complete.
2 Interview rounds
based on 123 reviews
Rating in categories
System Engineer
1.1L
salaries
| ₹1 L/yr - ₹9 L/yr |
IT Analyst
68.1k
salaries
| ₹5.1 L/yr - ₹16 L/yr |
AST Consultant
51.1k
salaries
| ₹8 L/yr - ₹25 L/yr |
Assistant System Engineer
31.4k
salaries
| ₹0.3 L/yr - ₹5.6 L/yr |
Associate Consultant
28.6k
salaries
| ₹8.9 L/yr - ₹31.9 L/yr |
Amazon
Wipro
Infosys
Accenture