Filter interviews by
I applied via Naukri.com and was interviewed in Aug 2023. There were 3 interview rounds.
Q. 1 Using Python, How can we extract 3-grams from the given document and obtain a list of the generated 3-gram phrases
Q. 2 You are given an integer array containing distinct numbers and you can perform the following operations until the array is empty : if the first element has the smallest value, remove it. otherwise , put the element at the end of the array. Return an integer denoting the number of operations it takes to make nums empty.
Example: Input nums = [3,4,-1]
Output:5
Operation Array
1 [4,3,-1]
2 [-1,3,4]
3 [3,4]
4 [4]
5 []
Q.3 Write a python Code to initialize k cluster centroids for the k-means clustering algorithm using a random dataset.
Q. 4 Write a python function that takes as input and performs stop word removal, stemming, and tokenization. The function should return a list of processed tokens.
Q. 5 Given an integer array nums, move all the even integers at the beginning of the array followed by all the odd integers. Return any array that satisfies this condition.
Input = [3,1,2,4]
expected Output = [2,4,1,3]
Q. 3
I applied via Approached by Company and was interviewed before Oct 2022. There were 3 interview rounds.
Top trending discussions
Pre-processing steps involve cleaning, transforming, and preparing data for machine learning models.
Data cleaning: removing missing values, duplicates, and outliers
Data transformation: scaling, encoding categorical variables, and feature engineering
Data normalization: ensuring all features have the same scale
Data splitting: dividing data into training and testing sets
Lemmatization is the process of reducing words to their base or root form.
Lemmatization helps in standardizing words for analysis.
It reduces inflected words to their base form.
For example, 'running' becomes 'run' after lemmatization.
I appeared for an interview in Oct 2024.
OHE is a technique used in machine learning to convert categorical data into a binary format.
OHE is used to convert categorical variables into a format that can be provided to ML algorithms.
Each category is represented by a binary vector where only one element is 'hot' (1) and the rest are 'cold' (0).
For example, if we have a 'color' feature with categories 'red', 'blue', 'green', OHE would represent them as [1, 0, 0],...
Conditional probability is the likelihood of an event occurring given that another event has already occurred.
Conditional probability is calculated using the formula P(A|B) = P(A and B) / P(B)
It represents the probability of event A happening, given that event B has already occurred
Conditional probability is used in various fields such as machine learning, statistics, and finance
Precision and recall are evaluation metrics used in machine learning to measure the performance of a classification model.
Precision is the ratio of correctly predicted positive observations to the total predicted positive observations.
Recall is the ratio of correctly predicted positive observations to the all observations in actual class.
Precision is important when the cost of false positives is high, while recall is i...
I applied via Company Website and was interviewed in Jul 2023. There were 4 interview rounds.
Python basic coding questions (pandas, numpy, OOPs concept.
AI ML theory questions
I applied via Referral and was interviewed in Jul 2024. There were 3 interview rounds.
30 MCQs where 15 Need to be answered correctly to get shortlisted.
Sanfoundary source is very helpful in cracking it.
Oops concepts refer to Object-Oriented Programming principles such as Inheritance, Encapsulation, Polymorphism, and Abstraction.
Inheritance: Allows a class to inherit properties and behavior from another class.
Encapsulation: Bundling data and methods that operate on the data into a single unit.
Polymorphism: Ability to present the same interface for different data types.
Abstraction: Hiding the complex implementation det...
File handling refers to the process of managing and manipulating files on a computer system.
File handling involves tasks such as creating, reading, writing, updating, and deleting files.
Common file operations include opening a file, reading its contents, writing data to it, and closing the file.
File handling in programming languages often involves using functions or libraries specifically designed for file operations.
E...
Supervised learning uses labeled data to train a model, while unsupervised learning finds patterns in unlabeled data.
Supervised learning requires input-output pairs for training
Examples include linear regression, support vector machines, and neural networks
Unsupervised learning clusters data based on similarities or patterns
Examples include k-means clustering, hierarchical clustering, and principal component analysis
I applied via Naukri.com and was interviewed in Jul 2024. There was 1 interview round.
Developed a sentiment analysis model using NLP to analyze customer reviews for a product.
Collected and preprocessed text data from various sources
Performed tokenization, stopword removal, and lemmatization
Built a machine learning model using techniques like TF-IDF and LSTM
Evaluated the model's performance using metrics like accuracy and F1 score
Deployed the model for real-time sentiment analysis of new reviews
Cosine similarity is a measure of similarity between two non-zero vectors in an inner product space.
It measures the cosine of the angle between the two vectors.
Values range from -1 (completely opposite) to 1 (exactly the same).
Used in recommendation systems, text mining, and clustering algorithms.
Iterator is an object that allows iteration over a collection, while iterable is an object that can be iterated over.
Iterator is an object with a next() method that returns the next item in the collection.
Iterable is an object that has an __iter__() method which returns an iterator.
Example: List is iterable, while iter(list) returns an iterator.
Python function to calculate cosine similarity between two vectors.
Define a function that takes two vectors as input.
Calculate the dot product of the two vectors.
Calculate the magnitude of each vector and multiply them.
Divide the dot product by the product of magnitudes to get cosine similarity.
F1 score is a metric used to evaluate the performance of a classification model by considering both precision and recall.
F1 score is the harmonic mean of precision and recall, calculated as 2 * (precision * recall) / (precision + recall).
It is a better metric than accuracy when dealing with imbalanced datasets.
A high F1 score indicates a model with both high precision and high recall.
F1 score ranges from 0 to 1, where ...
I applied via Referral and was interviewed before Apr 2023. There was 1 interview round.
based on 2 interview experiences
Difficulty level
Duration
Software Engineer
12
salaries
| ₹5.7 L/yr - ₹9.7 L/yr |
Senior Software Engineer
9
salaries
| ₹4.5 L/yr - ₹10.7 L/yr |
Lead Software Engineer
8
salaries
| ₹8 L/yr - ₹14.2 L/yr |
Senior Software Developer
7
salaries
| ₹4.5 L/yr - ₹9 L/yr |
Software Developer
6
salaries
| ₹4 L/yr - ₹8 L/yr |
TCS
Accenture
Wipro
Cognizant