i
Publicis Sapient
Filter interviews by
I applied via campus placement at Institute of Technology, Banaras Hindu University and was interviewed before Sep 2021. There were 2 interview rounds.
1. There were two coding problems from leetcode (1 easy and 1 mediam).
2. One was machine learning problem having hingh weightage. They given a data and we have to make a linear regression model on data and submit prediction.
Random forest is an ensemble learning method that constructs a multitude of decision trees and outputs the mode of the classes. Gini impurity is a measure of impurity or randomness used in decision trees.
Random forest is a collection of decision trees that are trained on different subsets of the data.
Each decision tree in the forest is trained on a random subset of the features.
The final prediction is made by taking th...
posted on 26 Nov 2024
I applied via Campus Placement
It was related to Aptitude MCQ and 2-coding test
A palindrome of a number is a number that remains the same when its digits are reversed.
To check if a number is a palindrome, reverse the number and compare it with the original number.
Examples: 121 is a palindrome, 123 is not a palindrome.
Merging two linked lists involves combining the elements of both lists into a single list.
Create a new linked list to store the merged elements
Traverse through both linked lists and add elements to the new list
Handle cases where one list is longer than the other
I applied via Job Fair and was interviewed in May 2024. There were 3 interview rounds.
They gave a span of 3 days to build an AI-powered webapp
I have experience working with cloud technologies such as AWS, Azure, and Google Cloud Platform.
Experience in setting up and managing virtual machines, storage, and networking in cloud environments
Knowledge of cloud services like EC2, S3, RDS, and Lambda
Experience with cloud-based data processing and analytics tools like AWS Glue and Google BigQuery
Developed a predictive model for customer churn in a telecom company
Collected and cleaned customer data from various sources
Performed exploratory data analysis to identify key factors influencing churn
Built and fine-tuned machine learning models to predict customer churn
Challenges included imbalanced data, feature engineering, and model interpretability
I applied via Company Website and was interviewed in Jan 2024. There was 1 interview round.
Extract India players from a dictionary using list comprehension
Use list comprehension to filter out players with nationality as 'India'
Create a new list with only the India players
Example: [player for player, nationality in CSK.items() if nationality == 'India']
Use sets to find common elements in three lists.
Convert the lists to sets for efficient comparison.
Use the intersection method to find common elements.
Return the common elements as a set or list.
Printing specific rows from a database using Pandas in Python
Use Pandas library to read the database into a DataFrame
Use iloc method to select specific rows by index
Print the selected rows
Use scikit-learn library to split dataset into train, test, and validation sets
Import train_test_split from sklearn.model_selection
Specify test_size and validation_size when splitting the dataset
Example: X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
Use DISTINCT keyword to print unique values and DELETE with a subquery to remove duplicate rows.
Use SELECT DISTINCT column_name FROM table_name to print unique values.
Use DELETE FROM table_name WHERE row_id NOT IN (SELECT MAX(row_id) FROM table_name GROUP BY column_name) to delete duplicate rows.
Use SQL SELECT statement with WHERE clause to filter rows based on a specific criterion.
Use SELECT statement with WHERE clause to specify the criterion (ex: salary > 100000)
Example: SELECT * FROM employees WHERE salary > 100000;
Ensure proper syntax and column names are used in the query
Use subquery to find rows with second highest criterion value in SQL without using offset function.
Use a subquery to find the maximum criterion value
Then use another subquery to find the maximum value that is less than the maximum value found in the first subquery
Finally, select rows with the second highest criterion value
Print rows with the same set of values in a column
Identify unique sets of values in the column
Group rows based on these unique sets of values
Print out the rows for each unique set of values
Power BI is a Microsoft product focused on business intelligence and data visualization, while Tableau is a standalone data visualization tool.
Power BI is more user-friendly and integrates well with other Microsoft products.
Tableau is known for its powerful data visualization capabilities and flexibility in creating complex visualizations.
Power BI is often preferred by organizations already using Microsoft products, wh...
Power BI and Tableau have limitations in terms of data connectivity, customization, and pricing.
Limited data connectivity options compared to other tools
Limited customization capabilities for advanced analytics
High pricing for enterprise-level features
Tableau has better visualization capabilities but can be more complex to use
Power BI is more user-friendly but may lack certain advanced features
Types of regression models include linear regression, logistic regression, polynomial regression, ridge regression, and lasso regression.
Linear regression: used to model the relationship between a dependent variable and one or more independent variables.
Logistic regression: used for binary classification problems, where the output is a probability value between 0 and 1.
Polynomial regression: fits a curve to the data by...
Linear regression is used for continuous variables, while logistic regression is used for binary outcomes.
Linear regression predicts continuous outcomes, while logistic regression predicts binary outcomes.
Linear regression uses a linear equation to model the relationship between the independent and dependent variables.
Logistic regression uses the logistic function to model the probability of a binary outcome.
Linear reg...
Random Forest is an ensemble method using multiple decision trees, while Decision Tree is a single tree-based model.
Random Forest is a collection of decision trees that are trained on random subsets of the data.
Decision Tree is a single tree structure that makes decisions by splitting the data based on features.
Random Forest reduces overfitting by averaging the predictions of multiple trees.
Decision Tree can be prone t...
ETL stands for Extract, Transform, Load. It is a process of extracting data from various sources, transforming it into a usable format, and loading it into a target database.
ETL tools include Informatica PowerCenter, Talend, Apache Nifi, Microsoft SQL Server Integration Services (SSIS), and IBM InfoSphere DataStage.
Extract: Data is extracted from various sources such as databases, files, APIs, etc.
Transform: Data is cl...
posted on 8 Oct 2024
Random forest is an ensemble learning method used for classification and regression tasks.
Random forest is a collection of decision trees that are trained on random subsets of the data.
Each tree in the random forest independently predicts the target variable, and the final prediction is made by averaging the predictions of all trees.
Random forest is robust to overfitting and noisy data, and it can handle large datasets...
I was interviewed in May 2024.
Maths and stats refer to the study of mathematical concepts and statistical methods for analyzing data.
Maths involves the study of numbers, quantities, shapes, and patterns.
Stats involves collecting, analyzing, interpreting, and presenting data.
Maths is used to solve equations, calculate probabilities, and model real-world phenomena.
Stats is used to make informed decisions, draw conclusions, and test hypotheses.
Both ma...
Confusion matrix what are your job rolls explain me Gradient boosting algorithm?
posted on 18 Jan 2025
I was interviewed in Dec 2024.
Asked the question about ml and basic python questions
posted on 6 May 2024
I applied via Recruitment Consulltant and was interviewed in Apr 2024. There was 1 interview round.
posted on 7 Oct 2023
Basic DP, Array Questions
I applied via Recruitment Consulltant and was interviewed before Aug 2023. There were 2 interview rounds.
Easy array questions.
Developed a machine learning model to predict customer churn for a telecom company.
Used Python and scikit-learn to preprocess data and build the model
Performed feature engineering to improve model performance
Evaluated model using metrics like accuracy, precision, and recall
Some of the top questions asked at the Publicis Sapient Data Science Intern interview -
based on 1 review
Rating in categories
Senior Associate
2.2k
salaries
| ₹11 L/yr - ₹40 L/yr |
Associate Technology L2
1.5k
salaries
| ₹6.5 L/yr - ₹20 L/yr |
Senior Associate Technology L1
1.2k
salaries
| ₹10 L/yr - ₹30 L/yr |
Senior Software Engineer
739
salaries
| ₹9.5 L/yr - ₹37 L/yr |
Senior Associate 2
622
salaries
| ₹14.1 L/yr - ₹41 L/yr |
Accenture
IBM
TCS
Infosys