i
Publicis Sapient
Filter interviews by
I applied via Approached by Company and was interviewed in Sep 2024. There were 3 interview rounds.
MCQ questions based on data science fundamentals. Total of 30 questions to be solved in 30 mins. Completed in 9 mins. Got selected for round 2
Case study to create a POC. Successfully completed and moved to round 3.
Top trending discussions
I applied via Company Website and was interviewed in Jan 2024. There was 1 interview round.
Extract India players from a dictionary using list comprehension
Use list comprehension to filter out players with nationality as 'India'
Create a new list with only the India players
Example: [player for player, nationality in CSK.items() if nationality == 'India']
Use sets to find common elements in three lists.
Convert the lists to sets for efficient comparison.
Use the intersection method to find common elements.
Return the common elements as a set or list.
Printing specific rows from a database using Pandas in Python
Use Pandas library to read the database into a DataFrame
Use iloc method to select specific rows by index
Print the selected rows
Use scikit-learn library to split dataset into train, test, and validation sets
Import train_test_split from sklearn.model_selection
Specify test_size and validation_size when splitting the dataset
Example: X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
Use DISTINCT keyword to print unique values and DELETE with a subquery to remove duplicate rows.
Use SELECT DISTINCT column_name FROM table_name to print unique values.
Use DELETE FROM table_name WHERE row_id NOT IN (SELECT MAX(row_id) FROM table_name GROUP BY column_name) to delete duplicate rows.
Use SQL SELECT statement with WHERE clause to filter rows based on a specific criterion.
Use SELECT statement with WHERE clause to specify the criterion (ex: salary > 100000)
Example: SELECT * FROM employees WHERE salary > 100000;
Ensure proper syntax and column names are used in the query
Use subquery to find rows with second highest criterion value in SQL without using offset function.
Use a subquery to find the maximum criterion value
Then use another subquery to find the maximum value that is less than the maximum value found in the first subquery
Finally, select rows with the second highest criterion value
Print rows with the same set of values in a column
Identify unique sets of values in the column
Group rows based on these unique sets of values
Print out the rows for each unique set of values
Power BI is a Microsoft product focused on business intelligence and data visualization, while Tableau is a standalone data visualization tool.
Power BI is more user-friendly and integrates well with other Microsoft products.
Tableau is known for its powerful data visualization capabilities and flexibility in creating complex visualizations.
Power BI is often preferred by organizations already using Microsoft products, wh...
Power BI and Tableau have limitations in terms of data connectivity, customization, and pricing.
Limited data connectivity options compared to other tools
Limited customization capabilities for advanced analytics
High pricing for enterprise-level features
Tableau has better visualization capabilities but can be more complex to use
Power BI is more user-friendly but may lack certain advanced features
Types of regression models include linear regression, logistic regression, polynomial regression, ridge regression, and lasso regression.
Linear regression: used to model the relationship between a dependent variable and one or more independent variables.
Logistic regression: used for binary classification problems, where the output is a probability value between 0 and 1.
Polynomial regression: fits a curve to the data by...
Linear regression is used for continuous variables, while logistic regression is used for binary outcomes.
Linear regression predicts continuous outcomes, while logistic regression predicts binary outcomes.
Linear regression uses a linear equation to model the relationship between the independent and dependent variables.
Logistic regression uses the logistic function to model the probability of a binary outcome.
Linear reg...
Random Forest is an ensemble method using multiple decision trees, while Decision Tree is a single tree-based model.
Random Forest is a collection of decision trees that are trained on random subsets of the data.
Decision Tree is a single tree structure that makes decisions by splitting the data based on features.
Random Forest reduces overfitting by averaging the predictions of multiple trees.
Decision Tree can be prone t...
ETL stands for Extract, Transform, Load. It is a process of extracting data from various sources, transforming it into a usable format, and loading it into a target database.
ETL tools include Informatica PowerCenter, Talend, Apache Nifi, Microsoft SQL Server Integration Services (SSIS), and IBM InfoSphere DataStage.
Extract: Data is extracted from various sources such as databases, files, APIs, etc.
Transform: Data is cl...
posted on 23 Jun 2023
I applied via Recruitment Consulltant and was interviewed in Sep 2021. There were 3 interview rounds.
HackeRank Test
Data classifications with scrubbing techniques
Sensitive data: remove or mask personally identifiable information (PII)
Outliers: remove or correct data points that are significantly different from the rest
Duplicate data: remove or merge identical data points
Inconsistent data: correct or remove data points that do not fit the expected pattern
Invalid data: remove or correct data points that do not make sense or violate co
I applied via Job Fair and was interviewed in May 2024. There were 3 interview rounds.
They gave a span of 3 days to build an AI-powered webapp
I have experience working with cloud technologies such as AWS, Azure, and Google Cloud Platform.
Experience in setting up and managing virtual machines, storage, and networking in cloud environments
Knowledge of cloud services like EC2, S3, RDS, and Lambda
Experience with cloud-based data processing and analytics tools like AWS Glue and Google BigQuery
Developed a predictive model for customer churn in a telecom company
Collected and cleaned customer data from various sources
Performed exploratory data analysis to identify key factors influencing churn
Built and fine-tuned machine learning models to predict customer churn
Challenges included imbalanced data, feature engineering, and model interpretability
I was interviewed in May 2024.
Maths and stats refer to the study of mathematical concepts and statistical methods for analyzing data.
Maths involves the study of numbers, quantities, shapes, and patterns.
Stats involves collecting, analyzing, interpreting, and presenting data.
Maths is used to solve equations, calculate probabilities, and model real-world phenomena.
Stats is used to make informed decisions, draw conclusions, and test hypotheses.
Both ma...
Confusion matrix what are your job rolls explain me Gradient boosting algorithm?
posted on 18 Jan 2025
I was interviewed in Dec 2024.
Asked the question about ml and basic python questions
I applied via Campus Placement
Bulb and switch puzzle
Rope burning and length question
posted on 7 Oct 2023
Basic DP, Array Questions
I applied via Naukri.com and was interviewed in Mar 2023. There were 2 interview rounds.
Supervised learning is a type of machine learning where the algorithm learns from labeled data to make predictions or decisions.
Supervised learning requires labeled data, where the input and output variables are known.
The algorithm learns from the labeled data to make predictions or decisions on new, unseen data.
Examples include classification, regression, and prediction tasks.
Popular algorithms include decision trees,
Interview experience
based on 2 reviews
Rating in categories
Senior Associate
2.2k
salaries
| ₹10.9 L/yr - ₹40 L/yr |
Associate Technology L2
1.5k
salaries
| ₹6.5 L/yr - ₹20 L/yr |
Senior Associate Technology L1
1.2k
salaries
| ₹10.1 L/yr - ₹32 L/yr |
Senior Software Engineer
730
salaries
| ₹9.2 L/yr - ₹37 L/yr |
Senior Associate 2
610
salaries
| ₹14 L/yr - ₹41 L/yr |
Accenture
IBM
TCS
Infosys