Filter interviews by
Web Scrapping Assignment
Looker Studio Dashboarding
TF-IDF considers term frequency and inverse document frequency, while Bag of Words only counts word occurrences.
TF-IDF assigns weights to words based on their frequency in a document and across all documents.
Bag of Words represents text as a collection of words and their frequencies, ignoring the order of words.
TF-IDF is more advanced and takes into account the importance of words in a document, while Bag of Words is s
Overfitting occurs when a model learns the noise in the training data rather than the underlying pattern.
Overfitting happens when a model is too complex and captures noise in the training data.
It leads to poor generalization to new data, as the model is too specific to the training set.
Techniques to prevent overfitting include cross-validation, regularization, and early stopping.
Example: A decision tree with too many b...
Bias is error due to overly simplistic assumptions, variance is error due to overly complex models.
Bias is error from erroneous assumptions in the learning algorithm, leading to underfitting.
Variance is error from too much complexity in the learning algorithm, leading to overfitting.
Bias and variance are inversely related - increasing one decreases the other.
Example: A linear regression model may have high bias but low...
I applied via Recruitment Consulltant and was interviewed in Jul 2022. There was 1 interview round.
Tokens in Postgres are the smallest unit of input that can be processed by the parser.
Tokens are used to identify and categorize the different parts of a SQL statement.
Examples of tokens include keywords, identifiers, operators, and literals.
The parser uses tokens to create a parse tree, which is used to execute the SQL statement.
Logistic regression is a type of classifier that uses a logistic function to model a binary dependent variable.
Logistic regression is a statistical method used to analyze a dataset in which there are one or more independent variables that determine an outcome.
It is called logistic regression because it uses a logistic function to model a binary dependent variable.
The logistic function is an S-shaped curve that can take...
I applied via Company Website and was interviewed before Feb 2023. There were 2 interview rounds.
Coding python and sql related
Top trending discussions
I was interviewed in Dec 2024.
I am a data analyst with a background in statistics and experience in analyzing large datasets.
Background in statistics
Experience in analyzing large datasets
Proficient in data visualization tools like Tableau
Strong problem-solving skills
Excellent communication skills
I would rate myself a 4 out of 5 in SQL proficiency.
Proficient in writing complex SQL queries
Experienced in optimizing database performance
Familiar with data manipulation and analysis functions
Comfortable working with large datasets
I use Power BI to analyze and visualize data for insights and decision-making in my work.
Connect to data sources to import data
Transform and clean data using Power Query Editor
Create relationships between different data tables
Design interactive reports and dashboards
Use DAX formulas for calculations and measures
Share reports with stakeholders and collaborate on insights
I applied via Recruitment Consulltant and was interviewed in Nov 2024. There were 2 interview rounds.
I applied via AmbitionBox and was interviewed in Nov 2024. There were 3 interview rounds.
NLP is my favorite subject. Is relatated medical and sign language. I have done reseach with mechien learning and AI TOOLS used for problem solving approach.
A group discussion was conducted on data-based management regarding Mr. Adani's food industry products in the state of Gujarat and their exports to other countries, utilizing data analytics.
Sql python bigdata hadoop hive spark
1 hour simple coding python sql
SQL allows for efficient data retrieval, manipulation, and analysis in relational databases.
SQL is widely used in querying databases to retrieve specific data.
It allows for data manipulation, such as adding, updating, and deleting records.
SQL can perform complex data analysis tasks, such as aggregations and joins.
It provides a standardized language for interacting with relational databases.
SQL is essential for generati...
Python is a versatile programming language known for its simplicity, readability, and vast library support.
Easy to learn and use, making it ideal for beginners and experienced programmers alike
Extensive library support for data analysis, machine learning, web development, and more
Strong community support with active forums and resources for problem-solving
Cross-platform compatibility allows for seamless integration wit...
Spark is a fast and powerful big data processing framework that offers benefits like speed, ease of use, and versatility.
Spark is known for its speed, as it can process data up to 100 times faster than traditional Hadoop MapReduce.
It offers ease of use with high-level APIs in Java, Scala, Python, and SQL, making it accessible to a wide range of users.
Spark is versatile, supporting various workloads such as batch proces...
Huddles of data engineers refer to collaborative meetings or discussions among data engineers to share insights, solve problems, and make decisions.
Huddles are typically informal and can be scheduled or ad-hoc.
They provide a platform for data engineers to brainstorm, troubleshoot, and exchange ideas.
Huddles may involve reviewing code, discussing data pipelines, or addressing technical challenges.
Effective huddles promo...
Alteryx data processing and reporting
I applied via Walk-in and was interviewed in Nov 2024. There were 2 interview rounds.
Online aptitude test with maths and statistics and data analysis
I applied via Campus Placement and was interviewed in Nov 2024. There was 1 interview round.
based on 2 interviews
Interview experience
based on 7 reviews
Rating in categories
Data Science Trainee
175
salaries
| ₹1 L/yr - ₹7.2 L/yr |
Data Science Intern
50
salaries
| ₹1 L/yr - ₹8 L/yr |
Data Analyst
39
salaries
| ₹2 L/yr - ₹9.3 L/yr |
Data Scientist
31
salaries
| ₹2.6 L/yr - ₹9.4 L/yr |
Data Analyst Trainee
22
salaries
| ₹1 L/yr - ₹6.3 L/yr |
upGrad
Simplilearn
Great Learning
Jigsaw Academy