Filter interviews by
I applied via Referral and was interviewed in Aug 2023. There was 1 interview round.
Coalesce is used to reduce the number of partitions in a DataFrame or RDD, while repartition is used to increase the number of partitions.
Coalesce is a narrow transformation that can only decrease the number of partitions.
Repartition is a wide transformation that can increase or decrease the number of partitions.
Coalesce is preferred over repartition when reducing the number of partitions.
Repartition shuffles the data ...
Top trending discussions
Approach involves data preprocessing, model training, evaluation, and interpretation.
Perform data preprocessing such as handling missing values, encoding categorical variables, and scaling features.
Split the data into training and testing sets.
Train the logistic regression model on the training data.
Evaluate the model using metrics like accuracy, precision, recall, and F1 score.
Interpret the model coefficients to under...
I would seek opportunities to apply my skills in related fields within the company.
Explore other departments or teams within the company that may have projects related to my field of interest
Offer to collaborate with colleagues in different departments to bring a new perspective to their projects
Seek out professional development opportunities to expand my skills and knowledge in related areas
I applied via Referral and was interviewed in Jun 2024. There were 2 interview rounds.
The assessment consists of a dataset for which we are required to build a machine learning model and submit the results along with code and detailed documentation
Ensemble models are machine learning models that combine multiple individual models to improve predictive performance.
Ensemble models work by aggregating predictions from multiple models to make a final prediction.
Common types of ensemble models include Random Forest, Gradient Boosting, and AdaBoost.
Ensemble models are often more accurate and robust than individual models.
They can reduce overfitting and increase genera...
Bagging and boosting are ensemble learning techniques used to improve the performance of machine learning models by combining multiple weak learners.
Bagging (Bootstrap Aggregating) involves training multiple models independently on different subsets of the training data and then combining their predictions through averaging or voting.
Boosting involves training multiple models sequentially, where each subsequent model c...
I applied via LinkedIn
SQL, Python, like case about window function
I applied via Company Website and was interviewed before Mar 2023. There were 3 interview rounds.
1.Basic SQL
2. Python based question
3.Data modelling
4. Spark
5. Cloud based questions
Sql,Python,Data Modeling and Project based questions
I applied via Referral and was interviewed in Mar 2023. There were 2 interview rounds.
I have 3 years of experience as a data analyst in the finance industry.
Worked with large datasets to extract meaningful insights
Performed data cleaning, transformation, and visualization
Created and maintained dashboards and reports for stakeholders
Conducted statistical analysis and built predictive models
Collaborated with cross-functional teams to identify business opportunities
I have strong skills in data analysis, including proficiency in statistical analysis, data visualization, and programming languages such as Python and SQL.
Proficient in statistical analysis techniques
Skilled in data visualization using tools like Tableau
Strong programming skills in Python and SQL
Experience with data cleaning and preprocessing
Ability to interpret and communicate insights from data
Familiarity with machin...
I applied via Company Website and was interviewed in Jul 2023. There were 4 interview rounds.
Number sires, clock, logic, arithmetic, geometry
Code test basic, ans the basic knowledge, write code
Python is a high-level programming language known for its simplicity and readability.
Python is widely used for web development, data analysis, artificial intelligence, and scientific computing.
It emphasizes code readability and uses indentation to define code blocks.
Python has a large standard library and a vibrant community of developers.
Example: print('Hello, World!')
Example: import pandas as pd
Python is a versatile programming language used for data analysis, web development, artificial intelligence, automation, and more.
Data analysis and visualization
Web development (Django, Flask)
Artificial intelligence and machine learning (TensorFlow, PyTorch)
Automation and scripting
Scientific computing (NumPy, SciPy)
Python is a high-level programming language known for its simplicity and readability.
Python is an interpreted language, meaning it does not need to be compiled before running.
It supports multiple programming paradigms, including object-oriented, imperative, and functional programming.
Python has a large standard library and a thriving community, making it versatile and widely used.
Example: Python is used for web develop...
Object-oriented programming (OOP) is a programming paradigm based on the concept of 'objects', which can contain data and code.
OOP allows for the organization of code into reusable components called classes.
Classes can have attributes (variables) and methods (functions) associated with them.
In Python, everything is an object, and classes can be defined using the 'class' keyword.
Encapsulation, inheritance, and polymorph
An array in Python is a data structure that stores a collection of elements of the same type.
Arrays can store elements such as integers, floats, or strings.
Arrays are indexed starting from 0, with elements accessed using their index.
Example: arr = ['apple', 'banana', 'cherry']
45 mins: AWS,Data warehouse,Python
OOPS concepts refer to Object-Oriented Programming principles like Inheritance, Encapsulation, Polymorphism, and Abstraction.
Inheritance: Allows a class to inherit properties and behavior from another class.
Encapsulation: Bundling data and methods that operate on the data into a single unit.
Polymorphism: Ability to present the same interface for different data types.
Abstraction: Hiding the complex implementation detail
BigQuery is a fully managed, serverless data warehouse by Google Cloud for analyzing large datasets using SQL.
Fully managed and serverless data warehouse
Allows for analyzing large datasets using SQL
Integrates with other Google Cloud services like Data Studio and AI Platform
I applied via Recruitment Consulltant and was interviewed in Feb 2022. There was 1 interview round.
based on 1 interview
Interview experience
Software Engineer
138
salaries
| ₹6.2 L/yr - ₹26 L/yr |
Senior Software Engineer
53
salaries
| ₹16 L/yr - ₹35 L/yr |
Technical Architect
21
salaries
| ₹50 L/yr - ₹73 L/yr |
Senior Data Scientist
18
salaries
| ₹18 L/yr - ₹40 L/yr |
Software Developer
16
salaries
| ₹7.8 L/yr - ₹18 L/yr |
OLX
Flipkart
Snapdeal
Paytm Mall