i
Vertoz
Filter interviews by
I applied via LinkedIn and was interviewed in Jul 2024. There was 1 interview round.
SQL Window Functions like RANK and DENSE RANK are used to assign a rank to rows within a partition.
RANK function assigns a unique rank to each distinct row within a partition.
DENSE RANK function assigns a unique rank to each distinct row within a partition, but without any gaps.
Both functions are used with the OVER() clause in SQL to define the partition and order of rows.
Top trending discussions
I applied via Referral and was interviewed in Nov 2021. There was 1 interview round.
Coalesce is used to select the first non-null value from a set of columns. Repartition is used to shuffle data across nodes.
Coalesce reduces the number of partitions to the minimum required.
Repartition increases or decreases the number of partitions.
Coalesce is a narrow transformation while repartition is a wide transformation.
Coalesce is used to optimize data for queries while repartition is used to balance data acros...
Optimizing joins involves selecting appropriate join types, indexing tables, and minimizing data movement.
Choose the appropriate join type based on the size and structure of the tables being joined
Index the tables on the join columns to speed up the join process
Minimize data movement by selecting only the necessary columns and filtering rows before joining
Consider using denormalization or materialized views to precompu
RDD is a low-level distributed data structure while DataFrame is a high-level structured data abstraction.
RDD is immutable and unstructured while DataFrame is structured and has a schema
DataFrames are optimized for SQL queries and can be cached in memory
RDDs are more flexible and can be used for complex data processing tasks
DataFrames are easier to use and provide a more concise syntax for data manipulation
RDDs are the...
I applied via campus placement at Dhirubhai Ambani Institute of Information and Communication Technology (DA-IICT), Gandhinagar and was interviewed in Oct 2024. There were 2 interview rounds.
It was a 100-minute test consisting of mcqs and one coding question on articulation points in the graph.
Postgresql is known for its advanced features and support for complex queries, while MySQL is popular for its ease of use and speed.
Postgresql is known for its advanced features such as support for complex queries, indexing, and JSON data types.
MySQL is popular for its ease of use, speed, and compatibility with various platforms.
Postgresql has a strong focus on standards compliance and extensibility, while MySQL is mor...
ACID properties in DBMS ensure data integrity and consistency in transactions.
Atomicity: All operations in a transaction are completed successfully or none at all.
Consistency: Data is always in a valid state before and after a transaction.
Isolation: Transactions are executed independently without interference from other transactions.
Durability: Once a transaction is committed, changes are permanent and cannot be lost.
E...
I applied via Approached by Company and was interviewed in Oct 2024. There were 2 interview rounds.
I applied via Approached by Company and was interviewed in Jul 2024. There was 1 interview round.
Invoke controller is a component in a software application that manages the flow of requests and responses between the user interface and the backend services.
Manages the communication between user interface and backend services
Handles the routing of requests to appropriate functions or methods
Coordinates the flow of data and actions within the application
Facade in Laravel is a design pattern that provides a simple interface to a complex subsystem.
Facade provides a static interface to classes that are available in the application's service container.
It is used to simplify the usage of complex classes by providing a simpler interface.
Facade allows you to access Laravel's features without needing to instantiate objects or use dependency injection.
Example: The Auth facade ...
I applied via Recruitment Consulltant and was interviewed in Dec 2023. There was 1 interview round.
Build a MMX Model for a given dataset and share insights
Preprocess the data by handling missing values and encoding categorical variables
Split the data into training and testing sets
Build the MMX model using appropriate algorithms like decision trees or random forests
Evaluate the model using metrics like accuracy, precision, recall, and F1 score
Interpret the model results to gain insights and make data-driven decisions
Time series analysis can be evaluated by examining the accuracy of forecasts, the model's ability to capture trends and patterns, and the overall performance metrics.
Evaluate forecast accuracy using metrics like Mean Absolute Error (MAE) or Root Mean Squared Error (RMSE)
Assess the model's ability to capture trends and patterns by visualizing the data and comparing it to the model's predictions
Analyze the overall perfor...
I applied via LinkedIn and was interviewed before Aug 2023. There were 2 interview rounds.
Union combines and removes duplicates, Union All combines without removing duplicates.
Union combines result sets and removes duplicates
Union All combines result sets without removing duplicates
Union is slower than Union All as it involves removing duplicates
Example: SELECT column1 FROM table1 UNION SELECT column1 FROM table2;
Example: SELECT column1 FROM table1 UNION ALL SELECT column1 FROM table2;
To show top 5 in pandas, use the nlargest() function.
Use the nlargest() function with the 'n' parameter set to 5 to get the top 5 values in a pandas DataFrame.
For example: df['column_name'].nlargest(5) will return the top 5 values in the specified column.
A scatter plot is a better representation for 3 numerical columns.
Use a scatter plot to show the relationship between the numerical columns.
Scatter plots are effective for visualizing correlations and patterns in data.
Each point on the plot represents a data point with values from all 3 columns.
I applied via LinkedIn and was interviewed in Sep 2023. There were 2 interview rounds.
XGBoost is preferred over Random Forest due to its faster execution speed and better performance in complex datasets.
XGBoost is faster than Random Forest due to its optimized implementation of gradient boosting algorithm.
XGBoost generally performs better in complex datasets with high-dimensional features.
XGBoost allows for more fine-tuning of hyperparameters compared to Random Forest.
XGBoost has regularization techniqu...
I applied via campus placement at Crescent Engineering College, Chennai and was interviewed in May 2022. There were 2 interview rounds.
Basic general English language
Basic question
I applied via LinkedIn and was interviewed before Oct 2022. There were 4 interview rounds.
Basic dsa question in python and data engineering questions ,sql
Basic dsa question in python and data engineering questions
Interview experience
based on 1 review
Rating in categories
Software Engineer
16
salaries
| ₹2.4 L/yr - ₹8 L/yr |
AD Operations Specialist
12
salaries
| ₹2.5 L/yr - ₹6 L/yr |
Business Development Specialist
12
salaries
| ₹2.2 L/yr - ₹3 L/yr |
Talent Acquisition Specialist
9
salaries
| ₹2 L/yr - ₹2.3 L/yr |
System Administrator
7
salaries
| ₹2.4 L/yr - ₹5.5 L/yr |
InMobi
Komli Media
Affle
Amagi Media Labs