I applied via Walk-in and was interviewed in Apr 2024. There were 2 interview rounds.
SQL concepts and SQL queries
WHERE is used to filter rows before grouping, HAVING is used to filter groups after grouping.
WHERE is used with SELECT statement to filter rows based on a condition
HAVING is used with GROUP BY statement to filter groups based on a condition
WHERE is applied before grouping, HAVING is applied after grouping
Example: SELECT * FROM table_name WHERE column_name = 'value'
Example: SELECT column_name, COUNT(*) FROM table_name GROUP BY column_name HAVING COUNT(*) > 1
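A minimal runnable sketch of the same difference, using Python's built-in sqlite3 module and a made-up orders table (all names here are illustrative):

import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE orders (id INTEGER PRIMARY KEY, customer TEXT, amount REAL);
    INSERT INTO orders (customer, amount) VALUES
        ('alice', 50), ('alice', 70), ('bob', 20), ('bob', 10), ('carol', 200);
""")

# WHERE filters individual rows before any grouping happens
print(conn.execute(
    "SELECT customer, amount FROM orders WHERE amount > 30"
).fetchall())  # [('alice', 50.0), ('alice', 70.0), ('carol', 200.0)]

# HAVING filters whole groups after GROUP BY has aggregated them
print(conn.execute(
    "SELECT customer, SUM(amount) FROM orders GROUP BY customer HAVING SUM(amount) > 100"
).fetchall())  # e.g. [('alice', 120.0), ('carol', 200.0)]
conn.close()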
Delete removes rows one by one, while truncate removes all rows at once.
Delete is a DML command, while truncate is a DDL command
Delete can be rolled back, while truncate cannot be rolled back
Delete fires DELETE triggers, while truncate does not fire any triggers
Delete is slower than truncate for large tables
Example: DELETE FROM table_name WHERE condition;
Example: TRUNCATE TABLE table_name;
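A small sqlite3 sketch of the rollback point (SQLite has no TRUNCATE statement, so that side is only noted in a comment; the table name is made up):

import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE staging (id INTEGER)")
conn.executemany("INSERT INTO staging VALUES (?)", [(1,), (2,), (3,)])
conn.commit()

# DELETE is DML: it runs inside a transaction and can be rolled back
conn.execute("DELETE FROM staging WHERE id > 1")
conn.rollback()
print(conn.execute("SELECT COUNT(*) FROM staging").fetchone()[0])  # 3, the rows are back

# TRUNCATE TABLE staging;  -- DDL: deallocates the data in one shot and, in engines
# such as MySQL or Oracle, implicitly commits, so it cannot be rolled back
conn.close()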
Primary key uniquely identifies each record in a table, while foreign key establishes a link between two tables.
Primary key ensures each record is unique
Foreign key establishes a relationship between tables
Primary key can be a single column or a combination of columns
Foreign key references the primary key of another table
When creating a table, factors to consider include data types, column names, primary keys, relationships, and constraints.
Consider the data types for each column (e.g. integer, text, date)
Choose appropriate column names that are descriptive and easy to understand
Define primary keys to uniquely identify each row
Establish relationships between tables using foreign keys
Set constraints to enforce data integrity (e.g. unique, not null, check); a combined sketch of these points follows below
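Tying the last two answers together, a minimal sqlite3 sketch of a table definition with a primary key, a foreign key, and a few constraints (the schema and names are made up for illustration):

import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite only enforces foreign keys when enabled
conn.executescript("""
    CREATE TABLE departments (
        dept_id INTEGER PRIMARY KEY,          -- uniquely identifies each row
        name    TEXT NOT NULL UNIQUE          -- constraints enforce data integrity
    );
    CREATE TABLE employees (
        emp_id    INTEGER PRIMARY KEY,
        full_name TEXT NOT NULL,
        hired_on  DATE,
        salary    REAL CHECK (salary >= 0),
        dept_id   INTEGER REFERENCES departments(dept_id)  -- foreign key links the two tables
    );
""")
conn.close()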
I applied via Company Website and was interviewed before Aug 2021. There was 1 interview round.
I was interviewed in Dec 2024.
I am a data analyst with a background in statistics and experience in analyzing large datasets.
Background in statistics
Experience in analyzing large datasets
Proficient in data visualization tools like Tableau
Strong problem-solving skills
Excellent communication skills
I would rate myself a 4 out of 5 in SQL proficiency.
Proficient in writing complex SQL queries
Experienced in optimizing database performance
Familiar with data manipulation and analysis functions
Comfortable working with large datasets
I use Power BI to analyze and visualize data for insights and decision-making in my work.
Connect to data sources to import data
Transform and clean data using Power Query Editor
Create relationships between different data tables
Design interactive reports and dashboards
Use DAX formulas for calculations and measures
Share reports with stakeholders and collaborate on insights
I applied via Recruitment Consultant
I was interviewed in Dec 2024.
I applied via Naukri.com and was interviewed in Dec 2024. There was 1 interview round.
I applied via Naukri.com and was interviewed in Nov 2024. There was 1 interview round.
I am a Senior Data Engineer with experience in building scalable data pipelines and optimizing data processing workflows.
Experience in designing and implementing ETL processes using tools like Apache Spark and Airflow
Proficient in working with large datasets and optimizing query performance
Strong background in data modeling and database design
Worked on projects involving real-time data processing and streaming analytics
Decorators in Python are functions that modify the behavior of other functions or methods.
Decorators are defined using the @decorator_name syntax before a function definition.
They can be used to add functionality to existing functions without modifying their code.
Decorators can be used for logging, timing, authentication, and more.
Example: @staticmethod decorator in Python is used to define a static method in a class.
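A small sketch of a custom decorator along the lines described above, using the timing/logging use case (the names are illustrative):

import functools
import time

def log_timing(func):
    """Decorator that logs how long the wrapped function takes."""
    @functools.wraps(func)  # keep the original function's name and docstring
    def wrapper(*args, **kwargs):
        start = time.perf_counter()
        result = func(*args, **kwargs)
        print(f"{func.__name__} took {time.perf_counter() - start:.4f}s")
        return result
    return wrapper

@log_timing
def slow_sum(n):
    return sum(range(n))

slow_sum(1_000_000)  # prints something like: slow_sum took 0.0310s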
SQL query to group by employee ID and combine first name and last name with a space
Use the GROUP BY clause to group by employee ID
Use the CONCAT function to combine first name and last name with a space
Example (table name illustrative): SELECT employee_id, CONCAT(first_name, ' ', last_name) AS full_name FROM employees GROUP BY employee_id, first_name, last_name;
Constructors in Python are special methods used for initializing objects. They are called automatically when a new instance of a class is created.
Constructors are defined using the __init__() method in a class.
They are used to initialize instance variables of a class.
Example:
class Person:
    def __init__(self, name, age):
        self.name = name
        self.age = age

person1 = Person('Alice', 30)
Indexing in SQL is a technique used to improve the performance of queries by creating a data structure that allows for faster retrieval of data.
Indexes are created on columns in a database table to speed up the retrieval of rows that match a certain condition in a WHERE clause.
Indexes can be created using CREATE INDEX statement in SQL.
Types of indexes include clustered indexes, non-clustered indexes, unique indexes, and composite indexes.
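A quick sqlite3 sketch of creating an index and checking that the planner uses it (table and index names are made up):

import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (id INTEGER PRIMARY KEY, region TEXT, amount REAL)")

# Without an index, filtering on region scans the whole table
print(conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM sales WHERE region = 'APAC'"
).fetchall())  # plan detail shows: SCAN sales

# CREATE INDEX builds the lookup structure on the filtered column
conn.execute("CREATE INDEX idx_sales_region ON sales(region)")
print(conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM sales WHERE region = 'APAC'"
).fetchall())  # plan detail shows: SEARCH sales USING INDEX idx_sales_region (region=?)
conn.close()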
Spark works well with Parquet files due to its columnar storage format, efficient compression, and ability to push down filters.
Parquet files are columnar storage format, which aligns well with Spark's processing model of working on columns rather than rows.
Parquet files support efficient compression, reducing storage space and improving read performance in Spark.
Spark can push down filters to Parquet files, allowing filters to be applied while reading so that non-matching row groups are skipped (predicate pushdown).
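A PySpark sketch of the pushdown point (the path and column names are illustrative; assumes a local Spark session):

from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("parquet-demo").getOrCreate()

df = spark.range(1_000_000).withColumnRenamed("id", "user_id")
df.write.mode("overwrite").parquet("/tmp/users_parquet")

# Only the needed column is read (columnar layout) and the filter is pushed
# down to the Parquet reader, so non-matching row groups can be skipped.
filtered = (spark.read.parquet("/tmp/users_parquet")
            .select("user_id")
            .filter("user_id > 999000"))
filtered.explain()      # the FileScan node lists PushedFilters for user_id
print(filtered.count()) # 999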
I applied via Recruitment Consultant and was interviewed in Nov 2024. There were 2 interview rounds.
Different types of joins available in Databricks include inner join, outer join, left join, right join, and cross join.
Inner join: Returns only the rows that have matching values in both tables.
Outer join: Returns all rows when there is a match in either table.
Left join: Returns all rows from the left table and the matched rows from the right table.
Right join: Returns all rows from the right table and the matched rows from the left table.
Cross join: Returns the Cartesian product of the two tables.
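A short PySpark sketch of these join types (it runs the same way in a Databricks notebook; the data is made up):

from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("join-demo").getOrCreate()

emp  = spark.createDataFrame([(1, "asha"), (2, "ravi"), (3, "meera")], ["dept_id", "name"])
dept = spark.createDataFrame([(1, "Sales"), (2, "HR"), (4, "Finance")], ["dept_id", "dept"])

emp.join(dept, "dept_id", "inner").show()  # only dept_id 1 and 2 match
emp.join(dept, "dept_id", "left").show()   # all employees, NULL dept for dept_id 3
emp.join(dept, "dept_id", "right").show()  # all departments, NULL name for dept_id 4
emp.join(dept, "dept_id", "outer").show()  # everything from both sides
emp.crossJoin(dept).show()                 # 3 x 3 = 9-row Cartesian product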
Implementing fault tolerance in a data pipeline involves redundancy, monitoring, and error handling.
Use redundant components to ensure continuous data flow
Implement monitoring tools to detect failures and bottlenecks
Set up automated alerts for immediate response to issues
Design error handling mechanisms to gracefully handle failures
Use checkpoints and retries to ensure data integrity
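A generic Python sketch of the retry-and-checkpoint idea (the step functions and paths are placeholders, not a specific library):

import json
import time

def run_with_retries(step, *args, attempts=3, backoff=2.0):
    """Retry a pipeline step with exponential backoff before giving up."""
    for attempt in range(1, attempts + 1):
        try:
            return step(*args)
        except Exception:                      # in practice, catch narrower error types
            if attempt == attempts:
                raise                          # surface the failure so alerting can fire
            time.sleep(backoff ** attempt)     # wait, then retry

def save_checkpoint(path, last_offset):
    """Persist progress so a restart resumes instead of reprocessing everything."""
    with open(path, "w") as f:
        json.dump({"last_offset": last_offset}, f)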
AutoLoader is a feature in data engineering that automatically loads data from various sources into a data warehouse or database.
Automates the process of loading data from different sources
Reduces manual effort and human error
Can be scheduled to run at specific intervals
Examples: Apache Nifi, AWS Glue
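If the question refers specifically to Databricks Auto Loader, a minimal streaming-ingest sketch looks roughly like this (paths and the target table are placeholders; assumes a Databricks notebook where spark is already defined):

stream = (spark.readStream
          .format("cloudFiles")                              # Auto Loader source
          .option("cloudFiles.format", "json")               # format of the incoming files
          .option("cloudFiles.schemaLocation", "/mnt/checkpoints/schema")
          .load("/mnt/raw/events/"))                         # source directory to watch

(stream.writeStream
       .option("checkpointLocation", "/mnt/checkpoints/events")
       .trigger(availableNow=True)                           # process new files, then stop
       .toTable("bronze.events"))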
To connect to different services in Azure, you can use Azure SDKs, REST APIs, Azure Portal, Azure CLI, and Azure PowerShell.
Use Azure SDKs for programming languages like Python, Java, C#, etc.
Utilize REST APIs to interact with Azure services programmatically.
Access and manage services through the Azure Portal.
Leverage Azure CLI for command-line interface interactions.
Automate tasks using Azure PowerShell scripts.
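A small Python SDK sketch for the first option, connecting to Blob Storage (the account URL and container name are placeholders):

from azure.identity import DefaultAzureCredential
from azure.storage.blob import BlobServiceClient

# DefaultAzureCredential picks up a managed identity, environment variables,
# or an interactive login, depending on where the code runs.
credential = DefaultAzureCredential()
service = BlobServiceClient(
    account_url="https://<storage-account>.blob.core.windows.net",
    credential=credential,
)

container = service.get_container_client("raw-data")
for blob in container.list_blobs():
    print(blob.name)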
Linked Services are connections to external data sources or destinations in Azure Data Factory.
Linked Services define the connection information needed to connect to external data sources or destinations.
They can be used in Data Factory pipelines to read from or write to external systems.
Examples of Linked Services include Azure Blob Storage, Azure SQL Database, and Amazon S3.
I applied via Recruitment Consultant and was interviewed in Nov 2024. There were 2 interview rounds.
Data Analyst: 31 salaries, ₹4 L/yr - ₹8.5 L/yr
Analyst: 13 salaries, ₹4 L/yr - ₹10 L/yr
Analytics Consultant: 9 salaries, ₹5 L/yr - ₹9.1 L/yr
Digital Media Analyst: 6 salaries, ₹4 L/yr - ₹5 L/yr
Consultant: 6 salaries, ₹8 L/yr - ₹13 L/yr