i
Blackbuck Insights
Filter interviews by
Top trending discussions
I applied via LinkedIn and was interviewed in Apr 2024. There was 1 interview round.
Dual mode in Power BI allows users to switch between DirectQuery and Import modes for data sources.
Dual mode allows users to combine the benefits of both DirectQuery and Import modes in Power BI.
Users can switch between DirectQuery and Import modes for different data sources within the same report.
DirectQuery mode connects directly to the data source for real-time data retrieval, while Import mode loads data into Power...
I applied via Naukri.com and was interviewed in Dec 2024. There was 1 interview round.
To create a pipeline in Databricks, you can use Databricks Jobs or Apache Airflow for orchestration.
Use Databricks Jobs to create a pipeline by scheduling notebooks or Spark jobs.
Utilize Apache Airflow for more complex pipeline orchestration with dependencies and monitoring.
Leverage Databricks Delta for managing data pipelines with ACID transactions and versioning.
I applied via Campus Placement and was interviewed in May 2024. There were 2 interview rounds.
Two coding questions
I applied via Job Portal and was interviewed before May 2022. There were 2 interview rounds.
Coding test was bad. they asked irrelevant things that were not related to the field
Data engineering is the process of designing, building, and maintaining the infrastructure for data storage and processing.
Data engineering involves creating and managing data pipelines
It includes tasks such as data modeling, data integration, and data warehousing
Data engineers work with big data technologies such as Hadoop, Spark, and NoSQL databases
They also ensure data quality, security, and scalability
Examples of d...
A resilient distributed database is a database that can continue to function even if some of its nodes fail.
It is designed to be fault-tolerant and highly available.
Data is distributed across multiple nodes to ensure redundancy.
If one node fails, the database can continue to function using data from other nodes.
Examples include Apache Cassandra, Riak, and HBase.
Developed ETL pipeline to ingest, clean, and analyze customer data for personalized marketing campaigns
Gathered requirements from stakeholders to understand data sources and business objectives
Designed data model to store customer information and campaign performance metrics
Implemented ETL process using Python and Apache Spark to extract, transform, and load data
Performed data quality checks and created visualizations ...
I have used various transformations such as filtering, joining, aggregating, and pivoting in my data engineering projects.
Filtering data based on certain conditions
Joining multiple datasets together
Aggregating data to summarize information
Pivoting data from rows to columns or vice versa
I applied via Naukri.com and was interviewed in Nov 2022. There were 4 interview rounds.
Join two tables in PySpark code and DataFrame
Create two DataFrames from the tables
Specify the join condition using join() function
Select the columns to be displayed using select() function
Use show() function to display the result
posted on 29 May 2024
I applied via Campus Placement and was interviewed in Apr 2024. There were 2 interview rounds.
It was a written test where theoretical SQL questions were asked like primary key, foreign key, set operators and some queries
based on 1 interview
Interview experience
based on 5 reviews
Rating in categories
Data Engineer
90
salaries
| ₹2.5 L/yr - ₹8.1 L/yr |
Software Engineer
59
salaries
| ₹3.6 L/yr - ₹9.2 L/yr |
Senior Associate
47
salaries
| ₹12.3 L/yr - ₹28 L/yr |
Lead Consultant
31
salaries
| ₹20 L/yr - ₹37.5 L/yr |
Consultant
26
salaries
| ₹11.7 L/yr - ₹40.1 L/yr |
Cyfuture
Value Point Systems
JoulestoWatts Business Solutions
Black Knight