Most questions on technologies mentioned in resume, pyspark, sql , database, hadoop, spark, what are optimisation techniques in spark, what id DAG , hadoop architecture, spark architecture, Questions on Hive, can we apply indexing in hive?, Can we define primary key in hive? , Does hive support ACID properties ? , Questions on DataBricks , Azure services , ADF , Join operations on PySpark, window functions in pyspark, how to join two spark dataframe with a common column having different name , broadcast variable, mapside join, serialisation and de-serialisation, Magic commands in Dtabricks notebook, how to schedule databricks notebook, how to import variables from other notebok to master notebook, how databricks is different from other cloude platforms and why databricks? ,Azure synapse analytics , what is runtime in databricks, how do you setup cluster in databricks,what is Mapreduce in hadoop, what is default block size in HDFS and can we change it? ,What is the default size of a partition in hive? , How do you read CSV in pandas, how to extract specific columns in pandas.
Be the first one to answer
Add answer anonymously...
Popular interview questions of Big Data Engineer
>
Celebal Technologies Big Data Engineer Interview Questions
Stay ahead in your career. Get AmbitionBox app
Helping over 1 Crore job seekers every month in choosing their right fit company
65 L+
Reviews
4 L+
Interviews
4 Cr+
Salaries
1 Cr+
Users/Month
Contribute to help millions
Get AmbitionBox app