Pyspark & Data bricks interview questions asked : Project explanation - architecture used, cluster configurations. What is workspace, compute, workflows Syntax to perform join operation between employees and department dept in Pyspark. Optimization techniques implemented in the code Challenges faced while executing spark codes and how did you handle. How do you schedule a python script(not notebook) in databricks. How to install external packages at notebook and cluster level in databricks How do you fetch data from a Azure sql database to databricks Can you perform updates and deletes in Azure sql database using PySpark in databricks Different methods to access data from azure storage account in databricks Security mechanisms implemented in databricks What are delta tables What are the types of delta tables present and what are the differences between them What is the file format supported by delta tables How do you permanently delete an external table in databricks by making sure that the delete operation has been entered in the delta log file Query to write second highest salary using sub query and window function How do you convert spark datsframe to pandas dataframe Difference between Pyspark and pandas Convert an rdd to dataframe Let's suppose that there is a list containing the names of the databrick notebooks. How do you schedule these notebook such that they are executed parallelly and not sequentially Does workspace come under compute or compute comes under workspace
Be the first one to answer
Add answer anonymously...
Popular interview questions of Data Engineer
>
Infinite Computer Solutions Data Engineer Interview Questions
Stay ahead in your career. Get AmbitionBox app
Helping over 1 Crore job seekers every month in choosing their right fit company
65 L+
Reviews
4 L+
Interviews
4 Cr+
Salaries
1 Cr+
Users/Month
Contribute to help millions
Get AmbitionBox app