i
Diggibyte Technologies
Filter interviews by
I applied via Recruitment Consulltant and was interviewed in Apr 2024. There was 1 interview round.
Nested JSON in PySpark allows for handling complex data structures within a DataFrame.
Use the `struct` function to create nested structures in PySpark DataFrames.
Access nested elements using dot notation or `getItem` function.
Use `explode` function to flatten nested arrays.
Consider using `selectExpr` for complex transformations involving nested JSON.
I applied via Naukri.com and was interviewed in Sep 2023. There was 1 interview round.
I applied via Naukri.com and was interviewed in May 2022. There were 2 interview rounds.
Spark architecture is a distributed computing framework that processes large datasets in parallel across a cluster of nodes.
Spark has a master-slave architecture with a driver program that communicates with the cluster manager to allocate resources and tasks to worker nodes.
Worker nodes execute tasks in parallel and store data in memory or disk.
Spark supports various data sources and APIs for batch processing, streamin...
DAG stands for Directed Acyclic Graph and is a way to represent dependencies between tasks. RDD stands for Resilient Distributed Datasets and is a fundamental data structure in Apache Spark.
DAG is used to represent a series of tasks or operations where each task depends on the output of the previous task.
RDD is a distributed collection of data that can be processed in parallel across multiple nodes in a cluster.
RDDs ar...
Serialization is the process of converting an object into a stream of bytes for storage or transmission.
Serialization is used to transfer objects between different applications or systems.
It allows objects to be stored in a file or database.
Serialization can be used for caching and improving performance.
Examples of serialization formats include JSON, XML, and binary formats like Protocol Buffers and Apache Avro.
Accumulators are variables used for aggregating data in Spark. GroupByKey and ReduceByKey are operations used for data transformation.
Accumulators are used to accumulate values across multiple tasks in a distributed environment.
GroupByKey is used to group data based on a key and create a pair of key-value pairs.
ReduceByKey is used to aggregate data based on a key and reduce the data to a single value.
GroupByKey is less...
Choose a cluster based on data size, complexity, and processing requirements.
Consider the size and complexity of the data to be processed.
Determine the processing requirements, such as batch or real-time processing.
Choose a cluster with appropriate resources, such as CPU, memory, and storage.
Examples of Azure clusters include HDInsight, Databricks, and Synapse Analytics.
To create mount points in ADLS, use the Azure Storage Explorer or Azure Portal. To load data source, use Azure Data Factory or Azure Databricks.
Mount points can be created using Azure Storage Explorer or Azure Portal
To load data source, use Azure Data Factory or Azure Databricks
Mount points allow you to access data in ADLS as if it were a local file system
Data can be loaded into ADLS using various tools such as Azure D...
I applied via LinkedIn and was interviewed in Mar 2022. There were 3 interview rounds.
Diggibyte Technologies interview questions for popular designations
Joining of two table using spark structure API. SQL queries like join, aggregation(avg, sum, max). word count Program.
I applied via Naukri.com and was interviewed before Oct 2022. There were 4 interview rounds.
Data structure Like List Array Stack Queue etc
Scenario based question and coding problems
Top trending discussions
I was interviewed in Jan 2025.
I was interviewed in Jan 2025.
A sequence was provided: 4181, 2684, 1597, 987, 610.
first 2 are given and write code for other value calculation using java 8
The second question required writing a reverse of a palindrome using both Java 8 streams. I was able to successfully write both and clear the first round.
Java 17 introduces sealed classes to restrict inheritance and improve code maintainability.
Sealed classes are declared using the 'sealed' keyword followed by the permitted subclasses.
Subclasses of a sealed class must be either final or sealed themselves.
Errors may occur when trying to extend a sealed class with a non-permitted subclass.
Implementation of 'notify me if item is back in stock' feature in an ecommerce application
Create a database table to store user notifications for out-of-stock items
Implement a service to check item availability and send notifications to subscribed users
Provide a user interface for users to subscribe to notifications for specific items
posted on 4 Feb 2025
I was interviewed in Jan 2025.
Yes, open for fixed term hire and working from client location at Gurgaon for 3 days a week.
Open for fixed term hire
Willing to work from client location at Gurgaon for 3 days a week
Implemented automated testing using Selenium WebDriver and JUnit in Agile environment
Implemented automated testing framework using Selenium WebDriver
Utilized JUnit for test case management
Worked in Agile environment to ensure continuous testing and integration
Pilot testing is done by a small group of users before the full release, while beta testing is done by a larger group of users. Automation testing can be used for regression testing, smoke testing, and performance testing.
Pilot testing involves a small group of users testing the functionality in a controlled environment.
Beta testing involves a larger group of users testing the functionality in a real-world environment.
...
Primary key uniquely identifies a record, while unique key allows only one instance of a value in a column. Query to find last id involves using ORDER BY and LIMIT.
Primary key enforces uniqueness and not null constraint on a column
Unique key enforces uniqueness but allows null values
To find row with last id, use ORDER BY id DESC LIMIT 1 in SQL query
Software Testing Life Cycle (STLC) involves planning, designing, executing, and reporting on tests. Defect Life Cycle includes identification, logging, fixing, and retesting defects.
STLC includes requirements analysis, test planning, test design, test execution, and test closure.
Defect Life Cycle involves defect identification, defect logging, defect fixing, defect retesting, and defect closure.
STLC ensures that the so...
303 status code in API means 'See Other'. PUT method is used to update data, while DELETE method is used to remove data. 3 point estimation technique in Agile is used to estimate tasks.
303 status code indicates that the resource can be found at a different URI and should be retrieved from there
PUT method is used to update an existing resource in the API
DELETE method is used to remove a resource from the API
3 point esti...
Links and labels that can be tagged to a bug in Jira
Links: related issues, documents, websites
Labels: priority, severity, type, status
Shell scripting is a way to automate tasks in Unix/Linux systems. Grep is used to search for specific patterns in text files. Href is not a standard Unix command.
Shell scripting automates tasks by writing scripts in a Unix/Linux environment
Grep command is used to search for specific patterns in text files
Example: grep 'search_pattern' file.txt
Href is not a standard Unix command, it may be a typo or a custom script
To resolve conflict with a team member, communication is key. Prioritize understanding, address the issue calmly, find common ground, and work towards a solution together.
Listen to the team member's perspective and concerns
Communicate openly and calmly about the issue
Find common ground and areas of agreement
Work together to find a solution that benefits both parties
Seek input from other team members or a mediator if ne
Open to relocating to Bangalore, working in night shifts, long hours, and 24X7 culture. Goal is to excel in automation testing.
Yes, open to relocating to Bangalore and working from client's office
Yes, open to working in night/rotational shifts
Yes, open to working in long extendable hours or 24X7 culture
Goal is to excel in automation testing
posted on 6 Jan 2025
I applied via Naukri.com and was interviewed in Dec 2024. There was 1 interview round.
based on 6 interviews
Interview experience
based on 17 reviews
Rating in categories
Data Engineer
27
salaries
| ₹3 L/yr - ₹9.9 L/yr |
Scrum Master
4
salaries
| ₹11 L/yr - ₹19 L/yr |
Front end Developer
4
salaries
| ₹3 L/yr - ₹12.5 L/yr |
Qliksense Developer
4
salaries
| ₹5 L/yr - ₹7.7 L/yr |
Data Scientist
3
salaries
| ₹3.7 L/yr - ₹10 L/yr |
Infosys
TCS
Wipro
HCLTech