i
Decision Minds
Filter interviews by
Hive architecture is a data warehousing system built on top of Hadoop for querying and analyzing large datasets.
Hive uses a SQL-like language called HiveQL for querying data stored in Hadoop Distributed File System (HDFS).
It consists of three main components: Metastore, Driver, and Execution Engine.
Metastore stores metadata about tables and partitions in a relational database.
Driver accepts HiveQL queries, compiles the...
List and tuple are data structures in Python used to store collections of items.
Lists are mutable and can be modified after creation
Tuples are immutable and cannot be modified after creation
Lists are defined using square brackets []
Tuples are defined using parentheses ()
Lists are used for collections of similar items
Tuples are used for collections of different items
To remove duplicates, we can use various methods such as using sets, pandas, or SQL.
Using sets: Convert the list to a set and then back to a list
Using pandas: Use the drop_duplicates() method on a pandas dataframe
Using SQL: Use the DISTINCT keyword in a SELECT statement
Using hashable objects: Create a dictionary with the list elements as keys and then convert it back to a list
Hive is a data warehousing tool for Hadoop while SQL is a language used to manage relational databases.
Hive is used for big data processing while SQL is used for relational databases.
Hive uses Hadoop Distributed File System (HDFS) while SQL uses traditional storage systems.
Hive queries are written in Hive Query Language (HQL) while SQL queries are written in Structured Query Language.
Hive is optimized for batch process...
Top trending discussions
ETL stands for Extract, Transform, Load. It is a process used in data warehousing to extract data from various sources, transform it into a consistent format, and load it into a target database.
ETL stands for Extract, Transform, Load
Extract: Involves extracting data from various sources such as databases, applications, and files
Transform: Involves cleaning, filtering, and transforming the extracted data into a consiste...
Easy and moderate level question
Given a relevant topic and discussion on it
ES6 introduced several new features and improvements to JavaScript, making it more powerful and efficient.
Arrow functions for more concise syntax
Let and const for block-scoped variables
Classes for easier object-oriented programming
Template literals for easier string interpolation
Default parameters and rest parameters for function arguments
Destructuring assignment for easier data extraction
Promises for asynchronous prog
Yes, let and const variables are hoisted but not initialized.
let and const variables are hoisted to the top of their block scope, but they are not initialized until the actual line of code is executed.
This means that you cannot access a let or const variable before it is declared in the code.
For example, trying to access a let variable before it is declared will result in a ReferenceError.
Global context refers to the overall environment or setting in which something exists or operates.
Global context encompasses all variables, functions, and objects that are accessible throughout an entire program.
It can be thought of as the 'big picture' view of a program's execution.
In web development, global context includes the window object in JavaScript.
Temporal dead zone is a period during the variable creation process where accessing the variable results in a ReferenceError.
Occurs when trying to access a variable before it has been declared with let or const
Happens due to the variable being in the temporal dead zone until it is declared
Example: accessing a variable before its declaration will result in a ReferenceError
Modules in Node.js are reusable blocks of code that encapsulate related functionality.
Modules in Node.js can be built-in modules like fs (file system) or third-party modules like express.
Modules help in organizing code into separate files for better maintainability and reusability.
Modules can be imported using the require() function in Node.js.
Indexes in MongoDB improve query performance by allowing the database to quickly locate and retrieve specific documents.
Indexes help to speed up query performance by allowing the database to quickly locate specific documents based on the indexed fields.
Without indexes, MongoDB would have to perform a collection scan, which can be slow and resource-intensive.
Indexes can be created on single fields or compound fields to ...
Hooks are a feature in React that allow you to use state and other React features in functional components. useRef is a hook that allows you to create a mutable object that persists for the lifetime of the component.
Hooks are used to add state and lifecycle methods to functional components in React
useRef is used to create a mutable reference that persists between renders
useRef can be used to access DOM elements directl
Mongoose is an Object Data Modeling (ODM) library for MongoDB that provides a higher level of abstraction and simplifies interactions with the database.
Mongoose provides schema validation which helps maintain data integrity.
Mongoose simplifies querying and data manipulation with built-in functions.
Mongoose supports middleware functions for pre and post processing of data operations.
I was interviewed in Dec 2024.
Forecasting problem - Predict daily sku level sales
Bias is error due to overly simplistic assumptions, variance is error due to overly complex models.
Bias is the error introduced by approximating a real-world problem, leading to underfitting.
Variance is the error introduced by modeling the noise in the training data, leading to overfitting.
High bias can cause a model to miss relevant relationships between features and target variable.
High variance can cause a model to ...
Parametric models make strong assumptions about the form of the underlying data distribution, while non-parametric models do not.
Parametric models have a fixed number of parameters, while non-parametric models have a flexible number of parameters.
Parametric models are simpler and easier to interpret, while non-parametric models are more flexible and can capture complex patterns in data.
Examples of parametric models inc...
posted on 2 Jul 2024
Find the second largest number in an array of strings.
Convert the strings to numbers for comparison.
Sort the array in descending order.
Return the second element in the sorted array.
Find pairs in an array that sum up to a given target value.
Use a hash set to store the difference between the target value and each element in the array.
Iterate through the array and check if the current element's complement exists in the hash set.
Return the pairs that sum up to the target value.
I applied via Walk-in and was interviewed in Dec 2022. There were 3 interview rounds.
Troubleshooting network involves identifying and resolving issues related to network connectivity and performance.
Start by identifying the symptoms of the problem
Check network cables and connections
Verify network settings and configurations
Use network diagnostic tools to identify and isolate the problem
Resolve the issue by applying appropriate fixes
Test the network to ensure the problem is resolved
I applied via Naukri.com
This is the round in which basic quant questions were asked
In this round basic 2 java questions were asked
In this round one topic was given and we need to discuss about that topic
Java is a high-level, object-oriented programming language used to develop applications for various platforms.
Java is platform-independent, meaning it can run on any platform with a Java Virtual Machine (JVM)
It is known for its security features, such as the ability to run code in a sandbox environment
Java is used for developing a wide range of applications, from web applications to mobile apps and games
It is also used...
Searching is finding a specific item in a collection of items. Sorting is arranging items in a specific order.
Searching involves looking for a specific item in a collection of items.
Sorting involves arranging items in a specific order, such as alphabetical or numerical.
Searching and sorting are commonly used in computer science and data analysis.
Examples of searching algorithms include linear search and binary search.
E...
A binary tree is a data structure consisting of nodes, where each node has at most two children.
Nodes in a binary tree have a left and/or right child node.
The topmost node is called the root node.
Binary trees are used in search algorithms and data compression.
Examples of binary trees include binary search trees and heap trees.
Interview experience
based on 2 reviews
Rating in categories
Software Engineer
92
salaries
| ₹2.5 L/yr - ₹7.5 L/yr |
Senior Software Engineer
54
salaries
| ₹8.5 L/yr - ₹28 L/yr |
Technical Lead
17
salaries
| ₹12 L/yr - ₹28.3 L/yr |
Tableau Developer
16
salaries
| ₹4 L/yr - ₹15 L/yr |
Senior Tableau Developer
9
salaries
| ₹15 L/yr - ₹18 L/yr |
Fractal Analytics
Mu Sigma
Tredence
AbsolutData