Fractal Analytics
100+ Fractal Analytics Interview Questions and Answers
Given two strings 'S' and 'T' with lengths 'M' and 'N', find the length of their Longest Common Subsequence.
For a string 'str' of length K, the subsequences are the strings c...read more
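A standard dynamic-programming sketch of this problem (the function name is illustrative):

```python
def lcs_length(s: str, t: str) -> int:
    """Length of the Longest Common Subsequence of s and t, O(M*N) DP."""
    m, n = len(s), len(t)
    # dp[i][j] = LCS length of s[:i] and t[:j]
    dp = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            if s[i - 1] == t[j - 1]:
                dp[i][j] = dp[i - 1][j - 1] + 1
            else:
                dp[i][j] = max(dp[i - 1][j], dp[i][j - 1])
    return dp[m][n]

print(lcs_length("abcde", "ace"))  # → 3  (the subsequence "ace")
```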
You are working as a cab driver. Your car moves in a straight line and moves in the forward direction only. Initially, you have 'C' empty seats for the passengers.
Now, you are given 'N' number o...read more
The Nth term of the Fibonacci series, F(n), is calculated using the following formula -
F(n) = F(n-1) + F(n-2), where F(1) = F(2) = 1
Provided N you have to find out the ...read more
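A minimal iterative sketch under the stated base cases:

```python
def fib(n: int) -> int:
    """Nth Fibonacci number with F(1) = F(2) = 1, computed iteratively."""
    a, b = 1, 1
    for _ in range(n - 1):
        a, b = b, a + b
    return a

print(fib(10))  # → 55
```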
In one box there are 12 red and 12 green balls, and in another box there are 24 red and 24 green balls.
You choose two balls, one from each box, with replacement, such that they have the s...read more
Q5. What is truth? The one you have been taught, or the one you learn yourself? For example, your parents teaching you not to cut nails at night, or to go to the temple.
Truth is subjective and can be influenced by personal experiences and cultural beliefs.
Truth is not always objective or universal
It can be shaped by personal experiences and cultural beliefs
What is considered true in one culture may not be true in another
Truth can also change over time as new information is discovered
For example, the belief that the earth was flat was once considered true, but is now known to be false
Q6. What are adstock, decay, and due-to contributions? How do you evaluate the model? What is seasonality and the formula for seasonality? Does seasonality have any contribution?
Explanation of adstock, decay, due-to contributions, and seasonality with its formula.
Adstock is the measure of the lasting impact of advertising on consumer behavior.
Decay refers to the reduction in the effectiveness of advertising over time.
Due-to contributions is the attribution of sales to different marketing channels.
Seasonality is the pattern of sales or other metrics that repeat over a fixed period of time.
The formula for seasonality is (Value in Period / Average Value of...read more
Q7. In a word count spark program which command will run on driver and which will run on executor
Commands that run on driver and executor in a word count Spark program.
The command to read the input file and create RDD will run on driver.
The command to split the lines and count the words will run on executor.
The command to aggregate the word counts and write the output will run on driver.
Driver sends tasks to executors and coordinates the overall job.
Executor processes the tasks assigned by the driver.
Q8. What is an immutable object and why is it useful?
ACID properties in DBMS.
Explain atomicity, and what is D in ACID?
Q9. What are the key features and functionalities of Snowflake?
Snowflake is a cloud-based data warehousing platform known for its scalability, performance, and ease of use.
Snowflake uses a unique architecture called multi-cluster, which separates storage and compute resources for better scalability and performance.
It supports both structured and semi-structured data, allowing users to work with various data types.
Snowflake offers features like automatic scaling, data sharing, and built-in support for SQL queries.
It provides a web interfa...read more
Q10. Okay. How many people attended StanChart Mumbai Marathon?
The number of attendees at StanChart Mumbai Marathon is not available.
Data on the number of attendees is not available.
The organizers have not released any official figures.
It is unclear how many people participated in the marathon.
Q11. Top 5 best things that happened during the COVID lockdown for you.
Byju's business model.
Why is it better to have online coaching classes?
Q12. 1. Describe one of your projects in detail. 2. Explain Random Forest and other ML models 3. Statistics
Developed a predictive model for customer churn using Random Forest algorithm.
Used Python and scikit-learn library for model development
Performed data cleaning, feature engineering, and exploratory data analysis
Tuned hyperparameters using GridSearchCV and evaluated model performance using cross-validation
Random Forest is an ensemble learning method that builds multiple decision trees and combines their predictions
Other ML models include logistic regression, support vector mac...read more
Q13. What are the important documents to be submitted during the RFP.
The important documents to be submitted during the RFP include proposal, pricing information, technical specifications, and references.
Proposal: A detailed document outlining the solution being offered, including the approach, methodology, and deliverables.
Pricing Information: A breakdown of the costs associated with the proposed solution, including any licensing fees, implementation costs, and ongoing maintenance fees.
Technical Specifications: Detailed information about the ...read more
Q14. How many tube lights are there in the city of Mumbai
It is not possible to accurately determine the number of tube lights in the city of Mumbai.
The number of tube lights in a city is not publicly available information.
The city of Mumbai has a large population and a vast number of buildings, making it impossible to count all the tube lights.
The number of tube lights can vary greatly depending on factors such as residential, commercial, and industrial areas.
Even if we consider an average number of tube lights per household or bui...read more
Q15. The puzzle of two cans of 3 litres and 5 litres used to measure other values (a very common one).
The two cans of 3 litres and 5 litres can be used to measure other volumes, for example 4 litres.
Fill the 5L can and pour it into the 3L can, leaving 2L in the 5L can
Empty the 3L can and pour the 2L from the 5L can into it
Fill the 5L can again and top up the 3L can (which takes 1L), leaving 4L in the 5L can
A total of 4L can be measured using these two cans
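The pouring sequence can be checked with a small simulation (a sketch; the function name is illustrative):

```python
def measure_four_litres() -> int:
    """Simulate the pouring steps; returns litres left in the 5L can."""
    big, small = 5, 0  # fill the 5L can
    pour = min(big, 3 - small); big -= pour; small += pour  # pour into 3L: 2L left
    small = 0                                               # empty the 3L can
    pour = min(big, 3 - small); big -= pour; small += pour  # move the 2L across
    big = 5                                                 # refill the 5L can
    pour = min(big, 3 - small); big -= pour; small += pour  # top up 3L (takes 1L)
    return big

print(measure_four_litres())  # → 4
```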
Q16. How can you derive % share in Power BI?
To derive % share in Power BI, use the 'Group By' function and create a measure using the 'Divide' function.
Use the 'Group By' function to group the data by the desired category
Create a measure using the 'Divide' function to calculate the percentage share
Add the measure to a visual to display the % share
Example: % Share = DIVIDE(SUM(Sales[Revenue]), CALCULATE(SUM(Sales[Revenue]), ALL(Sales)))
Q17. CASE: There is a beach with uniformly distributed customers, you know that if you set up a stall there a competitor will appear. Where would you put your stall?
Q18. How can you find the common elements between two strings in python?
Finding common elements between two strings in Python.
Convert the strings into sets and use the intersection method to find common elements.
Iterate through each character in one string and check if it exists in the other string.
Use the difflib library to find the longest common substring between two strings.
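The set-intersection approach can be sketched as:

```python
def common_chars(a: str, b: str) -> set:
    """Characters that appear in both strings, via set intersection."""
    return set(a) & set(b)

print(sorted(common_chars("python", "typhoon")))  # → ['h', 'n', 'o', 'p', 't', 'y']
```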
Q19. How many mobile phones are sold each year in India
Approximately 150-200 million mobile phones are sold each year in India.
India is the second-largest smartphone market in the world after China.
The number of mobile phone users in India is expected to reach 1.25 billion by 2020.
The Indian smartphone market grew by 7% YoY in 2019.
Major players in the Indian smartphone market include Xiaomi, Samsung, and Vivo.
Q20. So why should we visit temples?
Visiting temples can provide spiritual and cultural experiences, as well as a sense of community and peace.
Temples offer a space for prayer and meditation
They can provide a sense of community and belonging
Visiting temples can offer cultural and historical insights
Many temples have beautiful architecture and artwork
Temples can provide a peaceful and calming environment
Some people believe that visiting temples can bring good luck or blessings
Q21. What metrics will you look at if you need to analyse how a retail manufacturer is performing?
Metrics to analyze retail manufacturer performance
Sales revenue
Profit margin
Inventory turnover
Customer satisfaction
Market share
Return on investment
Employee turnover rate
Q22. Joins in SQL; modelling and visualization in Power BI
Answering about joins in SQL and modeling/visualization in PowerBI
Joins in SQL are used to combine data from two or more tables based on a related column
There are different types of joins such as inner join, left join, right join, and full outer join
PowerBI is a data visualization tool that allows users to create interactive reports and dashboards
Data modeling in PowerBI involves creating relationships between tables and defining measures and calculated columns
Visualization i...read more
Q23. Cumulative sum and rank functions in spark
Explanation of cumulative sum and rank functions in Spark
Cumulative sum function calculates the running total of a column
Rank function assigns a rank to each row based on the order of values in a column
Both functions can be used with window functions in Spark
Example: df.withColumn('cumulative_sum', F.sum('column').over(Window.orderBy('order_column').rowsBetween(Window.unboundedPreceding, Window.currentRow)))
Example: df.withColumn('rank', F.rank().over(Window.orderBy('column')))
Q24. Have you worked on Data Analytics RFPs in the past?
Yes, I have worked on several Data Analytics RFPs in the past.
I have experience in analyzing large datasets and providing insights to clients
I have worked on RFPs for clients in various industries such as finance, healthcare, and retail
I have collaborated with cross-functional teams to develop proposals that meet client requirements
Q25. Explain Transformers and how they differ from previous architectures such as RNNs and LSTMs.
Transformers are a type of neural network architecture that utilizes self-attention mechanisms to process sequential data.
Transformers use self-attention mechanisms to weigh the importance of different input elements, allowing for parallel processing of sequences.
Unlike RNNs and LSTMs, Transformers do not rely on sequential processing, making them more efficient for long-range dependencies.
Transformers have been shown to outperform traditional RNNs and LSTMs in tasks such as ...read more
Q26. Slowly changing data handling in Spark
Slowly changing data handling in Spark involves updating data over time.
Slowly changing dimensions (SCD) are used to track changes in data over time.
SCD Type 1 updates the data in place, overwriting the old values.
SCD Type 2 creates a new record for each change, with a start and end date.
SCD Type 3 adds a new column to the existing record to track changes.
Spark provides functions like `from_unixtime` and `unix_timestamp` to handle timestamps.
Q27. What is the difference between canvas app and model driven apps
Canvas apps allow for more customization and flexibility in design, while model-driven apps are more structured and data-driven.
Canvas apps are more visually appealing and customizable, allowing users to drag and drop elements to create the app interface.
Model-driven apps are more structured and data-driven, with a focus on displaying and manipulating data from a data source.
Canvas apps are better suited for scenarios where the user interface design is a priority, while model...read more
Q28. What is one learning from it?
One learning from what?
Please provide context or specify what 'it' refers to
Without context, it is impossible to provide a meaningful answer
Q29. Why do you prefer Azure cloud solution as recommendations for Data Engineering pipelines? Explain data pipelines scenario you managed in the project?
I prefer Azure cloud solution for Data Engineering pipelines due to its scalability, reliability, and integration with other Microsoft services.
Azure provides a wide range of tools and services specifically designed for data engineering tasks, such as Azure Data Factory, Azure Databricks, and Azure HDInsight.
Azure offers seamless integration with other Microsoft services like Power BI, SQL Server, and Azure Machine Learning, making it easier to build end-to-end data pipelines...read more
Q30. What are all the tools used in the whole life cycle of MLOps, and were you involved in ML engineering as well?
Q31. How much do you know about fractals?
Fractals are complex geometric shapes that can be split into parts, each of which is a reduced-scale copy of the whole.
Fractals exhibit self-similarity, meaning they look similar at any scale or magnification.
Examples of fractals include the Mandelbrot set, Koch snowflake, and Sierpinski triangle.
Fractals are used in various fields such as mathematics, computer graphics, and art.
Q32. Difference between Spark and MapReduce; Spark joins such as broadcast and sort-merge
Spark is faster than MapReduce due to in-memory processing and DAG execution.
Spark uses DAG (Directed Acyclic Graph) execution while MapReduce uses batch processing.
Spark performs in-memory processing while MapReduce writes to disk after each operation.
Spark has a more flexible programming model with support for multiple languages.
Spark has built-in libraries for machine learning, graph processing, and stream processing.
MapReduce is better suited for batch processing of large...read more
Q33. How is the RFP process set up in your current org?
The RFP process in my current org involves a cross-functional team approach.
The RFP is received by the sales team and then assigned to a cross-functional team.
The team includes representatives from sales, product, engineering, legal, and finance.
The team reviews the RFP and determines if the company can meet the requirements.
If the decision is made to respond, the team works together to create a proposal.
The proposal is reviewed and approved by senior management before submis...read more
Q34. What are different types of Attention?
Different types of Attention include self-attention, global attention, and local attention.
Self-attention focuses on relationships within the input sequence itself.
Global attention considers the entire input sequence when making predictions.
Local attention only attends to a subset of the input sequence at a time.
Examples include Transformer's self-attention mechanism, Bahdanau attention, and Luong attention.
Q35. What was the challenge in end-to-end product delivery and implementation of solution roadmap?
The challenge in end-to-end product delivery and implementation of solution roadmap involved coordinating multiple teams, managing dependencies, and ensuring alignment with business goals.
Coordinating cross-functional teams to ensure timely delivery of each component of the product
Managing dependencies between different teams and components to avoid delays
Ensuring alignment of the solution roadmap with the overall business goals and objectives
Handling unexpected challenges an...read more
Q36. What are the evidences?
The evidences refer to the proof or supporting facts that validate a claim or argument.
Evidences can be in the form of data, statistics, research studies, expert opinions, eyewitness accounts, etc.
For example, in a court case, evidences can include DNA samples, fingerprints, and witness testimonies.
In scientific research, evidences can include experimental data, peer-reviewed studies, and expert analysis.
In journalism, evidences can include interviews, documents, and photogra...read more
Q37. write SQL queries for given scenario
Writing SQL queries for a given scenario
Use SELECT statement to retrieve data from tables
Use WHERE clause to filter data based on specific conditions
Use JOIN clause to combine data from multiple tables
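A runnable sketch of these three clauses together, using Python's built-in sqlite3; the schema and data are hypothetical:

```python
import sqlite3

# In-memory database with two illustrative tables.
con = sqlite3.connect(":memory:")
con.executescript("""
    CREATE TABLE employees (id INTEGER, name TEXT, dept_id INTEGER);
    CREATE TABLE departments (id INTEGER, name TEXT);
    INSERT INTO employees VALUES (1, 'Asha', 10), (2, 'Ravi', 20), (3, 'Meera', 10);
    INSERT INTO departments VALUES (10, 'Analytics'), (20, 'Engineering');
""")

# SELECT + JOIN + WHERE combined in one query.
rows = con.execute("""
    SELECT e.name, d.name
    FROM employees e
    JOIN departments d ON e.dept_id = d.id
    WHERE d.name = 'Analytics'
    ORDER BY e.name
""").fetchall()
print(rows)  # → [('Asha', 'Analytics'), ('Meera', 'Analytics')]
```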
Q38. Explain any FMCG MMM (marketing mix modelling) project you have done
I have implemented the FMCG MMM model for a leading consumer goods company to analyze the impact of marketing activities on sales.
Used historical sales data, marketing spend, and external factors to build the model
Identified key drivers of sales performance and optimized marketing strategies
Evaluated the effectiveness of different marketing channels and campaigns
Provided actionable insights to improve ROI and drive revenue growth
Q39. sum and sumx in power bi
Sum and SumX are DAX functions used in Power BI to calculate the sum of values in a column or table.
Sum calculates the sum of values in a column or table.
SumX calculates the sum of an expression evaluated for each row in a table.
Both functions can be used in measures and calculated columns.
Example: Sum(Sales[Revenue]) calculates the total revenue for the Sales table.
Example: SumX(Orders, [Quantity]*[Price]) calculates the total sales for each order in the Orders table.
Q40. How did you implement end to end MLOps (Dev and Deployment)
Q41. What genre of books?
I enjoy reading a variety of genres, including mystery, science fiction, and historical fiction.
Mystery
Science fiction
Historical fiction
Q42. System design tradeoffs and basic principles
System design tradeoffs involve balancing various factors to optimize performance and efficiency.
Consider scalability, reliability, latency, and cost when designing systems
Tradeoffs may involve sacrificing one aspect for the benefit of another
Examples include choosing between consistency and availability in distributed systems
Q43. What do you know about the sales cycle?
Sales cycle refers to the process of selling a product or service from initial contact with a potential customer to closing the deal.
Sales cycle involves identifying potential customers
Qualifying leads to determine if they are a good fit for the product or service
Presenting the product or service to the potential customer
Handling objections and negotiating terms
Closing the deal and following up with the customer
Sales cycle can vary in length depending on the complexity of the...read more
Q44. What is the scope of GA3 (gibberellic acid)?
GA3 is a plant hormone that regulates various growth processes in plants.
Regulates seed germination
Promotes stem elongation
Influences flowering and fruit development
Used in agriculture to increase crop yield
Can be applied externally to plants to induce growth responses
Q45. Why fractal?
Fractals offer a unique way to understand complex patterns and structures in nature and mathematics.
Fractals can be found in natural phenomena such as snowflakes, coastlines, and ferns.
They have practical applications in computer graphics, data compression, and cryptography.
Fractal geometry provides a new perspective on understanding the behavior of complex systems.
Fractals have been used to model the growth of tumors and the spread of diseases in medical research.
The study o...read more
Q46. Why is Informatica Cloud better than the Azure cloud solution?
Informatica Cloud offers more comprehensive data integration capabilities compared to Azure Cloud.
Informatica Cloud provides a wide range of data integration tools and services for various data sources and formats.
Informatica Cloud offers advanced data quality and data governance features that are not available in Azure Cloud.
Informatica Cloud has a strong focus on data security and compliance, with built-in encryption and access controls.
Informatica Cloud has a user-friendly...read more
Q47. What kind of data modelling you worked with GenAI project?
I have experience working with data modelling in the GenAI project to optimize algorithms and improve performance.
Utilized various data modelling techniques to analyze and interpret data
Developed predictive models to enhance decision-making processes
Collaborated with data scientists to refine and validate models
Implemented machine learning algorithms to improve accuracy and efficiency
Q48. What are window functions in SQL
Window functions in SQL are used to perform calculations across a set of table rows related to the current row.
Window functions are used to calculate values based on a set of rows related to the current row.
They allow you to perform calculations without grouping the rows into a single output row.
Examples of window functions include ROW_NUMBER(), RANK(), DENSE_RANK(), and NTILE().
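A small illustration using Python's built-in sqlite3 (window functions need SQLite 3.25 or later); the table and data are hypothetical:

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.executescript("""
    CREATE TABLE scores (name TEXT, score INTEGER);
    INSERT INTO scores VALUES ('a', 90), ('b', 90), ('c', 80);
""")

# RANK leaves gaps after ties; DENSE_RANK does not.
rows = con.execute("""
    SELECT name,
           RANK()       OVER (ORDER BY score DESC) AS rnk,
           DENSE_RANK() OVER (ORDER BY score DESC) AS drnk
    FROM scores ORDER BY score DESC, name
""").fetchall()
print(rows)  # → [('a', 1, 1), ('b', 1, 1), ('c', 3, 2)]
```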
Q49. Reason to switch
Seeking new challenges and opportunities for growth
Desire to work on more diverse projects
Opportunity for career advancement
Seeking a better work-life balance
Interested in learning new skills or technologies
Q50. Difference between Data scientist, ML and AI
Data scientists analyze data to gain insights, machine learning (ML) involves algorithms that improve automatically through experience, and artificial intelligence (AI) refers to machines mimicking human cognitive functions.
Data scientists analyze large amounts of data to uncover patterns and insights.
Machine learning involves developing algorithms that improve automatically through experience.
Artificial intelligence refers to machines performing tasks that typically require ...read more
Q51. Extract the first letter of the first name in a column, using any data manipulation package supported on Databricks
The function to extract the first letter of the firstname in a column varies based on the data manipulation package used.
Use SUBSTR function in SQL
Use str_extract function in R
Use the substring function in PySpark, or string slicing in plain Python
Q52. What are the underlying assumptions of logistic regression?
Q53. How is seasonality calculated?
Seasonality is calculated by analyzing historical data to identify recurring patterns or trends that occur at specific times of the year.
Identify historical data for a specific time period (e.g. monthly, quarterly)
Use statistical methods such as moving averages or regression analysis to analyze the data
Look for patterns or trends that repeat at the same time each year
Calculate the average or percentage change in data points during specific time periods
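The ratio-to-average step can be sketched as follows (the data is hypothetical):

```python
def seasonal_indices(values):
    """Seasonal index per period = value in period / average value overall."""
    avg = sum(values) / len(values)
    return [round(v / avg, 2) for v in values]

# Hypothetical quarterly sales: Q1 dips, Q4 peaks.
print(seasonal_indices([80, 100, 100, 120]))  # → [0.8, 1.0, 1.0, 1.2]
```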
Q54. Difference between GPT and BERT model
GPT is a generative model while BERT is a transformer model for natural language processing.
GPT is a generative model that predicts the next word in a sentence based on previous words.
BERT is a transformer model that considers the context of a word by looking at the entire sentence.
GPT is unidirectional, while BERT is bidirectional.
GPT is better for text generation tasks, while BERT is better for understanding the context of words in a sentence.
Q55. Documents for foreign remittance
Documents required for foreign remittance include invoices, purchase orders, and wire transfer instructions.
Invoices for goods or services being paid for
Purchase orders to verify the transaction
Wire transfer instructions to ensure proper routing of funds
Proof of identification for both parties involved
Any necessary customs or tax documents
Documentation of any applicable fees or charges
Q56. Tell me about the accounts payable process.
Accounts payable process involves receiving, verifying, and processing invoices for payment.
Receive invoices from vendors
Verify the accuracy of invoices against purchase orders and receipts
Code and enter invoices into accounting system
Obtain approval for payment
Schedule payments and issue checks or electronic payments
Reconcile vendor statements
Maintain accurate records and files
Q57. How will you resolve conflicts?
I will address conflicts by actively listening, seeking common ground, and collaborating on solutions.
Actively listen to all parties involved to understand their perspectives
Seek common ground and areas of agreement to build consensus
Collaborate with the team to find mutually beneficial solutions
Use conflict resolution techniques such as mediation or compromise
Focus on the issue at hand rather than personal differences
Q58. What are the types of transformation?
Types of transformations include filtering, sorting, aggregating, joining, and pivoting.
Filtering: Selecting a subset of rows based on certain criteria.
Sorting: Arranging rows in a specific order based on one or more columns.
Aggregating: Combining multiple rows into a single result, such as summing or averaging values.
Joining: Combining data from multiple sources based on a common key.
Pivoting: Restructuring data from rows to columns or vice versa.
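The first three transformation types can be illustrated with plain Python (the data is hypothetical):

```python
rows = [("north", 10), ("south", 5), ("north", 7), ("south", 3)]

filtered = [r for r in rows if r[1] > 4]      # filtering: keep rows meeting a condition
ordered = sorted(rows, key=lambda r: r[1])    # sorting: arrange by a column
totals = {}                                   # aggregating: sum per region
for region, qty in rows:
    totals[region] = totals.get(region, 0) + qty

print(filtered)  # → [('north', 10), ('south', 5), ('north', 7)]
print(totals)    # → {'north': 17, 'south': 8}
```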
Q59. Difference between group by and distinct
Group by is used to group rows that have the same values into summary rows, while distinct is used to remove duplicate rows from a result set.
Group by is used with aggregate functions like COUNT, SUM, AVG, etc.
Distinct is used to retrieve unique values from a column or set of columns.
Group by is used to perform operations on groups of rows, while distinct is used to filter out duplicate rows.
Group by is used in conjunction with SELECT statement, while distinct is used as a ke...read more
Q60. What is a data layer
A data layer is a software component that separates the data access logic from the business logic in an application.
It acts as an intermediary between the database and the application's business logic
Helps in managing data access, storage, and retrieval
Improves scalability and maintainability of the application
Examples include ORM frameworks like Hibernate in Java or Entity Framework in .NET
Q61. Design a complete mlops pipeline with all the steps in it.
Designing a complete MLOps pipeline with all the necessary steps.
Data collection and preprocessing
Model training and evaluation
Model deployment
Monitoring and feedback loop
Automated retraining
Version control and collaboration
Q62. What is logistic regression and its formula?
Q63. SQL query on 2nd largest
Use a subquery to find the 2nd largest value in a SQL table.
Use a subquery to find the maximum value in the table
Exclude the maximum value from the results to find the 2nd largest value
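The subquery approach can be sketched with Python's built-in sqlite3 (the table is hypothetical):

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.executescript("""
    CREATE TABLE salaries (amount INTEGER);
    INSERT INTO salaries VALUES (100), (300), (300), (200);
""")

# 2nd largest = the max of everything strictly below the overall max.
second = con.execute("""
    SELECT MAX(amount) FROM salaries
    WHERE amount < (SELECT MAX(amount) FROM salaries)
""").fetchone()[0]
print(second)  # → 200
```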
Q64. How do you manage client obligations?
I manage client obligations by setting clear expectations, communicating regularly, and prioritizing tasks based on deadlines and importance.
Set clear expectations with clients regarding deliverables and timelines
Communicate regularly to provide updates on progress and address any concerns
Prioritize tasks based on deadlines and importance to ensure all client obligations are met
Proactively identify potential issues and address them before they impact client obligations
Q65. Handling Jsons in python
Python provides built-in libraries like json to handle JSON data easily.
Use json module to load and parse JSON data
Use json.dumps() to convert Python objects into JSON strings
Use json.loads() to convert JSON strings into Python objects
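A minimal sketch of the round trip with the standard json module:

```python
import json

payload = '{"name": "Asha", "skills": ["SQL", "Python"]}'
obj = json.loads(payload)          # JSON string -> Python dict
obj["skills"].append("Spark")
text = json.dumps(obj, indent=2)   # Python dict -> pretty-printed JSON string
print(obj["skills"])  # → ['SQL', 'Python', 'Spark']
```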
Q66. Why Fractal, etc.?
Fractals are used in data science for analyzing complex and self-similar patterns.
Fractals are useful for analyzing data with repeating patterns at different scales.
They are used in image compression, signal processing, and financial market analysis.
Fractal analysis can help in understanding the underlying structure of data and making predictions.
Q67. What are the differences between the assumptions of linear and logistic regression?
Q68. Explain your projects in detail from data preprocessing to deployment
I have worked on various projects involving data preprocessing, model building, and deployment.
I start by cleaning and preprocessing the raw data to remove missing values and outliers.
I then perform feature engineering to create new features and select the most relevant ones for model building.
Next, I train machine learning models using algorithms like Random Forest, XGBoost, and Neural Networks.
I evaluate the models using metrics like accuracy, precision, recall, and F1 scor...read more
Q69. What is SCD and what are its types?
SCD stands for Slowly Changing Dimension. There are three types: Type 1, Type 2, and Type 3.
SCD is used in data warehousing to track changes in dimension data over time.
Type 1 SCD overwrites old data with new data, losing historical information.
Type 2 SCD creates new records for each change, preserving historical data.
Type 3 SCD keeps both old and new data in the same record, with separate columns for each version.
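A plain-Python illustration of Type 1 versus Type 2 (the rows and dates are hypothetical):

```python
# Hypothetical dimension row for a customer whose city changes.
dim = [{"id": 1, "city": "Pune", "start": "2020-01-01", "end": None}]

# SCD Type 1: overwrite in place (history is lost).
type1 = [dict(dim[0], city="Mumbai")]

# SCD Type 2: close the old row and append a new versioned row (history kept).
type2 = [dict(dim[0], end="2023-06-30"),
         {"id": 1, "city": "Mumbai", "start": "2023-07-01", "end": None}]

print(len(type1), len(type2))  # → 1 2
```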
Q70. What is Spark context
Spark context is the main entry point for Spark functionality and represents the connection to a Spark cluster.
Main entry point for Spark functionality
Represents connection to a Spark cluster
Used to create RDDs, broadcast variables, and accumulators
Q71. ML algorithms in detail
ML algorithms are tools used to analyze data, make predictions, and learn patterns from data.
ML algorithms can be categorized into supervised, unsupervised, and reinforcement learning.
Examples of supervised learning algorithms include linear regression, decision trees, and support vector machines.
Examples of unsupervised learning algorithms include k-means clustering, hierarchical clustering, and principal component analysis.
Reinforcement learning algorithms involve an agent ...read more
Q72. What are the SQL commands? Can you explain them?
SQL commands are used to interact with databases to perform various operations like querying, updating, and deleting data.
SELECT: Retrieves data from a database
INSERT INTO: Adds new records to a table
UPDATE: Modifies existing records in a table
DELETE: Removes records from a table
CREATE TABLE: Creates a new table in the database
ALTER TABLE: Modifies an existing table structure
DROP TABLE: Deletes a table from the database
Q73. Design a dashboard with pages and kpi
Design a dashboard with multiple pages and key performance indicators (KPIs)
Identify key metrics to track on the dashboard
Organize the dashboard into separate pages for different categories or departments
Use visualizations like charts, graphs, and tables to display KPIs
Include filters and interactive elements for user customization
Ensure the dashboard is user-friendly and easy to navigate
Consider the audience and their specific needs when designing the dashboard
Q74. Why does Spark use lazy execution?
Spark uses lazy execution to optimize performance by delaying computation until necessary.
Spark delays execution until an action is called to optimize performance.
This allows Spark to optimize the execution plan and minimize unnecessary computations.
Lazy evaluation helps in reducing unnecessary data shuffling and processing.
Example: Transformations like map, filter, and reduce are not executed until an action like collect or saveAsTextFile is called.
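The same principle can be illustrated with Python generators; this is an analogy, not Spark itself:

```python
# "Transformations" build a pipeline; nothing runs until an "action".
data = range(1, 6)
mapped = (x * x for x in data)            # like map: not executed yet
filtered = (x for x in mapped if x > 4)   # like filter: still not executed

result = list(filtered)  # the "action" triggers the whole pipeline at once
print(result)  # → [9, 16, 25]
```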
Q75. Why Analytics?
Analytics helps in making data-driven decisions and improving business outcomes.
Analytics provides insights into customer behavior and preferences.
It helps in identifying trends and patterns in data.
Analytics can optimize business processes and improve efficiency.
It enables businesses to make informed decisions based on data.
Analytics can help in predicting future outcomes and trends.
Examples: Predictive maintenance in manufacturing, customer segmentation in retail, fraud det...read more
Q76. Explain Sorting algorithms
Sorting algorithms are methods used to arrange elements in a specific order.
Sorting algorithms are used to rearrange elements in a specific order, such as numerical or alphabetical.
Common sorting algorithms include Bubble Sort, Selection Sort, Insertion Sort, Merge Sort, Quick Sort, and Heap Sort.
Each sorting algorithm has its own time complexity and efficiency based on the size of the input data.
Sorting algorithms can be stable (maintains the relative order of equal elements...read more
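A sketch of one stable O(n log n) algorithm, merge sort:

```python
def merge_sort(items):
    """Stable O(n log n) merge sort on a list."""
    if len(items) <= 1:
        return items
    mid = len(items) // 2
    left, right = merge_sort(items[:mid]), merge_sort(items[mid:])
    # Merge the two sorted halves, preferring the left on ties (stability).
    merged, i, j = [], 0, 0
    while i < len(left) and j < len(right):
        if left[i] <= right[j]:
            merged.append(left[i]); i += 1
        else:
            merged.append(right[j]); j += 1
    return merged + left[i:] + right[j:]

print(merge_sort([5, 2, 9, 1, 5]))  # → [1, 2, 5, 5, 9]
```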
Q77. Elaborate Cash flow statement.
Cash flow statement is a financial report that shows the inflow and outflow of cash in a business over a period of time.
It shows the sources of cash inflow and the uses of cash outflow.
It is divided into three sections: operating activities, investing activities, and financing activities.
Operating activities include cash transactions related to the day-to-day business operations.
Investing activities include cash transactions related to the purchase or sale of long-term assets...read more
Q78. Debugging a Kubernetes deployment.
Debugging a Kubernetes deployment involves identifying and resolving issues in the deployment process.
Check the deployment logs for errors and warnings
Verify the configuration files for correctness
Use kubectl commands to inspect the deployment status
Check the health of the pods and containers
Use debugging tools like kubectl exec and logs to troubleshoot issues
Q79. What is inheritance
Inheritance is a concept in object-oriented programming where a class can inherit attributes and methods from another class.
Allows for code reusability by creating a new class based on an existing class
Derived class inherits properties and behaviors of the base class
Supports the concept of polymorphism and encapsulation
Example: Class 'Car' can inherit from class 'Vehicle' and inherit its attributes like 'color' and methods like 'drive()'
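The Vehicle/Car example above can be sketched in Python (class, attribute, and method names are just the illustrative ones from the example):

```python
class Vehicle:
    def __init__(self, color):
        self.color = color

    def drive(self):
        return f"A {self.color} vehicle is driving"

class Car(Vehicle):
    """Car inherits 'color' and 'drive()' from Vehicle."""

    def drive(self):
        # Overriding the base method demonstrates polymorphism
        return f"A {self.color} car is driving"

car = Car("red")
print(car.color)    # "red" - attribute inherited from Vehicle
print(car.drive())  # "A red car is driving" - overridden method
```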
Q80. What is SparkConf
SparkConf is a configuration object used in Apache Spark to set various parameters for a Spark application.
It is used to set properties such as the application name, master URL, and other Spark settings.
A SparkConf instance is created at the start of an application and passed to the SparkContext or SparkSession builder.
Example: val sparkConf = new SparkConf().setAppName("MyApp").setMaster("local")
Q81. What is overfitting and underfitting?
Overfitting and underfitting are common problems in machine learning: an overfit model performs well on training data but poorly on unseen data, while an underfit model is too simple and performs poorly on both.
Overfitting occurs when a model learns the noise in the training data rather than the underlying pattern, leading to poor generalization on unseen data.
Underfitting happens when a model is too simple to capture the underlying structure of the data, resultin...read more
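A minimal sketch of both problems, assuming NumPy is available: the data is a quadratic with small fixed perturbations standing in for noise, so a straight line underfits while a degree-9 polynomial fits the perturbations themselves.

```python
import numpy as np

# Toy data: a quadratic plus small fixed "noise" values
x_train = np.linspace(-3, 3, 10)
noise = np.array([0.5, -0.3, 0.8, -0.6, 0.2, -0.4, 0.7, -0.2, 0.3, -0.5])
y_train = x_train**2 + noise
x_test = np.linspace(-2.7, 2.7, 50)  # held-out points, no noise
y_test = x_test**2

def mse(degree):
    """Fit a polynomial of the given degree; return (train MSE, test MSE)."""
    coeffs = np.polyfit(x_train, y_train, degree)
    train = float(np.mean((np.polyval(coeffs, x_train) - y_train) ** 2))
    test = float(np.mean((np.polyval(coeffs, x_test) - y_test) ** 2))
    return train, test

# degree 1 underfits (high error everywhere), degree 2 matches the true
# shape, degree 9 chases the noise: near-zero train error, worse test error
for degree in (1, 2, 9):
    print(degree, mse(degree))
```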
Q82. Explain invoices and the invoicing process.
Invoices are bills sent by a vendor to a customer for goods or services provided.
Invoices include details such as the vendor's name and address, the customer's name and address, the date of the invoice, a description of the goods or services provided, and the amount due.
The process of invoicing involves creating and sending the invoice to the customer, tracking the payment status, and following up on any overdue payments.
Accounts payable managers are responsible for ensuring ...read more
Q83. What is linked service
A linked service is a connection to an external data source or destination in Azure Data Factory.
Linked services define the connection information needed to connect to external data sources or destinations.
They can be used in pipelines to read from or write to the linked data source.
Examples of linked services include Azure Blob Storage, Azure SQL Database, and Salesforce.
Linked services can store connection strings, authentication details, and other configuration settings.
Q84. Main pillars of Project management
Main pillars of Project management include scope, time, cost, quality, communication, risk, and procurement.
Scope management involves defining and controlling what is included in the project.
Time management focuses on creating and maintaining a project schedule.
Cost management involves budgeting and controlling project costs.
Quality management ensures that the project meets the required standards.
Communication management involves effective communication with stakeholders.
Risk...read more
Q85. Spark optimization techniques
Spark optimization techniques improve performance and efficiency of Spark applications.
Partitioning data to reduce shuffling
Caching frequently used data
Using broadcast variables for small data
Tuning memory allocation and garbage collection
Using efficient data formats like Parquet
Avoiding unnecessary data shuffling
Using appropriate hardware configurations
Optimizing SQL queries with appropriate indexing and partitioning
Q86. Hive: partitioning vs bucketing
Hive partitioning is dividing data into smaller, manageable parts while bucketing is dividing data into equal parts based on a hash function.
Partitioning is useful for filtering data based on a specific column
Bucketing is useful for evenly distributing data for faster querying
Partitioning can be done on multiple columns, while bucketing hashes one or more columns into a fixed number of buckets
Partitioning creates separate directories for each partition while bucketing creates separate files for each bucket
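The difference can be mimicked in plain Python (a toy sketch, not Hive itself; the column names are hypothetical): partitioning groups rows by a column's value, while bucketing hashes a column into a fixed number of buckets.

```python
rows = [
    {"country": "IN", "user_id": 101},
    {"country": "US", "user_id": 102},
    {"country": "IN", "user_id": 103},
    {"country": "US", "user_id": 104},
]

# Partitioning: one "directory" per distinct value of the partition column
partitions = {}
for row in rows:
    partitions.setdefault(row["country"], []).append(row)

# Bucketing: a fixed number of "files", chosen by hashing the bucket column
NUM_BUCKETS = 2
buckets = {b: [] for b in range(NUM_BUCKETS)}
for row in rows:
    buckets[hash(row["user_id"]) % NUM_BUCKETS].append(row)

print(sorted(partitions))  # one partition per country value
print(sum(len(v) for v in buckets.values()))  # all 4 rows land in a bucket
```

Note that the number of partitions grows with the number of distinct values, whereas the number of buckets is fixed up front.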
Q87. Hive optimization techniques
Hive optimization techniques improve query performance and reduce execution time.
Partitioning tables to reduce data scanned
Using bucketing to group data for faster querying
Using vectorization to process data in batches
Using indexing to speed up lookups
Using compression to reduce storage and I/O costs
Q88. Preferred mode of working
I prefer working in a collaborative environment that encourages open communication and feedback.
I enjoy working in a team where everyone's ideas are valued and considered.
I appreciate a work culture that fosters open communication and encourages feedback.
I am comfortable working independently as well, but I believe that collaboration leads to better results.
I am adaptable and can work in different modes depending on the situation, but my preferred mode is a collaborative one.
Q89. Performance enhancements in PySpark
Performance enhancements in PySpark involve optimizing code, tuning configurations, and utilizing efficient data structures.
Use partitioning to distribute data evenly across nodes
Cache intermediate results to avoid recomputation
Optimize joins by broadcasting smaller tables
Use efficient data formats like Parquet or ORC
Tune Spark configurations for memory and parallelism
Q90. Explain prepaid expenses
Prepaid expenses are payments made in advance for goods or services that will be received in the future.
Prepaid expenses are recorded as assets on the balance sheet
They are gradually expensed over time as the goods or services are received
Examples include prepaid rent, insurance premiums, and subscriptions
Prepaid expenses are commonly used in accounting to ensure accurate financial reporting
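The gradual expensing can be sketched with hypothetical figures (12 months of rent paid up front):

```python
# Example: a year of rent paid in advance on 1 January (hypothetical figures)
prepaid_rent = 12_000          # recorded as an asset when the cash is paid
monthly_expense = prepaid_rent / 12

# Each month, part of the asset is moved to the expense account
asset_balance = prepaid_rent
expenses = []
for month in range(1, 13):
    asset_balance -= monthly_expense
    expenses.append(monthly_expense)

print(sum(expenses))   # 12000.0 expensed in total over the year
print(asset_balance)   # 0.0 - the prepaid asset is fully consumed
```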
Q91. What is dataset
A dataset is a collection of data that is organized in a structured format for easy access and analysis.
A dataset can consist of tables, files, or other types of data sources.
It is used for storing and managing data for analysis and reporting purposes.
Examples of datasets include customer information, sales data, and sensor readings.
Datasets can be structured, semi-structured, or unstructured depending on the type of data they contain.
Q92. What do you know about ESG
Q93. How does a neural network such as a CNN work?
Q94. What is partition pruning
Partition pruning is a query optimization technique that reduces the amount of data scanned by excluding irrelevant partitions.
Partition pruning is used in partitioned tables to skip scanning partitions that do not contain data relevant to the query.
It helps improve query performance by reducing the amount of data that needs to be processed.
For example, if a query filters data based on a specific partition key, partition pruning will only scan the relevant partitions instead ...read more
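The idea can be simulated in plain Python (a toy stand-in for a partitioned table, not a real query engine): a filter on the partition key lets the query touch only one of the three partitions.

```python
# Rows stored by partition key (here: a date), as a partitioned table would be
table = {
    "2024-01-01": [("a", 1), ("b", 2)],
    "2024-01-02": [("c", 3)],
    "2024-01-03": [("d", 4), ("e", 5)],
}

def query(table, partition_key):
    """With pruning: read only the partition named in the filter."""
    scanned = [partition_key] if partition_key in table else []
    rows = table.get(partition_key, [])
    return rows, scanned

rows, scanned = query(table, "2024-01-02")
print(rows)     # [('c', 3)]
print(scanned)  # only 1 of the 3 partitions was read
```

Without pruning, the same query would have to scan every partition and filter the rows afterwards.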
Q95. Implementation of the fetch API in a sandbox
Using fetch API to make requests to a sandbox environment for testing purposes.
Use the fetch function to make HTTP requests to the sandbox URL
Handle the response using promises and the .then() method
Set the appropriate headers and request method for the API endpoint
Parse the response data using JSON methods if needed
Q96. What is a namespace in Python
Namespace in Python is a system to make sure that all the names in a program are unique and can be used without any conflict.
Namespaces are containers for mapping names to objects.
Python uses namespaces to avoid naming conflicts and to create a unique space for each variable, function, etc.
There are different types of namespaces in Python such as local, global, and built-in namespaces.
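A short demonstration of separate namespaces holding independent bindings for the same name:

```python
x = "global"  # lives in the module's global namespace

def show():
    x = "local"  # a different binding in the function's local namespace
    # globals() exposes the module namespace as a dict, so both
    # bindings of the name 'x' can be read side by side
    return x, globals()["x"]

print(show())  # ('local', 'global')

# Names like len come from a third namespace: the built-in one
import builtins
print(builtins.len is len)  # True
```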
Q97. What are scopes in Python
Scopes in Python refer to the visibility of variables within a program.
Variables defined inside a function have local scope and are only accessible within that function.
Global variables can be accessed from any part of the program.
Nonlocal variables are used in nested functions to access variables from the outer function.
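A sketch covering all three cases: a local in the inner function, an enclosing variable reached via nonlocal, and a module-level variable reached via global.

```python
counter = 0  # global scope

def outer():
    value = 10  # enclosing scope for inner()

    def inner():
        nonlocal value   # rebind the variable from the enclosing function
        global counter   # rebind the module-level variable
        value += 1
        counter += 1
        return value

    return inner()

print(outer())   # 11 - inner() modified the enclosing 'value'
print(counter)   # the global was incremented too
```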
Q98. What is k means algorithm
K-means is a clustering algorithm that partitions data into k clusters based on similarity.
K-means is an unsupervised learning algorithm
It starts by randomly selecting k centroids
Data points are assigned to the nearest centroid
Centroids are recalculated based on the mean of the assigned data points
The process is repeated until convergence or a maximum number of iterations is reached
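The steps above can be sketched as a minimal 1-D k-means in plain Python (illustrative only; real workloads would use a library such as scikit-learn):

```python
import random

def kmeans_1d(points, k, iters=20, seed=0):
    """Minimal 1-D k-means: assign to nearest centroid, recompute means."""
    random.seed(seed)
    centroids = random.sample(points, k)  # step 1: random initial centroids
    for _ in range(iters):
        clusters = {c: [] for c in range(k)}
        for p in points:  # step 2: assign each point to its nearest centroid
            nearest = min(range(k), key=lambda c: abs(p - centroids[c]))
            clusters[nearest].append(p)
        # step 3: recompute each centroid as the mean of its assigned points
        centroids = [
            sum(pts) / len(pts) if pts else centroids[c]
            for c, pts in clusters.items()
        ]
    return sorted(centroids)

# Two obvious clusters around 1 and 10
print(kmeans_1d([1.0, 1.1, 0.9, 10.0, 10.2, 9.8], k=2))  # ~[1.0, 10.0]
```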
Q99. Longest substring without repeating character
Q100. Transformations in databricks
Transformations in Databricks involve manipulating data using functions like map, filter, reduce, etc.
Transformations are operations that are applied to RDDs in Databricks
Common transformations include map, filter, reduce, flatMap, etc.
Transformations are lazy evaluated and create a new RDD
Example: map transformation to convert each element in an RDD to uppercase
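Spark itself is not shown here, but Python's built-in map and filter are also lazy, which makes them a convenient stand-in for the transformation-vs-action distinction described above:

```python
log = []

def track(x):
    log.append(x)  # record when the function actually runs
    return x * x

data = [1, 2, 3, 4]
squares = map(track, data)                    # "transformation": nothing runs yet
evens = filter(lambda x: x % 2 == 0, squares)  # still nothing has run

print(log)            # [] - no work done until a result is demanded
result = list(evens)  # the "action": now the whole pipeline executes
print(result)         # [4, 16]
print(log)            # [1, 2, 3, 4] - track() ran once per element
```

In the same way, an RDD's map and filter only build a lineage graph; work happens when an action such as collect or count forces evaluation.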