MathCo
40+ Soochow University Interview Questions and Answers
Q1. How will you cut a piece of cake to 8 in minimum cuts
Cut the cake in half horizontally, then vertically, and then diagonally twice.
Cut the cake in half horizontally.
Cut the cake in half vertically.
Cut the cake diagonally from the center to the edge twice.
Total cuts required: 7
Q2. Suppose there are 3 students, everyone has x y and z salaries but no one is aware of each others salary. Find the avg of there salaries
To find the average of the salaries of 3 students with unknown salaries, add x, y, and z and divide by 3.
Add the salaries of x, y, and z together
Divide the total sum by 3 to find the average salary
Q3. sort an array of integers, find the max of two numbers using sql
Sorting and finding max of two numbers in an array using SQL
Use ORDER BY to sort the array of integers
Use LIMIT 2 to get the top two numbers
Use MAX function to find the maximum of the two numbers
Q4. Explain statistical concepts like Hypothesis testing, and type 1 and type 2 errors.
Hypothesis testing is a statistical method to test a claim about a population parameter. Type 1 error is rejecting a true null hypothesis, and type 2 error is failing to reject a false null hypothesis.
Hypothesis testing involves formulating a null hypothesis and an alternative hypothesis.
Type 1 error occurs when we reject a null hypothesis that is actually true.
Type 2 error occurs when we fail to reject a null hypothesis that is actually false.
The significance level (alpha) d...read more
Q5. what is Dense rank? , explain with use cases
Dense rank is a ranking method that assigns consecutive integers to items with the same rank, leaving no gaps between ranks.
Dense rank is used when you want to assign unique ranks to items without any gaps.
For example, if three items have ranks 1, 2, and 2, dense rank would assign ranks 1, 2, and 2 respectively.
It is commonly used in scenarios where ties need to be handled by giving the same rank to tied items.
Q6. what is javas main method refers too
The main method in Java refers to the entry point of a Java program where the execution begins.
The main method must be declared as public static void main(String[] args)
It is the starting point of a Java program where the JVM looks for the main method to start execution
The main method takes an array of strings as input which can be used to pass command line arguments
Q7. what is array what is a linkedlist
An array is a data structure that stores a collection of elements of the same type in contiguous memory locations. A linked list is a data structure that consists of nodes where each node contains a data field and a reference to the next node in the sequence.
Array is static in size and elements are accessed using indices.
Linked list is dynamic in size and elements are accessed sequentially by following the references.
Example: Array - int[] arr = {1, 2, 3}; Linked list - Node ...read more
Q8. Finding the missing element in the array
Finding the missing element in an array of strings
Iterate through the array and check for any missing elements
Use a hash set to keep track of elements that have been seen
Compare the elements in the array to a reference list of all possible elements
Example: ['apple', 'banana', 'orange', 'grape'] - missing 'kiwi'
Example: ['red', 'blue', 'green', 'yellow'] - missing 'purple'
Q9. Display only those elements which do not occur more than once.
Display non-repeated elements
Iterate through the elements and count their occurrences
Display only those elements with occurrence count of 1
Use a hash table or dictionary to store the counts
Time complexity: O(n)
Q10. what is search engine optimization
Search engine optimization (SEO) is the process of improving a website's visibility on search engines like Google.
SEO involves optimizing website content, meta tags, and backlinks to improve search engine rankings.
Keyword research is essential for SEO to target relevant search terms.
Creating high-quality content and obtaining backlinks from reputable websites can boost SEO.
Regularly monitoring and adjusting SEO strategies is important to maintain and improve rankings.
Q11. What is join and it's types
Join is a SQL operation used to combine rows from two or more tables based on a related column between them.
Types of joins include INNER JOIN, LEFT JOIN, RIGHT JOIN, and FULL JOIN
INNER JOIN returns rows when there is at least one match in both tables
LEFT JOIN returns all rows from the left table and the matched rows from the right table
RIGHT JOIN returns all rows from the right table and the matched rows from the left table
FULL JOIN returns rows when there is a match in one o...read more
Q12. Pronouncation of words in a passage
The question is about pronouncing words in a passage.
Break down the passage into smaller words and syllables to pronounce them correctly.
Use pronunciation guides or dictionaries to help with difficult words.
Practice speaking the words out loud to improve pronunciation.
Ask for feedback from others to ensure correct pronunciation.
Focus on stress and intonation patterns in sentences for natural sounding speech.
Q13. What is having in sql
HAVING clause is used in SQL to filter records based on aggregate functions.
HAVING is used with GROUP BY to filter groups based on a specified condition
It is used to filter the results of aggregate functions like COUNT, SUM, AVG, etc.
It is similar to WHERE clause but operates on groups rather than individual rows
Q14. What is data engineering
Data engineering involves designing, building, and maintaining data pipelines to collect, clean, and transform data for analysis.
Data engineering focuses on the architecture and infrastructure needed to handle large volumes of data.
It involves designing and implementing data pipelines to extract, transform, and load data into storage systems.
Data engineers work with tools like Apache Spark, Hadoop, and SQL databases to process and analyze data efficiently.
They also collaborat...read more
Q15. applications of technologies in real word usecases
Technologies are used in various real-world applications such as healthcare, finance, transportation, and marketing.
Healthcare: Technologies like AI and machine learning are used for diagnosing diseases, predicting patient outcomes, and personalized medicine.
Finance: Technologies such as blockchain are used for secure transactions and fraud detection.
Transportation: Technologies like GPS and IoT are used for route optimization, traffic management, and autonomous vehicles.
Mark...read more
Q16. Difference between a list and a tuple. Explain a use case where you will use a tuple.
List vs Tuple and use case for Tuple
List is mutable while Tuple is immutable
List uses square brackets while Tuple uses parentheses
Tuple is used when data needs to be protected from modification
Tuple is also used for returning multiple values from a function
Q17. what are object and classes
Objects are instances of classes in object-oriented programming. Classes are blueprints for creating objects with attributes and methods.
Objects are instances of classes that contain data (attributes) and behavior (methods)
Classes are blueprints for creating objects with predefined attributes and methods
Objects can interact with each other by calling methods or accessing attributes
Inheritance allows classes to inherit attributes and methods from a parent class
Encapsulation en...read more
Q18. speak to speak< basic grammar corrections
The question is about basic grammar corrections.
Always check for subject-verb agreement.
Ensure proper use of punctuation marks.
Review sentence structure for clarity and coherence.
Q19. Explain how will you implement SCD Type 2.
SCD Type 2 is implemented by creating a new record for each change and maintaining a history of changes.
Identify the business key and version number for the dimension table.
Create a new record with a new version number for each change to the dimension data.
End-date the previous record and start-date the new record to maintain a history of changes.
Use effective dates to determine which record is active at any given point in time.
Example: A customer changes their address. Creat...read more
Q20. Linear regression assumptions
Linear regression assumptions include linearity, independence, homoscedasticity, and normality.
Assumption of linearity: The relationship between the independent and dependent variables is linear.
Assumption of independence: The residuals are independent of each other.
Assumption of homoscedasticity: The variance of the residuals is constant across all levels of the independent variables.
Assumption of normality: The residuals are normally distributed.
Example: In a study examinin...read more
Q21. Print an Array in java
Use a for loop to iterate through the array and print each element
Declare and initialize an array of strings
Use a for loop to iterate through the array
Print each element of the array
Q22. Explain projects
Projects are specific tasks or assignments undertaken to achieve a particular goal or objective.
Projects involve a defined scope, timeline, and budget.
They require a team of individuals with specific skills and expertise.
Projects are temporary in nature and have a clear start and end date.
Examples include developing a new software application, implementing a marketing campaign, or constructing a building.
Q23. How will you store passwords in DB
Store passwords securely using hashing algorithms and salt
Use strong hashing algorithms like bcrypt or SHA-256
Add a unique salt to each password before hashing
Store only the hashed password in the database
Regularly update hashing algorithms and salts for added security
Q24. What do you mean by topic modeling?
Topic modeling is a technique used in natural language processing to discover the hidden thematic structure in a collection of texts.
Topic modeling is a type of statistical modeling that allows for the discovery of hidden patterns in a collection of documents.
It is often used in text mining to identify topics within a set of documents without the need for manual labeling.
One popular algorithm for topic modeling is Latent Dirichlet Allocation (LDA), which assumes that each doc...read more
Q25. Leetcode 1971: Find if path exists in graph
Check if a path exists between two nodes in a graph
Use depth-first search (DFS) or breadth-first search (BFS) to traverse the graph
Start from the source node and explore all connected nodes until the destination node is reached
If the destination node is found during the traversal, return true; otherwise, return false
Q26. What is evaluated/assessed in technical round?
Technical round evaluates candidate's technical skills, problem-solving abilities, and knowledge in data science.
Coding skills - ability to write efficient code and solve problems
Data analysis skills - ability to manipulate and analyze data
Machine learning knowledge - understanding of algorithms and models
Problem-solving abilities - approach to solving complex problems
Communication skills - ability to explain technical concepts
Experience with relevant tools and technologies -...read more
Q27. SQL syntax and difference between having and where clause
HAVING clause is used with GROUP BY to filter grouped rows, WHERE clause is used to filter individual rows.
HAVING clause is used with GROUP BY to filter grouped rows based on aggregate functions
WHERE clause is used to filter individual rows based on conditions
HAVING clause is applied after GROUP BY, WHERE clause is applied before GROUP BY
HAVING clause can only be used with SELECT statement that contains a GROUP BY clause
Q28. Describe the Project Architecture End-to-End
The project architecture end-to-end includes the overall design and structure of the project from start to finish.
The architecture should include the hardware and software components, as well as the communication protocols and data flow.
It should also consider the scalability, security, and performance of the system.
Examples of project architecture include client-server, microservices, and event-driven architectures.
The architecture should be documented and communicated clear...read more
Q29. When do you prefer Decision Tree better than Random Forest model if model outputs are similar
Decision Tree is preferred over Random Forest when interpretability is important.
Decision Tree is easier to interpret and visualize than Random Forest.
Decision Tree is better suited for small datasets with few features.
Random Forest is better suited for large datasets with many features.
Random Forest is less prone to overfitting than Decision Tree.
Decision Tree can be used as a building block for Random Forest.
If interpretability is important, Decision Tree is preferred even ...read more
Q30. difference between array and linkedlist,stack and queue
Arrays store elements in contiguous memory, while linked lists use nodes with pointers. Stacks follow LIFO, queues follow FIFO.
Arrays store elements in contiguous memory locations, allowing for constant time access to elements using indices.
Linked lists use nodes with pointers to the next node, allowing for dynamic memory allocation and insertion/deletion at any position.
Stacks follow Last In First Out (LIFO) principle, where elements are added and removed from the same end (...read more
Q31. How does RNN work?
RNN stands for Recurrent Neural Network, a type of neural network designed to handle sequential data by maintaining a memory of previous inputs.
RNNs have loops within their architecture, allowing information to persist.
They are commonly used in natural language processing, speech recognition, and time series analysis.
An example of RNN is predicting the next word in a sentence based on previous words.
Q32. What is evaluated in HR Round?
HR Round evaluates the candidate's fit for the company culture, communication skills, behavioral traits, and overall professionalism.
Fit for company culture
Communication skills
Behavioral traits
Professionalism
Q33. Can you handle bulk email campaigns?
Yes, I have experience handling bulk email campaigns.
I have successfully managed and executed multiple bulk email campaigns in my previous roles.
I am proficient in using email marketing tools like Mailchimp, Constant Contact, and HubSpot.
I have experience analyzing campaign performance metrics and optimizing email content for better engagement.
Q34. What are evaluated in Round 2
Round 2 evaluates technical skills, problem-solving abilities, communication skills, and cultural fit.
Technical skills related to data analysis, machine learning, statistics, and programming languages like Python or R
Problem-solving abilities through case studies, coding challenges, or real-world data analysis tasks
Communication skills in explaining complex concepts, collaborating with team members, and presenting findings
Cultural fit with the team and organization's values, ...read more
Q35. Explain "this" in JS
In JavaScript, 'this' refers to the current object that is executing the current function.
The value of 'this' is determined by how a function is called.
When a function is called as a method of an object, 'this' refers to the object itself.
When a function is called in the global scope, 'this' refers to the global object (window in a browser, global in Node.js).
Arrow functions do not have their own 'this' value and instead inherit it from the enclosing lexical context.
Q36. Can useEffect() hook run twice?
Yes, useEffect() hook can run twice under certain conditions.
useEffect() hook can run twice if the dependencies array changes between renders.
This can happen if the dependencies array contains values that change frequently.
For example, if a state variable is included in the dependencies array and it is updated in the component, useEffect() will run again.
Q37. Project description
The project involves developing a new software application for data analysis.
Developing a software application for data analysis
Utilizing programming languages such as Python and R
Implementing statistical models and algorithms
Creating visualizations and reports for data interpretation
Q38. Database you choose for transactions
I would choose a relational database like MySQL or PostgreSQL for transactions due to their ACID compliance and strong support for transactions.
Relational databases like MySQL or PostgreSQL are commonly used for transactions due to their ACID compliance.
These databases provide strong support for transactions, ensuring data integrity and consistency.
Other options like NoSQL databases can also be considered depending on the specific requirements of the application.
Consider fact...read more
Q39. What are React Queries?
React Queries are a library for managing server state in React applications.
React Queries provide a way to fetch and cache data from an API in React components.
They offer features like caching, background fetching, and automatic refetching.
React Queries can handle complex data fetching requirements, such as pagination and infinite scrolling.
Q40. What is VIF explain it
VIF stands for Variance Inflation Factor, a measure used to detect multicollinearity in regression analysis.
VIF is a statistic that quantifies the severity of multicollinearity in a regression analysis.
It measures how much the variance of the estimated regression coefficients is increased due to multicollinearity.
A VIF value greater than 10 is often considered indicative of multicollinearity.
To calculate VIF, each predictor variable is regressed against all other predictor va...read more
Q41. How to handle failure
Handling failure involves learning from mistakes, staying positive, and adapting to challenges.
Acknowledge the failure and take responsibility for it
Analyze the reasons for the failure and learn from it
Stay positive and maintain a growth mindset
Adapt your approach and try again with a new strategy
Seek feedback from others and use it to improve
Remember that failure is a part of the learning process
Q42. Explain Spark application lifecycle.
Q43. System design for uber
System design for Uber involves creating a scalable and reliable platform for matching riders with drivers.
Use microservices architecture for scalability and fault tolerance
Implement geolocation services for real-time tracking of drivers and riders
Utilize a distributed database for storing user and trip data
Design a robust payment system for seamless transactions
Implement surge pricing algorithms to manage demand during peak hours
Top HR Questions asked in Soochow University
Interview Process at Soochow University
Top Interview Questions from Similar Companies
Reviews
Interviews
Salaries
Users/Month