Infosys Data Engineer Interview Questions and Answers

Updated 19 Sep 2024

Q1. What are Python DataFrames, and where and how have you used them in a project?

Ans.

DataFrames organize and manipulate data in a tabular format of labeled rows and columns.

  • DataFrames are provided by the pandas library in Python.

  • They allow for easy manipulation of data, such as filtering, sorting, and grouping.

  • Dataframes can be used in various projects, such as data analysis, machine learning, and data visualization.

  • Examples of using dataframes include analyzing sales data, predicting customer behavior, and visualizing stock market trends.
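
The filtering, sorting, and grouping mentioned above can be sketched with pandas as follows. The sales records and column names (`region`, `product`, `amount`) are made up for illustration and do not come from any real project.

```python
import pandas as pd

# Hypothetical sales records; the column names are illustrative only.
sales = pd.DataFrame(
    {
        "region": ["North", "South", "North", "East", "South"],
        "product": ["A", "B", "A", "C", "A"],
        "amount": [120.0, 80.0, 200.0, 50.0, 150.0],
    }
)

# Filtering: keep rows with an amount of at least 100.
large_orders = sales[sales["amount"] >= 100]

# Sorting: order by amount, largest first.
sorted_sales = sales.sort_values("amount", ascending=False)

# Grouping: total amount per region.
totals_by_region = sales.groupby("region")["amount"].sum()

print(large_orders)
print(sorted_sales.head(3))
print(totals_by_region)
```

In an interview answer, it usually helps to tie each operation to a concrete step in the project, for example filtering out invalid rows before aggregating.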

Q2. Describe a project you handled in your last organisation.

Ans.

Developed a data pipeline to ingest, process, and analyze customer feedback data for product improvement.

  • Designed and implemented ETL processes to extract data from various sources

  • Utilized Apache Spark for data processing and analysis

  • Built data visualizations to present insights to stakeholders

Q3. What is the architecture of Spark?

Ans.

Spark uses a master-worker architecture: a driver program coordinates executors running on worker nodes, with resources allocated by a cluster manager.

  • The driver program communicates with the cluster manager to obtain resources and schedules tasks on the executors.

  • The cluster manager can be Spark standalone, YARN, Kubernetes, or Mesos.

  • Worker nodes host executors, which run tasks and keep data in memory or on disk.

  • Spark can also utilize external data sources like Hadoop Distributed File System (HDFS) or Amazon S3.

  • Spark supports various APIs like SQL, Streaming, MLlib, and GraphX.
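
The division of labour between driver and workers can be illustrated with a toy analogy in plain Python. This is not Spark code and uses no Spark API: the function names (`driver`, `task`, `split_into_partitions`) are hypothetical, and a thread pool merely stands in for the executors a cluster manager would allocate.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_partitions(data, num_partitions):
    """Split a list into roughly equal, contiguous partitions."""
    size = max(1, -(-len(data) // num_partitions))  # ceiling division
    return [data[i:i + size] for i in range(0, len(data), size)]


def task(partition):
    """Work done by one 'executor' on one partition: a partial sum of squares."""
    return sum(x * x for x in partition)


def driver(data, num_workers=3):
    """Play the driver role: plan tasks, hand them to workers, combine results."""
    partitions = split_into_partitions(data, num_workers)
    # The pool stands in for executors running on worker nodes.
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        partial_results = list(pool.map(task, partitions))
    # The driver combines the partial results into the final answer.
    return sum(partial_results)


result = driver(list(range(1, 11)))
print(result)
```

The point of the analogy is only the shape of the flow: one coordinator splits the work, many workers process partitions independently, and the coordinator collects the results.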

Q4. Does a cloned table retain any privileges?

Ans.

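Whether a cloned table keeps the privileges of the original table depends on the platform, which the question does not name.

  • In Snowflake, for example, a cloned table does not inherit the privileges granted on the source table by default.

  • Adding COPY GRANTS to CREATE TABLE ... CLONE copies the source table's access privileges to the clone.

  • When a whole database or schema is cloned, the child objects inside the clone do inherit the privileges of their source objects, which helps keep access control consistent.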

Q5. What are examples of IaaS, PaaS, and SaaS?

Ans.

Examples of IaaS, PaaS, and SaaS include Amazon EC2 (IaaS), Google App Engine (PaaS), and Salesforce (SaaS).

  • IaaS - Infrastructure as a Service: Amazon EC2, Azure Virtual Machines, Google Compute Engine

  • PaaS - Platform as a Service: Google App Engine, Heroku, Microsoft Azure App Service

  • SaaS - Software as a Service: Salesforce, Google Workspace, Microsoft Office 365

Q6. Which ADF activities have you used?

Ans.

Commonly used Azure Data Factory (ADF) activities include Copy Data, Execute Pipeline, Lookup, and Web Activity.

  • Copy Data activity for moving data between sources and sinks

  • Execute Pipeline activity for running another pipeline from within a pipeline

  • Lookup activity for reading a row or small result set from a dataset so that later activities can use it

  • Web Activity for calling a web service or API

Q7. What is an SMB join?

Ans.

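An SMB join is a join strategy in Apache Hive for joining large tables that are bucketed and sorted on the join key.

  • SMB stands for Sort Merge Bucket.

  • It is used when joining large tables, where neither side is small enough to be held in memory.

  • Both tables must be bucketed and sorted on the join key, with the same number of buckets.

  • Because each bucket is already sorted, matching buckets are merged directly, which avoids a full shuffle and sort at query time.

  • Example: in Hive, declare both tables with CLUSTERED BY (join_key) SORTED BY (join_key) INTO n BUCKETS, then join them on join_key.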
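
The idea behind a sort-merge-bucket join can be sketched in plain Python. This is an in-memory illustration under simplifying assumptions, not an engine's actual implementation: the function names (`bucketize`, `merge_join`, `smb_join`) and the sample rows are hypothetical, and the bucketing and sorting that a real system does once at write time are done here on the fly.

```python
from collections import defaultdict


def bucketize(rows, key, num_buckets):
    """Hash rows into buckets on the join key, then sort each bucket by it."""
    buckets = defaultdict(list)
    for row in rows:
        buckets[hash(row[key]) % num_buckets].append(row)
    for bucket in buckets.values():
        bucket.sort(key=lambda r: r[key])
    return buckets


def merge_join(left, right, key):
    """Inner-join two lists that are already sorted on the join key."""
    out, i, j = [], 0, 0
    while i < len(left) and j < len(right):
        lk, rk = left[i][key], right[j][key]
        if lk < rk:
            i += 1
        elif lk > rk:
            j += 1
        else:
            # Find the run of equal keys on the right, pair it with each left row.
            j_end = j
            while j_end < len(right) and right[j_end][key] == lk:
                j_end += 1
            while i < len(left) and left[i][key] == lk:
                out.extend({**left[i], **r} for r in right[j:j_end])
                i += 1
            j = j_end
    return out


def smb_join(left_rows, right_rows, key, num_buckets=4):
    """Join bucket b of the left side only with bucket b of the right side."""
    left_b = bucketize(left_rows, key, num_buckets)
    right_b = bucketize(right_rows, key, num_buckets)
    result = []
    for b in range(num_buckets):
        result.extend(merge_join(left_b.get(b, []), right_b.get(b, []), key))
    return result


orders = [
    {"cust_id": 2, "order": "o1"},
    {"cust_id": 1, "order": "o2"},
    {"cust_id": 2, "order": "o3"},
    {"cust_id": 5, "order": "o4"},
]
customers = [
    {"cust_id": 1, "name": "Asha"},
    {"cust_id": 2, "name": "Ravi"},
    {"cust_id": 3, "name": "Meena"},
]
joined = smb_join(orders, customers, "cust_id")
print(joined)
```

Rows with the same key always hash to the same bucket number on both sides, so each bucket pair can be joined independently, which is what lets a real engine skip the shuffle.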

Q8. What is the difference between ADF and ADB?

Ans.

ADF stands for Azure Data Factory, a cloud-based data integration service. ADB stands for Azure Databricks, an Apache Spark-based analytics platform.

  • ADF is used for data integration and orchestration, while ADB is used for big data analytics and machine learning.

  • ADF provides a visual interface for building data pipelines, while ADB offers collaborative notebooks for data exploration and analysis.

  • ADF supports various data sources and destinations, while ADB is optimized for pr...

Q9. Write code to check whether a string is a palindrome.

Ans.

A palindrome is a word, phrase, number, or other sequence of characters that reads the same forward and backward.

  • Check if the string is equal to its reverse to determine if it's a palindrome.

  • Ignore spaces and punctuation when checking for palindromes.

  • Convert the string to lowercase before checking for palindromes.

  • Examples: 'racecar', 'A man, a plan, a canal, Panama'
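
The steps above can be sketched as a short Python function. The name `is_palindrome` is chosen here for illustration, and treating only letters and digits as significant is one reasonable reading of "ignore spaces and punctuation".

```python
def is_palindrome(text: str) -> bool:
    """Return True if text reads the same both ways, ignoring case,
    spaces, and punctuation."""
    # Keep only letters and digits, lowercased.
    cleaned = [ch.lower() for ch in text if ch.isalnum()]
    # A palindrome equals its own reverse.
    return cleaned == cleaned[::-1]


print(is_palindrome("racecar"))
print(is_palindrome("A man, a plan, a canal, Panama"))
print(is_palindrome("Infosys"))
```

Interviewers sometimes ask for a version without slicing. In that case, a two-pointer loop that compares characters from both ends gives the same result with constant extra space.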

Interview Process for Infosys Data Engineer

based on 22 interviews
2 Interview rounds
Technical Round - 1
Technical Round - 2

Made with ❤️ in India. Trademarks belong to their respective owners. All rights reserved © 2024 Info Edge (India) Ltd.
