Add office photos
Engaged Employer

Cognizant

3.7
based on 51.7k Reviews
Video summary
Proud winner of ABECA 2024 - AmbitionBox Employee Choice Awards
Filter interviews by

10+ Knowledge Planet Interview Questions and Answers

Updated 5 Feb 2024
Popular Designations

Q1. Datastge - How will you remove Header and trailer from Sequential data file

Ans.

To remove header and trailer from a sequential data file in Datastage.

  • Use Sequential File stage in Datastage.

  • Set the 'Skip Rows' property to the number of header rows to be skipped.

  • Set the 'Trailer Rows' property to the number of trailer rows to be skipped.

  • Use a Transformer stage to remove any remaining header or trailer rows.

  • Use the 'Remove' function in the Transformer stage to remove the rows.

View 1 answer

Q2. How to delete duplicate rows in Sql

Ans.

Deleting duplicate rows in SQL

  • Use the DISTINCT keyword in SELECT statement to retrieve unique rows

  • Use GROUP BY clause to group rows with same values and then use aggregate functions to select one row

  • Use the ROW_NUMBER() function to assign a unique number to each row and then delete the rows with duplicate numbers

View 3 more answers

Q3. How to find process id in linux

Ans.

To find process id in Linux, use the command 'ps -aux | grep '

  • Open the terminal

  • Type 'ps -aux' to list all running processes

  • Use 'grep ' to filter the process you are looking for

  • The process id (PID) will be listed in the second column

View 2 more answers

Q4. Reading Data from a .log file and finding out each column with a specific regex.

Ans.

Reading data from a .log file and extracting columns with a specific regex.

  • Use Python's built-in 're' module to define the regex pattern.

  • Open the .log file using Python's 'open' function.

  • Iterate through each line of the file and extract the desired columns using the regex pattern.

  • Store the extracted data in a data structure such as a list or dictionary.

Add your answer
Discover Knowledge Planet interview dos and don'ts from real experiences

Q5. How would you kill any job in Datastage

Ans.

To kill a job in Datastage

  • Stop the job manually from the Director client

  • Terminate the job from the command line using the dsjob command

  • Kill the job process from the operating system level

  • Delete the job from the Datastage repository

Add your answer

Q6. SORT BY ORDER BY CLUSTER BY DISTRIBUTE BY

Ans.

SORT BY, ORDER BY, CLUSTER BY, and DISTRIBUTE BY are SQL clauses used for data sorting and partitioning.

  • SORT BY is used to sort the result set in ascending or descending order based on one or more columns.

  • ORDER BY is used to sort the result set in ascending or descending order based on one or more columns. It is similar to SORT BY but can be used with other clauses like LIMIT and OFFSET.

  • CLUSTER BY is used to group data based on a specific column. It is used to improve query p...read more

Add your answer

Q7. SQL QUERIES WITH WINDOW FUNCTION

Ans.

SQL queries with window functions

  • Window functions perform calculations across a set of rows that are related to the current row

  • Common window functions include ROW_NUMBER, RANK, DENSE_RANK, and NTILE

  • Window functions are used with the OVER() clause to define the window or subset of rows to perform the calculation on

Add your answer

Q8. Optimizations I can use

Ans.

Optimizations for data engineering

  • Use indexing to speed up queries

  • Partition data to improve query performance

  • Use caching to reduce data retrieval time

  • Optimize data storage format for faster processing

  • Use parallel processing to speed up data processing

  • Optimize network bandwidth usage

  • Use compression to reduce storage and network usage

Add your answer

Q9. SMALL FILE PROBLEM

Ans.

Small file problem refers to the issue of having a large number of small files in a storage system.

  • Small files can cause inefficiencies in storage and processing.

  • Solutions include consolidating small files into larger ones or using a different storage system.

  • Examples include Hadoop's SequenceFile format and Amazon S3's object size optimization.

Add your answer

Q10. RDS VA DF VS DS

Ans.

RDS, VA, DF, VS, and DS are all acronyms related to data engineering.

  • RDS stands for Relational Database Service, a managed database service by AWS.

  • VA stands for Virtual Assistant, a software program that can assist with tasks.

  • DF stands for Dataflow, a managed service by Google Cloud for data processing.

  • VS stands for Virtual Server, a server that runs on a virtual machine.

  • DS stands for Datastore, a NoSQL document database by Google Cloud.

Add your answer

More about working at Cognizant

Top Rated Mega Company - 2024
Top Rated IT/ITES Company - 2024
HQ - Teaneck. New Jersey., United States (USA)
Contribute & help others!
Write a review
Share interview
Contribute salary
Add office photos
Interview Tips & Stories
Ace your next interview with expert advice and inspiring stories
Share an Interview
Stay ahead in your career. Get AmbitionBox app
qr-code
Helping over 1 Crore job seekers every month in choosing their right fit company
75 Lakh+

Reviews

5 Lakh+

Interviews

4 Crore+

Salaries

1 Cr+

Users/Month

Contribute to help millions

Made with ❤️ in India. Trademarks belong to their respective owners. All rights reserved © 2024 Info Edge (India) Ltd.

Follow us
  • Youtube
  • Instagram
  • LinkedIn
  • Facebook
  • Twitter