CodeAstu Technologies Data Engineer Interview Questions and Answers

Updated 10 Nov 2024

CodeAstu Technologies Data Engineer Interview Experiences

2 interviews found

Data Engineer Interview Questions & Answers

Digvijay D.Rathod

posted on 10 Nov 2024

Interview experience: 3 (Average)
Difficulty level: -
Process Duration: -
Result: -
Round 1 - Technical 

(2 Questions)

  • Q1. What aldfja aldj ad fj
  • Ans. 

    The question seems to be incomplete or unclear, possibly a mistake in transcription.

    • Ask for clarification or more context from the interviewer.

    • Confirm if the question was meant to be asked in a different way.

    • Offer to provide a response based on a different question related to data engineering.

  • Answered by AI
  • Q2. What aldjf adlfj ad adf ald
Round 2 - HR 

(2 Questions)

  • Q1. Salary expectation
  • Q2. What are your Hobbies

Interview Preparation Tips

Interview preparation tips for other job seekers - flajdaldfj

Data Engineer Interview Questions & Answers

Mohit Jagtap

posted on 1 Nov 2022

Round 1 - Resume Shortlist 
Pro Tip by AmbitionBox:
Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, so make sure to leave the best impression.
Round 2 - Technical 

(2 Questions)

  • Q1. Python code with different case
  • Ans. 

    Python identifiers are case-sensitive.

    • Variables whose names differ only in case are treated as different variables

    • It is recommended to use consistent naming conventions to avoid confusion

    • Examples: 'myVar', 'myvar', and 'MYVAR' are three different variables
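
    The case-sensitivity point above can be checked with a minimal sketch. The three variable names come from the example in the answer; the assigned values and the undefined name `Myvar` are placeholders chosen only for illustration.

```python
# Three names that differ only in letter case; the values are arbitrary placeholders.
myVar = "camel"
myvar = "lower"
MYVAR = "upper"

# Each name is bound independently, so all three values coexist.
distinct_values = {myVar, myvar, MYVAR}
print(len(distinct_values))  # 3

# Referring to a casing that was never assigned raises NameError.
try:
    Myvar  # hypothetical name, deliberately never defined
except NameError as exc:
    error_message = str(exc)
    print(error_message)
```

    Sticking to one naming convention (for example snake_case for variables) avoids accidentally creating near-duplicate names like these.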

  • Answered by AI
  • Q2. SQL query on different case

Interview Preparation Tips

Interview preparation tips for other job seekers - Prepare Python thoroughly, and also focus on SQL and the other requirements


Interview questions from similar companies

Interview experience: 5 (Excellent)
Difficulty level: Moderate
Process Duration: Less than 2 weeks
Result: Not Selected

I applied via Naukri.com and was interviewed in Oct 2024. There were 2 interview rounds.

Round 1 - One-on-one 

(2 Questions)

  • Q1. Azure scenario-based questions
  • Q2. PySpark coding-based questions
Round 2 - One-on-one 

(2 Questions)

  • Q1. ADF and Databricks related questions
  • Q2. Spark performance problems and scenarios
  • Ans. 

    Spark performance problems can arise due to inefficient code, data skew, resource constraints, and improper configuration.

    • Inefficient code can lead to slow performance, such as using collect() on large datasets.

    • Data skew can cause uneven distribution of data across partitions, impacting processing time.

    • Resource constraints like insufficient memory or CPU can result in slow Spark jobs.

    • Improper configuration settings, su...

  • Answered by AI


Interview experience: 5 (Excellent)
Difficulty level: -
Process Duration: -
Result: -
Round 1 - One-on-one 

(2 Questions)

  • Q1. What is PySpark, and can you explain its features and uses?
  • Ans. 

    PySpark is a Python API for Apache Spark, used for big data processing and analytics.

    • Apache Spark itself is a fast, general-purpose cluster computing system.

    • It allows for easy integration with Python libraries and provides high-level APIs in Python.

    • PySpark can be used for processing large datasets, machine learning, real-time data streaming, and more.

    • It supports various data sources such as HDFS, ...

  • Answered by AI
  • Q2. What is the difference between PySpark and Python?
  • Ans. 

    PySpark is a Python API for Apache Spark, while Python is a general-purpose programming language.

    • PySpark is specifically designed for big data processing using Spark, while Python is a versatile programming language used for various applications.

    • PySpark distributes computation across a cluster and runs it in parallel, while plain Python code runs on a single machine by default.

    • PySpark provides libraries and tools for working wit...

  • Answered by AI
Interview experience: 2 (Poor)
Difficulty level: Moderate
Process Duration: Less than 2 weeks
Result: No response

I applied via Campus Placement and was interviewed in Oct 2024. There was 1 interview round.

Round 1 - Technical 

(2 Questions)

  • Q1. Write code using a regular expression to remove special characters
  • Ans. 

    Use regular expression to remove special characters from a string

    • Use the regex pattern [^a-zA-Z0-9\s] to match any character that is not a letter, digit, or whitespace

    • Use the replace() function in your programming language to replace the matched special characters with an empty string

    • Example: input string 'Hello! How are you?' will become 'Hello How are you' after removing special characters
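
    As a minimal sketch of the steps above in Python, `re.sub` plays the role of the replace() function mentioned in the answer. The helper name `remove_special_characters` is hypothetical and chosen only for illustration.

```python
import re

# Pattern from the answer above: any character that is not a letter, digit, or whitespace.
SPECIAL_CHARS = re.compile(r"[^a-zA-Z0-9\s]")


def remove_special_characters(text: str) -> str:
    """Return text with every character matched by SPECIAL_CHARS removed."""
    return SPECIAL_CHARS.sub("", text)


cleaned = remove_special_characters("Hello! How are you?")
print(cleaned)  # Hello How are you
```

    Note that this pattern also strips underscores and non-ASCII letters, so it would need adjusting if those should be kept.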

  • Answered by AI
  • Q2. Questions on resume
Interview experience: 4 (Good)
Difficulty level: -
Process Duration: -
Result: -
Round 1 - Technical 

(2 Questions)

  • Q1. What is Databricks? Explain its internals and optimization techniques
  • Ans. 

    Databricks is a unified analytics platform that provides a collaborative environment for data scientists, engineers, and analysts.

    • Databricks is built on top of Apache Spark, providing a unified platform for data engineering, data science, and business analytics.

    • Internals of Databricks include a cluster manager, job scheduler, and workspace for collaboration.

    • Optimization techniques in Databricks include query optimizati...

  • Answered by AI
  • Q2. SQL questions on joins and GROUP BY
Round 2 - Technical 

(2 Questions)

  • Q1. Scenario-based Azure Data Factory questions
  • Q2. Questions on project structure and PySpark DataFrames

Interview Preparation Tips

Interview preparation tips for other job seekers - Just practice fundamentals


Interview experience: 4 (Good)
Difficulty level: Easy
Process Duration: 2-4 weeks
Result: Selected

I applied via LinkedIn and was interviewed in Jun 2024. There were 3 interview rounds.

Round 1 - Coding Test 

General questions around data engineering

Round 2 - One-on-one 

(3 Questions)

  • Q1. Different Apache technologies and tools
  • Q2. What is Snowflake?
  • Q3. Coding questions on Python
Round 3 - HR 

(2 Questions)

  • Q1. About location preference
  • Q2. Behavioural questions

Interview Preparation Tips

Interview preparation tips for other job seekers - Nothing specific
Interview experience: 3 (Average)
Difficulty level: Moderate
Process Duration: Less than 2 weeks
Result: Not Selected

I applied via Company Website and was interviewed in Jul 2024. There was 1 interview round.

Round 1 - Technical 

(2 Questions)

  • Q1. Python lambda functions
  • Q2. What are pods in Kubernetes?
  • Ans. 

    Pods are the smallest deployable units in Kubernetes, consisting of one or more containers.

    • Pods are used to run and manage containers in Kubernetes

    • Each pod has its own unique IP address within the Kubernetes cluster

    • Pods can contain multiple containers that share resources and are scheduled together

    • Pods are ephemeral and can be easily created, destroyed, or replicated

    • Pods can be managed and scaled using Kubernetes contr

  • Answered by AI


Interview experience: 2 (Poor)
Difficulty level: -
Process Duration: Less than 2 weeks
Result: Not Selected

I applied via Approached by Company and was interviewed in Apr 2024. There was 1 interview round.

Round 1 - One-on-one 

(3 Questions)

  • Q1. What volume of data have you handled in your POCs?
  • Ans. 

    I have handled terabytes of data in my POCs, including data from various sources and formats.

    • Handled terabytes of data in POCs

    • Worked with data from various sources and formats

    • Used tools like Hadoop, Spark, and SQL for data processing

  • Answered by AI
  • Q2. When will you decide to use repartition and coalesce?
  • Ans. 

    Repartition performs a full shuffle and can increase or decrease the number of partitions, while coalesce reduces the number of partitions without a full shuffle.

    • Repartition is used when more partitions are needed to increase parallelism, or when data must be redistributed evenly.

    • Coalesce is used when there are too many partitions and they need to be reduced while avoiding a full shuffle.

    • Example: Repartition can be used before a join operation to evenly distribute data across partitions for bette...
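
    The difference can be illustrated with a toy model in plain Python. This is only a sketch of the idea, not Spark's implementation: partitions are modelled as lists, and the `repartition` and `coalesce` functions below are hypothetical stand-ins for the PySpark DataFrame methods of the same names.

```python
from typing import List


def repartition(partitions: List[List[int]], n: int) -> List[List[int]]:
    """Toy full shuffle: every record is reassigned, so the result is evenly spread."""
    records = [r for part in partitions for r in part]
    return [records[i::n] for i in range(n)]


def coalesce(partitions: List[List[int]], n: int) -> List[List[int]]:
    """Toy merge: whole partitions are concatenated; records are never redistributed."""
    if n >= len(partitions):
        return partitions  # coalesce cannot increase the partition count
    merged: List[List[int]] = [[] for _ in range(n)]
    for i, part in enumerate(partitions):
        merged[i * n // len(partitions)].extend(part)
    return merged


# One large partition and three tiny ones (made-up data).
skewed = [[1, 2, 3, 4, 5, 6], [7], [8], [9]]
repartitioned_sizes = [len(p) for p in repartition(skewed, 4)]
coalesced_sizes = [len(p) for p in coalesce(skewed, 2)]
print(repartitioned_sizes)  # [3, 2, 2, 2]
print(coalesced_sizes)      # [7, 2]
```

    In this model the shuffle evens out the skewed input, while the merge is cheaper but carries the skew into the result. In PySpark the corresponding calls are `df.repartition(n)` and `df.coalesce(n)`.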

  • Answered by AI
  • Q3. How would you design/configure a cluster if you were given 10 petabytes of data?
  • Ans. 

    Designing/configuring a cluster for 10 petabytes of data involves considerations for storage capacity, processing power, network bandwidth, and fault tolerance.

    • Consider using a distributed file system like HDFS or object storage like Amazon S3 to store and manage the large volume of data.

    • Implement a scalable processing framework like Apache Spark or Hadoop to efficiently process and analyze the data in parallel.

    • Utilize...
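
    For the storage-capacity consideration, a back-of-the-envelope estimate might look like the sketch below. Only the 10 PB figure comes from the question; the replication factor of 3, the 25% free-space headroom, and the 48 TB of usable disk per node are assumptions chosen for illustration, and the function name `estimate_nodes` is hypothetical.

```python
import math


def estimate_nodes(data_pb: float, replication: int, headroom: float,
                   disk_tb_per_node: float) -> int:
    """Estimate storage nodes needed; every input except data_pb is an assumption."""
    raw_tb = data_pb * 1000 * replication     # 1 PB taken as 1000 TB
    provisioned_tb = raw_tb / (1 - headroom)  # keep a fraction of the disks free
    return math.ceil(provisioned_tb / disk_tb_per_node)


# 10 PB from the question; replication 3, 25% headroom, 48 TB per node are assumed.
nodes = estimate_nodes(data_pb=10, replication=3, headroom=0.25, disk_tb_per_node=48)
print(nodes)  # 834
```

    This covers storage only. Processing power, network bandwidth, and fault tolerance would each need their own estimate, and object storage such as Amazon S3 changes the picture because capacity is not tied to cluster nodes.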

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Sometimes you can easily tell that they want to rush your interview by asking irrelevant questions, even though you have not mentioned such expertise in your resume. Also, they shortlisted the resume only to reject it. This breaks the confidence of a person who is looking for a job and honestly conveying their level of expertise.


Interview experience: 5 (Excellent)
Difficulty level: Moderate
Process Duration: Less than 2 weeks
Result: Selected

I applied via Job Portal and was interviewed in Aug 2024. There were 3 interview rounds.

Round 1 - Aptitude Test 

It's a mandatory test, even for experienced people

Round 2 - Technical 

(1 Question)

  • Q1. Related to technology
Round 3 - HR 

(1 Question)

  • Q1. Very good discussion towards work culture, salary and all

CodeAstu Technologies Interview FAQs

How many rounds are there in CodeAstu Technologies Data Engineer interview?
CodeAstu Technologies interview process usually has 2 rounds. The most common rounds in the CodeAstu Technologies interview process are Technical, Resume Shortlist and HR.
What are the top questions asked in CodeAstu Technologies Data Engineer interview?

Some of the top questions asked at the CodeAstu Technologies Data Engineer interview -

  1. what aldfja aldj ad...
  2. Python code with different c...
  3. what aldjf adlfj ad adf ...


CodeAstu Technologies Data Engineer Salary
based on 4 salaries
₹5.8 L/yr - ₹15 L/yr
5% less than the average Data Engineer Salary in India

CodeAstu Technologies Data Engineer Reviews and Ratings

based on 1 review

4.0/5

Rating in categories

Skill development: 5.0
Work-Life balance: 5.0
Salary & Benefits: 3.0
Job Security: 4.0
Company culture: 5.0
Promotions/Appraisal: 3.0
Work Satisfaction: 4.0

Software Engineer (13 salaries): ₹5 L/yr - ₹10 L/yr
Senior Software Engineer (11 salaries): ₹8.5 L/yr - ₹13.8 L/yr
Softwaretest Engineer (5 salaries): ₹3 L/yr - ₹9.5 L/yr
Software Developer (4 salaries): ₹5.2 L/yr - ₹13 L/yr
QA Engineer (4 salaries): ₹3.2 L/yr - ₹5.5 L/yr
