Data Engineer/Architect - AWS/PySpark (10-18 yrs)
Tekshapers Software Solutions
posted 21d ago
Position Name: Data Architect
Experience Range: 10 to 18 years
Location: Greater Noida, Gurugram, Pune, Kolkata, Hyderabad, and Bangalore
Mandatory Skills: Databricks, PySpark, Amazon Web Services (AWS)
Job Description :
- Expertise in designing, implementing, and maintaining data solutions, including delta lakes, data warehouses, data marts, and data pipelines, on the Databricks platform in support of business and technology objectives.
- Apply best practices during design in data modeling (logical, physical) and ETL pipelines (streaming and batch) using AWS cloud-based services.
- Proficiency in ETL implementation using Databricks on AWS, including hands-on experience with predictive optimization, Unity Catalog, and managed Delta tables.
- Design, develop, and manage the pipelining (collection, storage, access), data engineering (data quality, ETL, data modeling), and understanding (documentation, exploration) of the data.
- Perform data transformation tasks, including data cleansing, aggregation, enrichment, and normalization, using Databricks and related technologies.
- Experience in extracting data from heterogeneous sources (e.g., flat files, APIs, XML, RDBMS) and implementing complex transformations, such as Slowly Changing Dimensions (SCDs), in Databricks notebooks.
- Monitor and troubleshoot data pipelines, identifying and resolving performance issues, data quality problems, and other technical challenges.
- Implement best practices for data governance, data security, and data privacy within the Databricks environment.
- Interact with stakeholders to build understanding of the data landscape, conduct discovery exercises, and develop proof-of-concepts and demonstrate them to stakeholders.
- Proven skills in AWS data engineering and data lake services such as AWS Glue, S3, Lambda, SNS, and IAM.
- Strong hands-on knowledge of and experience with SQL, Python, and PySpark scripting.
- Experience in data migration projects from on-premises systems to the AWS cloud.
- Experience designing, developing, and implementing end-to-end data engineering solutions using Databricks for large-scale data processing and data integration projects.
- Build and optimize data ingestion processes from various sources, ensuring data quality, reliability, and scalability.
- Ability to understand and articulate requirements to technical and non-technical audiences.
- Experience in converting code from native ETL tools to PySpark.
- Collaborate with DevOps and infrastructure teams to optimize the performance and scalability of Databricks clusters and resources.
- Perform code deployments using CI/CD pipelines.
- Stakeholder management and communication skills, including prioritization, problem-solving, and interpersonal relationship building.
- Provide guidance and mentorship to junior data engineers, fostering a culture of knowledge sharing and continuous learning within the team.
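For candidates unfamiliar with the Slowly Changing Dimension (SCD) handling mentioned above: the core of SCD Type 2 is to expire the current version of a changed record and append a new versioned row, preserving full history. A minimal, framework-agnostic sketch in plain Python (not Databricks-specific; the function and column names here are illustrative assumptions, and in practice this maps to a Delta Lake MERGE):

```python
from datetime import date

def scd2_merge(dim_rows, source_rows, key, tracked_cols, as_of=None):
    """Sketch of SCD Type 2 merge logic over rows represented as dicts.

    Rows whose tracked columns changed have their current version
    expired (in place) and a new current version appended; brand-new
    keys are simply appended. Column names are illustrative.
    """
    as_of = as_of or date.today().isoformat()
    current = {r[key]: r for r in dim_rows if r["is_current"]}
    out = list(dim_rows)
    for src in source_rows:
        cur = current.get(src[key])
        if cur and all(cur[c] == src[c] for c in tracked_cols):
            continue  # no change: keep the existing current row
        if cur:
            cur["is_current"] = False  # expire the old version
            cur["end_date"] = as_of
        out.append({**src, "start_date": as_of, "end_date": None,
                    "is_current": True})
    return out
```

For example, merging an updated city for an existing key expires the old row and appends both the new version and any previously unseen keys, so the dimension retains one closed-out row and two current rows.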
Functional Areas: Software/Testing/Networking