Key Responsibilities:

1. Data Pipeline Development:
   o Build and maintain scalable, efficient data pipelines using Python and PySpark.
   o Write SQL queries to extract, transform, and load data from Amazon Athena.
   o Ingest large-scale data from various sources using AWS tools (e.g., S3, Redshift, Glue, Lambda, Athena), with an emphasis on ensuring data quality and integrity.
   o Implement data quality checks, validation rules, and error-handling mechanisms to ensure accurate data ingestion (see the first sketch after this list).

2. Data Exploration and Wrangling:
   o Utilize NLP techniques and regular expressions to explore, clean, and process unstructured datasets. Perform data wrangling to structure raw data for further analysis using Pandas or PySpark as appropriate.
   o Apply intermediate SQL techniques for data manipulation and aggregation, optimizing queries for large datasets.
   o Handle geospatial data and processes related to geolocation, geofencing, and entity deduplication (e.g., address normalization and deduplication of entities such as companies; see the second sketch after this list).

3. Exploratory Data Analysis (EDA):
   o Conduct exploratory data analysis using Python, PySpark, and SQL to discover patterns, trends, and anomalies within large datasets, particularly in the logistics and supply chain management domain (see the third sketch after this list).
   o Leverage knowledge of logistics, geolocation, and supply chain data to identify key insights and make data-driven recommendations.

4. Data Evaluation & Quality Assurance:
   o Identify and troubleshoot data quality issues related to geolocation, company addresses, logistics, and supply chain data, ensuring data consistency, accuracy, and reliability.

5. Communication & Documentation:
   o Reports: Communicate data findings and insights to both technical and non-technical stakeholders through detailed reports.
   o Task Tracking: Document project progress and tasks using Jira and other project management tools to ensure clear communication and status tracking across teams. Collaborate with cross-functional teams to ensure alignment on project objectives and deliverables.
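
To illustrate the ingestion-time validation described under item 1, here is a minimal PySpark sketch. The S3 paths, column names (shipment_id, origin_address, weight_kg), and validation rules are hypothetical, chosen only to show the pattern of splitting valid rows from rejects:

    # Minimal sketch of an ingestion-time data quality check in PySpark.
    # Bucket paths, columns, and rules below are illustrative assumptions.
    from pyspark.sql import SparkSession
    from pyspark.sql import functions as F

    spark = SparkSession.builder.appName("shipment_ingestion").getOrCreate()

    # Read raw shipment records from S3 (hypothetical location and schema).
    raw = spark.read.parquet("s3://example-bucket/raw/shipments/")

    # Validation rules: required fields present, weights positive.
    valid = raw.filter(
        F.col("shipment_id").isNotNull()
        & F.col("origin_address").isNotNull()
        & (F.col("weight_kg") > 0)
    )
    rejected = raw.subtract(valid)

    # Route failures to a quarantine location for inspection instead of
    # silently dropping them, preserving data integrity downstream.
    rejected.write.mode("append").parquet("s3://example-bucket/quarantine/shipments/")
    valid.write.mode("append").parquet("s3://example-bucket/clean/shipments/")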
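The second sketch shows the kind of regex-based address normalization and company deduplication mentioned under item 2, using Pandas. The abbreviation table, sample records, and normalization key are assumptions for illustration, not a prescribed method:

    # Minimal sketch: normalize addresses with regular expressions, then
    # deduplicate company entities on a derived key. Patterns and column
    # names are illustrative assumptions.
    import re
    import pandas as pd

    ABBREVIATIONS = {r"\bst\b": "street", r"\brd\b": "road", r"\bave\b": "avenue"}

    def normalize_address(addr: str) -> str:
        # Lowercase, strip punctuation, collapse whitespace, expand abbreviations.
        addr = re.sub(r"[^\w\s]", " ", addr.lower())
        addr = re.sub(r"\s+", " ", addr).strip()
        for pattern, full in ABBREVIATIONS.items():
            addr = re.sub(pattern, full, addr)
        return addr

    companies = pd.DataFrame({
        "name": ["Acme Corp", "ACME Corp.", "Globex LLC"],
        "address": ["12 Main St.", "12 main street", "9 Harbor Rd"],
    })

    # Build a normalized key and keep one record per (name, address) entity.
    companies["key"] = (
        companies["name"].str.lower().str.replace(r"[^\w\s]", "", regex=True).str.strip()
        + "|" + companies["address"].map(normalize_address)
    )
    deduped = companies.drop_duplicates(subset="key")

Here "Acme Corp, 12 Main St." and "ACME Corp., 12 main street" collapse to the same key and are deduplicated; a production pipeline would typically add fuzzy matching on top of this exact-key pass.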
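The third sketch illustrates the EDA work under item 3: aggregating a logistics dataset in PySpark and flagging anomalous routes. The table name (logistics.deliveries), columns, and the three-standard-deviation threshold are all assumptions:

    # Minimal EDA sketch in PySpark: summarize transit times per route and
    # flag routes far above the fleet-wide average. Names are illustrative.
    from pyspark.sql import SparkSession, functions as F

    spark = SparkSession.builder.appName("delivery_eda").getOrCreate()
    deliveries = spark.table("logistics.deliveries")  # hypothetical catalog table

    per_route = deliveries.groupBy("route_id").agg(
        F.count("*").alias("n"),
        F.avg("transit_hours").alias("avg_transit"),
        F.stddev("transit_hours").alias("sd_transit"),
    )

    # Flag routes more than 3 standard deviations above the overall mean.
    stats = per_route.agg(F.avg("avg_transit"), F.stddev("avg_transit")).first()
    overall_mean, overall_sd = stats[0], stats[1]
    anomalies = per_route.filter(F.col("avg_transit") > overall_mean + 3 * overall_sd)
    anomalies.show()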