Genpact
Proud winner of ABECA 2024 - AmbitionBox Employee Choice Awards
2050 Genpact Jobs
Data Engineer - Python/ETL (5-10 yrs)
Genpact
posted 16hr ago
Flexible timing
Key skills for the job
Inviting applications for the role of Senior Data Engineer.
Responsibilities :
Technical Skills : Python/Pyspark + Databricks/ AWS or Azure is also fine + SQL+ Jenkins.
The primary tasks, functions and deliverables of the role :
- Design and build reusable components, frameworks and libraries at scale to support analytics products
- Design and implement product features in collaboration with business and Technology stakeholders
- Identify and solve issues concerning data management to improve data quality
- Clean, prepare and optimize data for ingestion and consumption
- Collaborate on the implementation of new data management projects and re-structure of the current data architecture
- Implement automated workflows and routines using workflow scheduling tools
- Build continuous integration, test-driven development and production deployment frameworks
- Analyze and profile data for designing scalable solutions
- Troubleshoot data issues and perform root cause analysis to proactively resolve product and operational issues
Minimum Qualifications :
Experience :
- Strong understanding of data structures and algorithms
- Strong understanding of solution and technical design
- Has a strong problem solving and analytical mindset.
- Able to influence and communicate effectively, both verbally and written, with team members and business stakeholders
- Able to quickly pick up new programming languages, technologies, and frameworks
- Experience building cloud scalable, real time and high-performance data lake solutions
- Fair understanding of developing complex data solutions
- Experience working on end-to-end solution design
- Willing to learn new skills and technologies
- Has a passion for data solutions
Required skill :
1. Hands on experience in Databricks and AWS - EMR [Hive, Pyspark], S3, Athena.
2. Familiarity with Spark Structured Streaming
3. experience working experience with Hadoop stack dealing huge volumes of data in a scalable fashion
4. hands-on experience with SQL, ETL, data transformation and analytics functions
5. hands-on Python experience including Batch scripting, data manipulation, distributable packages
6. experience working with batch orchestration tools such as Apache Airflow or equivalent, preferable Airflow
7. working with code versioning tools such as GitHub or BitBucket; expert level understanding of repo design and best practices
8. Familiarity with deployment automation tools such as Jenkins
9. hands-on experience designing and building ETL pipelines; expert with data ingest, change data capture, data quality; hand on experience with API development
10. designing and developing relational database objects knowledgeable on logical and physical data modelling concepts; some experience with Snowflake
11. Familiarity with Tableau or Cognos use cases
Preferred Qualifications :
12. Familiarity with Agile; working experience preferred
Functional Areas: Software/Testing/Networking
Read full job descriptionPrepare for Genpact Data Engineer roles with real interview advice
The job security as if you do the worst also they won't fire you they put you under bench and performance improvement
Depending on your projects, either you work too much or you don't work at all, if you work too much you get an annual appraisal of 4.3% - 7 percent max If you work too little you get 0-4.3%