We are looking for a Data Architect to build, optimize, and maintain data lake architectures. The work includes data transformation on cloud platforms, as well as the design and enhancement of Data Lakehouses to better enable downstream BI applications, using emerging technologies and standard data models.
We are looking for an experienced leader who has personally applied effective methods to transform raw data into useful decision support systems aligned with business goals.
The ideal candidate is detail-oriented, with excellent organizational skills, strong analytical skills, and the ability to combine data from different sources. They should be familiar with several programming languages such as Python and Scala, have worked on multiple cloud platforms, and have knowledge of AI/ML technologies and algorithms and their applications.
The candidate should be an architect who has specifically designed Data Lakes on GCP while ensuring Metadata Management, operational cost reduction, data standardization, Data Governance, Security, Scalability, and Performance.
Primary Expectations
Build data systems and pipelines based on an evaluation of business needs and objectives
Conduct complex data analysis on raw source data and design the data lake, data model, and storage to house it in a standardized manner
Collaborate with data scientists and architects on several projects and design solutions for use by the Data Science teams
Has led a team of 15+ developers, engineers, and scientists to design complex data lake solutions
Hands-on experience in developing ETL workflows and pipelines using Apache Airflow, SAP BODS, or similar tools
Experience working with cloud technologies (GCP / AWS), especially those related to data & analytics
Experience with Big Data technologies, especially Spark, NoSQL, and HDFS
Technical expertise with data models and architecture, data migration and managing data quality
Hands-on and efficient in writing complex SQL queries for all test cases (unit, system, functional, data reconciliation, etc.)
Good understanding of BI & DWH development methodologies
Strong experience with Data Governance (Data Quality, Metadata Management, Security, etc.)
Should have good knowledge of SAP ERP data, e.g., HR / Payroll / SuccessFactors, Finance, Procurement, etc.