Proactively engage with internal customers within the Technology division and other parts of the business to understand their complex requirements for data and aspired outcomes.
Leverage data management platforms to deliver capabilities that can analyze customer behavior, preferences, and trends, providing actionable insights to enhance customer experience and satisfaction.
Design scalable and extensible data architectures to support current and future data needs.
People:
Work closely with cross-functional teams, including data engineers, data scientists, business analysts, and IT, to deliver comprehensive analytics and AI solutions.
Mentor junior team members, sharing knowledge and expertise to develop their skills and enhance the overall capability of the data team.
Maintain effective working relationships with the business stakeholders and the rest of the IT team.
Operations:
Develop data sourcing and ingestion models to collect, validate, and integrate data from diverse sources into our data ecosystem.
Design, build, and optimize end-to-end data pipelines for ingesting, processing, and transforming large volumes of data.
Architect and manage data storage solutions including relational databases, NoSQL databases, data lakes, and object stores.
Develop and implement monitoring solutions to track data pipeline health, performance metrics, and resource utilization.
Financial Result:
Ensure that data solutions deliver measurable business value and return on investment.
Optimize data processing workflows, database queries, and storage configurations for cost efficiency and resource utilization.
Track and report on key performance indicators (KPIs) related to data engineering initiatives to demonstrate impact and value to stakeholder.
Job Responsibilities - Experience and Education
Demonstrated experience in data architecture design, data modeling, and database technologies (SQL, NoSQL)
Expertise in managing and administering relational databases (SQL) and NoSQL databases (e.g., MongoDB, Cassandra) to store and retrieve data efficiently.
Strong understanding of distributed computing principles, parallel processing, and cluster management.
Experience with data processing frameworks such as Apache Spark, Apache Flink, or Apache Beam.
Hands-on experience with cloud-based data technologies such as Amazon S3, Azure Data Lake Storage, Google Big Query, or similar
In-depth knowledge of big data technologies and frameworks for distributed computing and data processing, including Hadoop, MapReduce, and Spark.