Conduct thorough data profiling and analysis to identify anomalies, inconsistencies, and inaccuracies in datasets
Identify new methods of detecting data anomalies.
Automation
Automate processes, data quality checks, and workflows to ease data validation processes across complex, interdependent data systems
Quality Engineering
Implement, and maintain a robust data quality framework to assess, monitor, and report the quality of data across various systems and platforms
Design and develop generic quality check frameworks that can be utilized across multiple products
Develop and execute comprehensive data quality testing strategies and plans to verify the implementation of data pipelines and data validations.
Develop and implement manual and automated test cases to ensure reliability of data pipelines, data migration processes and data transformations
Design and implement intuitive metrics that show stake holders the health of their data in an actionable format
Develop test strategies and validation steps for Analytical Data Models
Conduct initial root cause analysis for data issues, collaborate with partners to clearly identify the issue, scope and impact, and path for research/solutioning
Documentation
Create and maintain documentation related to data quality processes and standards
Reporting
Establish monitoring mechanisms to proactively identify data quality issues, and generate regular reports on data quality metrics for review
Mentoring
Provide training and guidance to team members on data quality best practices and principles. Facilitate knowledge sharing sessions to promote a culture of data quality awareness.
Collaboration
Collect data quality requirements from key partners, seeking to understand the subjective quality measures that are important to data consumers to build and maintain trust in our data & products
Collaborate with Data Engineers, Data Analysts, and business leaders to understand data quality challenges within data workflows and how the data is used by Mastercard products and customers
Collaborate across teams as a data quality advocate, guiding on the need to balance which data/sources require high accuracy versus directionally accurate data