Develop and maintain data formatter repository structure.
Develop data formatters for data transfer per the project-specific data transfer plan (DTP).
Develop data processing tools for data ingestion and data export.
Develop test scripts to ensure correct functionality of the developed data management tools.
Document code, procedures, and tests used for data QC, clinical data integration, and data transfer operations.
Develop data pipelines for integration with internal and external systems.
Develop Data Transfer Plans in collaboration with the Software Engineering teams.
Develop in a team environment with source code management.
Perform other departmental and study-related activities as needed and upon request.
Preferred candidate profile
Requires a BS or MS (preferred) degree in Statistics, Biostatistics, Computer Science, Engineering, or an equivalent field.
5+ years of Stata/SAS/Python development experience (3+ years in industry).
Knowledge of data QC (validity, consistency, completeness, etc.), data cleansing (syntax error detection, duplicate removal, data auditing), and data transformation.
Knowledge of ETL toolkits, RDBMS, and NoSQL databases (desirable but not essential).
Ability to work with deadlines and manage multiple priorities.