Data management is important throughout the lifecycle of project. I have compiled the following as a ready reckoner based on my understanding of data management, These are more of bullet points and interpretation is left unto the reader:
- How data is collected a. Instruments
- Forms and Versioning
- Instructions
- Modality: Paper/ electronic
- Organized a. Database
- Structure
- Codes
- Labels
- Relationships across tables
- File naming
- Location
- Data Related Activities
- Training
- Data Collection
- Data Quality monitoring
- Data entry: Independent, Double data entry
- Transcribing
- Translation
- Data Cleaning
- Data Quality Assessment
- Completeness / Missingness
- Consistency Checks
- Audit log
- Analysis plan
- Dummy tables
- Data Analysis – Interim; Final
- Code used for analysis: reproducibility, integrity
- Publication
- Data File
- Commands
- Security and Safety
- Storage: Locations, Copies
- Backed up i. Frequency ii. Locations
- Access Control
- Encryption
- Safety during Transportation
- Confidentiality
- De-identification
- Data Dissemination Plan
- Data Sharing Plan a. Licensing Policy
- Archival
- Accessibility in Future
- Study Metadata
- Data Archive
- Legal requirements
- Data Custodian
- Institutional Contacts
- Duration of retention
- De-identification
- Encryption
- Raw and Clean data
- Primary Case Record Forms: Paper CRFs, eCRFs
Needless to day, data management is Important throughout the lifecycle of project