Add support for reading CDE usage information from the Platform MDS #18

@gaurav

We have two studies with CDEs that we can use to start implementing this:

Note that these are strictly study/CRF relationships. We could instead include the CRF entirely within the data dictionary, but if a study has both VLMD and CRFs, we would expect to duplicate information between the two.

We should add the ability to pull study/CRF and variable/CDE mappings from other sources, such as a Google Sheet or the private GitHub repo. We will need this until we get all the mappings into the heal-data-dictionaries GitHub repository (https://github.com/uc-cdis/heal-data-dictionaries/issues/381).
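As a starting point for discussion, here is a minimal sketch of one such alternative source: a Google Sheet exported as CSV text. The function name `load_study_crf_mappings` and the column names `study_id` and `crf_id` are assumptions made for illustration, not an existing format in this project.

```python
import csv
import io
from collections import defaultdict


def load_study_crf_mappings(csv_text):
    """Parse CSV text into {study_id: [crf_id, ...]}.

    Assumes hypothetical 'study_id' and 'crf_id' columns; rows missing
    either value are skipped, and duplicate pairs are collapsed.
    """
    mappings = defaultdict(list)
    for row in csv.DictReader(io.StringIO(csv_text)):
        study_id = (row.get("study_id") or "").strip()
        crf_id = (row.get("crf_id") or "").strip()
        if study_id and crf_id and crf_id not in mappings[study_id]:
            mappings[study_id].append(crf_id)
    return dict(mappings)
```

Taking CSV text rather than a URL or file path keeps the parser independent of where the mappings live, so the same function could be fed from a sheet export today and from a file in the GitHub repository later.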

These could show up in the output dbGaP XML files in three ways:

  1. We could put in a top-level property storing the list of HDPCDEs for the study (the study/CRF mappings).
  2. We could provide individual mappings at the variable level.
  3. We could provide entire data dictionaries reproduced from the CRFs (i.e. if we know study A with 100 variables uses CRF X with 5 variables, we could add those five variables to the 100 variables in study A).
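To make the three options concrete, here is a rough sketch that applies all of them to one illustrative XML tree. The element names (`data_table`, `cde_crfs`, `cde`, `source_crf`), the function name, and the ID scheme are placeholders chosen for this example; they are not taken from the dbGaP XML schema or from the existing generator code.

```python
import xml.etree.ElementTree as ET


def build_data_table(study_id, variables, crf_ids, variable_to_cde, crf_variables):
    """Build an illustrative data table element showing all three options."""
    table = ET.Element("data_table", {"study_id": study_id})

    # Option 1: a top-level property listing the CRFs used by the study.
    ET.SubElement(table, "cde_crfs").text = ",".join(crf_ids)

    # Option 2: a per-variable mapping to a CDE, when one is known.
    for name in variables:
        var = ET.SubElement(table, "variable", {"id": name})
        ET.SubElement(var, "name").text = name
        cde = variable_to_cde.get(name)
        if cde is not None:
            ET.SubElement(var, "cde").text = cde

    # Option 3: reproduce each CRF's variables alongside the study's own.
    for crf_id in crf_ids:
        for name in crf_variables.get(crf_id, []):
            var = ET.SubElement(table, "variable", {"id": f"{crf_id}.{name}"})
            ET.SubElement(var, "name").text = name
            ET.SubElement(var, "source_crf").text = crf_id

    return table
```

With option 3, a study with 100 variables that uses a 5-variable CRF would end up with 105 `variable` elements, which is where the duplication concern comes from if the study's VLMD already covers those CRF variables.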

Unless we want to do more work in Dug (or replace dbGaP XML with something else?), we will need to produce a KGX file with all the CRFs (#17).
