The NIH expects broad data-sharing for funded research that produces large-scale human or non-human genomic data, as described in the supplemental information to the GDS Policy. Examples of data types and sample sizes for human studies are shown in the table below. Please see the GDS Policy supplemental information for additional details.
The NIH GDS Policy: Examples of data types and sample sizes for human studies | Number of Participants |
---|---|
SNP array data from >300K single nucleotide polymorphisms (SNPS) (e.g., GWAS data) |
1,000 |
DNA sequence data from < 100 genes or regions of interest (in general less than 34Mb) (e.g., targeted sequencing) |
1,000 |
DNA sequence data from ≥ 100 genes or regions of interest (in general greater than 34Mb) (e.g., targeted sequencing, whole exome sequencing, whole genome sequencing) |
100 |
Genome-wide RNA sequencing (RNA-seq) data (e.g., transcriptomic data) |
100 |
Genome-wide DNA methylation data (e.g., bisulfite sequencing data) |
100 |
Genome-wide chromatin immunoprecipitation sequencing (ChIP-seq) data (e.g. transcription factor ChIP-seq, histone modification ChIP-seq) |
100 |
NIDCR extends this expectation to the following types of projects producing genomic data, regardless of the number of specimens or participants:
- Human studies of rare diseases: Genetic & Rare Disease Information Center
- The oral microbiome (archea, bacteria, viruses, and fungi).
- Under-studied human populations.
- Specific diseases or disorders: Human studies of Sjögren's Syndrome/xerostomia, or Temporomandibular Joint Disorder or related pain conditions.
Note that some journals have broad data-sharing expectations as well.
Questions?
Contact the Program Director for your area of research
NIDCR Genomic Program Administrator
NIDCR’s scope for the GDS policy will be reviewed periodically and updated as needed.