Integrative Omics data set

The BBMRI-NL Omics data set comprises omics measurements in blood from participants in 29 Dutch cohorts. Genetic, epigenetic, transcriptome and metabolome data are available for approximately 3,500 samples. The data set is the joint collection of the GoNL Consortium, the BIOS Consortium and the Metabolomics Consortium.

What is BBMRI-NL’s Omics data set? 

The BBMRI-Omics data set was created to study the relations between different omics levels and to develop omics signatures of health and disease. The data set is publicly available and has helped researchers discover novel biological mechanisms and biomarkers for health and disease. It consists of approximately 3,500 samples with integrative data (genome, epigenome, transcriptome and metabolome), extended with metabolome data for 30,000 additional individuals and whole genome sequences for a selected group of 700 individuals.

The table below gives an overview of the techniques used and the number of samples available for each omics measure in the data set.

| Omics measure           | Technique      | Number of samples |
|-------------------------|----------------|-------------------|
| Whole Genome Sequencing | Illumina       | 750               |
| Imputed genotypes       | HRC            | ~6,000            |
| DNA Methylation         | Illumina 450K  | ~4,000            |
| Transcriptome           | Illumina RNA-seq | ~4,000          |
| Metabolome              | Nightingale NMR | ~32,000          |


Why BBMRI-NL’s Omics data set?

The BBMRI-Omics Explorer is designed to query the Omics data set. With it you can explore the number of samples available for any selected combination of omics data types, and check the distribution of sex, age and smoking status within that selection. You can also find out how many individuals in an age range of interest have multiple omics levels available.
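
The kind of query described above can be illustrated with a minimal Python sketch. It does not use the Explorer or any BBMRI-NL interface: the `Sample` record, the omics labels, the `select` and `summarize` helpers and the four example records are all hypothetical, invented only to show how counting samples by omics combination and age range works in principle.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Sample:
    """Hypothetical sample record; not the actual BBMRI-Omics schema."""
    sample_id: str
    age: int
    sex: str
    smoker: bool
    omics: frozenset  # omics data types available for this sample


def select(samples, required_omics, age_range=None):
    """Return samples that have every required omics type and,
    optionally, an age within the inclusive (low, high) range."""
    required = frozenset(required_omics)
    selected = []
    for s in samples:
        if not required <= s.omics:
            continue
        if age_range is not None and not (age_range[0] <= s.age <= age_range[1]):
            continue
        selected.append(s)
    return selected


def summarize(selected):
    """Count the selection and tabulate sex and smoking status."""
    return {
        "n": len(selected),
        "sex": dict(Counter(s.sex for s in selected)),
        "smoker": dict(Counter(s.smoker for s in selected)),
    }


# Made-up example records, for illustration only.
SAMPLES = [
    Sample("s1", 34, "F", False,
           frozenset({"genotypes", "methylation", "transcriptome", "metabolome"})),
    Sample("s2", 58, "M", True, frozenset({"genotypes", "metabolome"})),
    Sample("s3", 61, "F", True,
           frozenset({"methylation", "transcriptome", "metabolome"})),
    Sample("s4", 47, "M", False,
           frozenset({"genotypes", "methylation", "transcriptome"})),
]

if __name__ == "__main__":
    both = select(SAMPLES, ["methylation", "transcriptome"], age_range=(40, 65))
    print(summarize(both))
```

Requiring more omics types can only shrink the selection, which is why the number of samples with all levels available is smaller than the number available for any single measure.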

Intended users

The Omics data set and Explorer are intended for researchers investigating the relations between different genomic levels, and for researchers developing multilevel omics classifiers for disease risk.

Using BBMRI’s Omics data set

All data shared by the participating biobanks, including genome-wide SNP data, phenotypes, RNA sequences, DNA methylation and metabolomics for all individuals, can be analyzed on centralized computational facilities: a 64-core virtual machine in the SURFsara High Performance Computing cloud. There, BBMRI-Omics data are readily accessible through the dedicated R package BBMRIomics.

To request access to BBMRI-Omics data sets through the Cloud, separate forms must be completed for each providing consortium: GoNL provides whole genome sequencing data, the BIOS Consortium provides RNA sequencing and DNA methylation data, and the Metabolomics Consortium provides metabolomics data. Please fill out and sign a Code of Conduct and a Data Access Request form for each provider: GoNL form, BIOS Consortium form, Metabolomics Consortium form.

Support for BBMRI-NL’s Omics data set

For support on using the Omics data set, contact Leon Mei, Molecular Epidemiology, Leiden University Medical Center:

For more information on BBMRI-NL’s Omics data set, contact:

BBMRI-NL’s WP leaders: