The 3C Content Committee has identified the following datasets that are being used in the tranSMART Platform.  You will need to determine the availability and suitability of each dataset for your use.  We have provided the information we know in the table.  If you know of additional datasets, or have more information on any of these, please contact the Content Committee.


 

Public Datasets

These datasets have been loaded into the tranSMART Platform by one of the groups using the tranSMART Platform. In most cases, some curation was done to prepare the data for loading and may be required to use the data directly from the source.
Disease AreaDatabase or StudyDisease subsetNumber of data sets or samplesCurated data sets available nowLicensed needed to downloadCan be distributed to third parties
OncologyTCGA30+ cancers>10,000 samplesRancho BioSciencesNo for levels 3,4; yes for levels 1,2No
OncologyCCLEVarious>>10,000 samplesRancho BioSciencesNoYes
OncologyCosmicVarious>>10,000 samplesNoYes
OncologyGEOVarious>400Rancho BioSciencesNoYes
OncologyICGCVarious>50YesNo
OncologyGTExNormal tissues>>1,000 samplesYesNo
ImmunologyGEOAll>100Rancho BioSciencesNoYes
ImmunologyGEOIBD (Crohn’s, UC)>35Rancho BioSciencesNoYes
ImmunologyGEOIPF>15Rancho BioSciencesNoYes
ImmunologyGEOLupus>11Rancho BioSciencesNoYes
ImmunologyGEOMisc (Kawasaki, Lyme, etc)10Rancho BioSciencesNoYes
ImmunologyGEORA, including psRA20Rancho BioSciencesNoYes
ImmunologyGEOSarcoidosis10Rancho BioSciencesNoYes
ImmunologyGEOVasculitis6Rancho BioSciencesNoYes
ImmunologydbGaPIBD (Crohn’s, UC)5YesNo
ImmunologydbGaPIPF1YesNo
ImmunologydbGaPLupus2YesNo
ImmunologydbGaPScleroderma1YesNo
ImmunologyEBIall>100Rancho BioSciencesYesNo
RespiratoryGEO>10Rancho BioSciencesNoYes
RespiratoryECLIPSEUnclearNot known
NeurosciencesGEOAlzheimer>3Rancho BioSciencesNoYes
NeurosciencesGEOMS30Rancho BioSciencesNoYes
NeurosciencesGEOParkinson7Rancho BioSciencesNoYes
NeurosciencesdbGAPAlzheimer4YesNo
NeurosciencesdbGAPMS2YesNo
NeurosciencesdbGAPParkinson6YesNo
NeurosciencesADNIAlzheimer>800 samplesYesNo
NeurosciencesPPMIParkinson>600 samplesYesNo
NeurosciencesLRRK2Parkinson>800 samplesYesNo
NeurosciencesBioFindParkinson120 samplesYesNo
CardiovasculardbGaPFramingham cohort study>1,000 samplesYesNo
Animal dataJacksons labsVarious>1,000 modelsNoYes

Last updated 8 April 2015