Explore our data

Visualize Download Pipeline

Pay it Forward!

The Treehouse Childhood Cancer Initiative is a research arm of the UCSC Genomics Institute. We enable the sharing of pediatric cancer genomic data using tools developed by our Genomics Institute colleagues. We use shared data to analyze a child’s tumor against both child and adult patient cancer tumors using a “pan cancer” or cross-comparison gene expression analysis. Our goal is to identify situations where an an approved drug, often an adult drug, is predicted to work on a child with cancer.

As part of our research, we have gathered a compendium of RNA gene expression data which we have made available for download and visualization. Please let us know if there is an additional format or tool that would make it easier for you to use our data.

April 2018: New Dataset Available

The second version of the public expression dataset has been released! This dataset includes expression from 184 additional samples and is available to visualize and download via our Public Data page.


For more information on these visualizations, see our Public Data page.

The UCSC Cluster Browser interactively displays samples in the Treehouse dataset positioned according to their RNA profiles, clustered using the t-SNE algorithm. It best shows relationships among larger groups.

UCSC Xena allows users to explore the Treehouse dataset. This example shows that neuroblastoma in comparison to other pediatric cancers has a much stronger ALK gene expression and younger patient population.

The UCSC TumorMap interactively displays samples in the Treehouse dataset positioned according to their RNA profiles, clustered using the OpenOrd algorithm. Users can color the samples based on dataset features like Disease.


Our Public Data page provides links to download clinical and expression data from our available public datasets.

Over 11, 000 samples are available for download along with clinical data including age, gender, and disease type. Our samples are derived from partner clinical sites and publicly available repositories, including TARGET and TCGA.


Information on our open-source RNA-Seq processing pipeline is available on our Public Data page.

Thank You!

We are grateful to all our supporters and clinical partners (see below, and on our Acknowledgments page). Without them, we would not be able to accomplish this important work.

Thank you to all who are sharing data. A special shout out to the St. Baldrick’s Foundation and the California Initiative to Advance Precision Medicine, not only for supporting Treehouse but for their commitment to data sharing and their efforts to advance responsible data sharing.

With support from

California Initiative to Advance Precision MedicineSt. Baldrick's Foundation
Unravel pediatric cancerLive For Others FoundationTeam G Childhood Cancer Foundation

With thanks to our clinical and research partners

Stanford MedicineCHOC Children'sUCSF Benioff Children's HospitalsBC Children's Hospital

Data Usage Policy

If you use our data, please acknowledge the Treehouse Childhood Cancer Initiative as the source of the data.

If you use our pipeline to process your data, we would appreciate it if you share the results with us, so it can be added to the public database. Just send us an email and we’ll get in touch to arrange the data transfer. Our goal is to benefit researchers and pediatric patients everywhere through access to data.