<div id="main"><!-- [et_pb_line_break_holder] --> <h1 id="title" class="ui center aligned header">UCSC Treehouse Public Data</h1><!-- [et_pb_line_break_holder] --> <p><!-- [et_pb_line_break_holder] --> The Treehouse Childhood Cancer Initiative is a research arm of the <a href="https://ucscgenomics.soe.ucsc.edu/">UCSC Genomics Institute</a>. We enable the sharing of pediatric cancer genomic data using tools developed by our Genomics Institute colleagues. We use shared data to analyze a child's tumor against both child and adult patient cancer tumors using a "pan cancer" or cross-comparison gene expression analysis. Our goal is to identify situations where an an approved drug, often an adult drug, is predicted to work on a child with cancer.<!-- [et_pb_line_break_holder] --> </p><!-- [et_pb_line_break_holder] --> <p><!-- [et_pb_line_break_holder] --> As part of our research, we have gathered a compendium of RNA gene expression data which we have made available for download and visualization.<!-- [et_pb_line_break_holder] --> </p><!-- [et_pb_line_break_holder] --> <p><!-- [et_pb_line_break_holder] --> Our samples are derived from partner clinical sites and publicly available repositories, including TARGET and TCGA. <!-- [et_pb_line_break_holder] --> Expression data from over 11,000 samples is available along with clinical data including age, gender, and disease type.<!-- [et_pb_line_break_holder] --> </p><!-- [et_pb_line_break_holder] --> <h3>Visualizations</h3><!-- [et_pb_line_break_holder] --> <div class="ui compact segment vizboxes"><!-- [et_pb_line_break_holder] --> <h4>Tumormap</h4><!-- [et_pb_line_break_holder] --> <div class="vizbox"><!-- [et_pb_line_break_holder] --> <a href="https://tumormap.ucsc.edu/help/overview.html"><!-- [et_pb_line_break_holder] --> <img src="https://treehousegenomics.soe.ucsc.edu/wp-content/uploads/2018/04/TumorMap-example-v5April2018-300x138.png"><!-- [et_pb_line_break_holder] --> </a><!-- [et_pb_line_break_holder] --> <p class="ui basic segment"><!-- [et_pb_line_break_holder] --> The <a href="https://tumormap.ucsc.edu/help/overview.html">UCSC TumorMap</a> interactively displays samples in the Treehouse dataset positioned according to their <!-- [et_pb_line_break_holder] -->RNA profiles. Users can color the samples based on dataset features like Disease. This browser shows samples clustered using the OpenOrd algorithm <!-- [et_pb_line_break_holder] -->and best separates smaller groups. (See <!-- [et_pb_line_break_holder] --> <a href="http://cancerres.aacrjournals.org/content/77/21/e111"><!-- [et_pb_line_break_holder] --> "TumorMap: Exploring the Molecular Similarities of Cancer Samples in an Interactive Portal."<!-- [et_pb_line_break_holder] --> </a> Cancer Research November 2017).<!-- [et_pb_line_break_holder] --> </p><!-- [et_pb_line_break_holder] --> </div><!-- tumormap vizbox --><!-- [et_pb_line_break_holder] --> <h4>Cluster Browser</h4><!-- [et_pb_line_break_holder] --> <div class="vizbox"><!-- [et_pb_line_break_holder] --> <a href="http://tsne.treehouse.gi.ucsc.edu/"><!-- [et_pb_line_break_holder] --> <img src="https://treehousegenomics.soe.ucsc.edu/wp-content/uploads/2017/10/tsne-thped-by-disease-300x136.png"><!-- [et_pb_line_break_holder] --> </a><!-- [et_pb_line_break_holder] --> <p class="ui basic segment"><!-- [et_pb_line_break_holder] --> The UCSC Cluster Browser interactively displays samples in the Treehouse dataset positioned according to their RNA profiles. Users can color the samples based on dataset features like Disease. This browser quickly shows samples clustered using the t-SNE algorithm and best shows relationships among larger groups.<!-- [et_pb_line_break_holder] --> </p><!-- [et_pb_line_break_holder] --> </div><!-- cluster browser vizbox --><!-- [et_pb_line_break_holder] --> <h4>Xena</h4><!-- [et_pb_line_break_holder] --> <div class="vizbox"><!-- [et_pb_line_break_holder] --> <a href="https://xenabrowser.net/heatmap/?bookmark=2ac4e86d1a597ea64de0fc9a7b1782ea"><!-- [et_pb_line_break_holder] --> <img src="https://treehousegenomics.soe.ucsc.edu/wp-content/uploads/2017/10/high-res-xena-screenshot-oct.16.2017-300x136.png"><!-- [et_pb_line_break_holder] --> </a><!-- [et_pb_line_break_holder] --> <p class="ui basic segment"><!-- [et_pb_line_break_holder] --> UCSC Xena allows users to explore the Treehouse dataset, finding correlations and trends within and across genomic and phenotypic variables. Users can interactively add, remove, and rearrange arbitrary slices of data including genes, transcripts and other dataset features. <!-- [et_pb_line_break_holder] --> <a href="https://xenabrowser.net/heatmap/?bookmark=2ac4e86d1a597ea64de0fc9a7b1782ea">This example from our July 2017 dataset</a><!-- [et_pb_line_break_holder] --> shows that neuroblastoma in comparison to other pediatric cancers has a much stronger ALK gene expression and younger patient population.<!-- [et_pb_line_break_holder] --> </p><!-- [et_pb_line_break_holder] --> </div><!-- xena vizbox --><!-- [et_pb_line_break_holder] --> </div><!-- vizboxes : visualizations--><!-- [et_pb_line_break_holder] --> <h3>Files</h3><!-- [et_pb_line_break_holder] --> <p>Three different file types are available.</p><!-- [et_pb_line_break_holder] --> <div class="ui compact segment vizboxes"><!-- [et_pb_line_break_holder] --> <h4>Limited Clinical Data</h4><!-- [et_pb_line_break_holder] --> <div class="vizbox ui basic segment"><!-- [et_pb_line_break_holder] --> Age, gender, and disease are provided for RNASeq samples compiled by the UCSC Treehouse Childhood Cancer Initiative. Samples derived from clinical sites, publicly available repositories, TARGET, and TCGA.<!-- [et_pb_line_break_holder] --> </div><!-- [et_pb_line_break_holder] --> <h4>TPM Gene Expression, log<sub>2</sub>-Normalized</h4><!-- [et_pb_line_break_holder] --> <div class="vizbox ui basic segment"><!-- [et_pb_line_break_holder] --> Values in this dataset use HUGO gene names and are TPM, transformed by log<span class="subscript">2</span>(x+1) of the TPM value.<!-- [et_pb_line_break_holder] --> </div><!-- [et_pb_line_break_holder] --> <h4>Expected Counts Gene Expression</h4><!-- [et_pb_line_break_holder] --> <div class="vizbox ui basic segment"><!-- [et_pb_line_break_holder] --> Values in this dataset are expected_count and use Ensembl gene IDs. <!-- [et_pb_line_break_holder] --> </div><!-- [et_pb_line_break_holder] --> </div><!-- vizboxes files --><!-- [et_pb_line_break_holder] --><!-- [et_pb_line_break_holder] --> <h2 id="datasets">Download</h2><!-- [et_pb_line_break_holder] --> <di
v class="ui compact segment vizboxes"><!-- [et_pb_line_break_holder] --> <h3 id="april2018">Compendium v5 Public (April 2018)</h3><!-- [et_pb_line_break_holder] --> <div class="vizbox"><!-- [et_pb_line_break_holder] --> <div><!-- [et_pb_line_break_holder] --> <h4>Visualize</h4><!-- [et_pb_line_break_holder] --> <ul class="ui"><!-- [et_pb_line_break_holder] --> <li><a href="https://tumormap.ucsc.edu/?p=Treehouse/TreehousePEDv5_April2018">Tumormap</a></li><!-- [et_pb_line_break_holder] --> <li><a href="#" class="disabled">Cluster Browser</a> (coming soon)</li><!-- [et_pb_line_break_holder] --> <li><a href="https://xenabrowser.net/datapages/?hub=https://xena.treehouse.gi.ucsc.edu:443">Xena</a></li><!-- [et_pb_line_break_holder] --> </ul><!-- [et_pb_line_break_holder] --> </div><div><!-- [et_pb_line_break_holder] --> <h4>Files</h4><!-- [et_pb_line_break_holder] --> <ul><!-- [et_pb_line_break_holder] --> <li><a href="https://xenabrowser.net/datapages/?dataset=TreehousePEDv5_clinical_metadata.2018-05-09.tsv&host=https%3A%2F%2Fxena.treehouse.gi.ucsc.edu%3A443">Clinical Data</a></li><!-- [et_pb_line_break_holder] --> <li><a href="https://xenabrowser.net/datapages/?dataset=TreehousePEDv5_unique_hugo_log2_tpm_plus_1.2018-05-09.tsv&host=https%3A%2F%2Fxena.treehouse.gi.ucsc.edu%3A443">TPM Expression</a></li><!-- [et_pb_line_break_holder] --> <li><a href="#" class="disabled">Expected Counts Expression</a> (coming soon)</li><!-- [et_pb_line_break_holder] --> </ul><!-- [et_pb_line_break_holder] --> </div><!-- [et_pb_line_break_holder] --> </div><!-- [et_pb_line_break_holder] --> This compendium was released in April 2018. It includes a total of 11,258 samples from Treehouse (identifiers start with "TH"), TCGA and TARGET projects.<!-- [et_pb_line_break_holder] -->This data was generated by library preparation methods including polyA selection and ribosomal depletion.<!-- [et_pb_line_break_holder] --> </div><!-- [et_pb_line_break_holder] --> </div>