- UCSC Xena is an online data visualization and exploration platform developed by the University of California, Santa Cruz, designed to support cancer genomics and biomedical research. Its primary purpose is to provide researchers, clinicians, and students with intuitive access to large-scale, publicly available datasets, such as those from The Cancer Genome Atlas (TCGA), Genomic Data Commons (GDC), International Cancer Genome Consortium (ICGC), and GTEx, as well as user-hosted private data. By integrating these diverse sources, UCSC Xena allows users to explore multi-omics data—including genomics, transcriptomics, epigenomics, proteomics, and clinical information—within a unified, interactive interface.
- A hallmark feature of UCSC Xena is the Xena Browser, a powerful visualization tool that enables users to create customizable data matrices, called Xena Visual Spreadsheets. These spreadsheets allow side-by-side comparison of multiple data modalities, such as gene expression, mutation status, copy number alterations, and clinical phenotypes, for thousands of samples across different cancer types. The platform supports intuitive filtering, subgrouping, and statistical analysis, making it possible to identify patterns, correlations, and potential biological insights without requiring advanced programming skills.
- Importantly, UCSC Xena is built with flexibility and privacy in mind. Researchers can host their own datasets securely using Xena Hubs, which can be either public or private. This feature ensures that sensitive or unpublished data can be analyzed in conjunction with public resources while remaining protected. The hub-based architecture also allows scalable integration of multiple data sources without duplicating storage, making it suitable for both small laboratories and large consortia.
- One of the greatest strengths of UCSC Xena is its extensive collection of major public datasets. These include:
- The Cancer Genome Atlas (TCGA Pan-Cancer Atlas), which provides multi-omics data across 11,000 tumors from 33 cancer types.
- ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG), offering whole-genome data from thousands of cancer patients.
- Genotype-Tissue Expression (GTEx) project, which contains expression data from normal tissues, enabling cancer vs. normal comparisons.
- Therapeutically Applicable Research to Generate Effective Treatments (TARGET), focusing on pediatric cancers.
- GDC-hosted datasets including harmonized RNA-seq, mutation, and copy number profiles across cancer studies.
- Together, these datasets make UCSC Xena one of the most comprehensive platforms for cancer genomics research, bridging large-scale public projects with private laboratory data.
- UCSC Xena is particularly valuable for precision oncology, as it enables exploration of genetic and molecular alterations across large patient cohorts. For example, users can quickly assess how a specific mutation correlates with patient survival, or how gene expression profiles vary across cancer subtypes. The platform also provides survival analysis tools, Kaplan–Meier plots, and statistical testing options that streamline hypothesis generation and validation.
- Beyond cancer, UCSC Xena’s infrastructure can be applied to other large-scale biomedical datasets, supporting broader applications in systems biology and human disease research. Its open-source nature ensures continual community-driven development, and its emphasis on accessibility makes it a key educational tool for training the next generation of computational biologists and biomedical researchers.
- In summary, UCSC Xena is a scalable, interactive, and user-friendly platform for visualizing and analyzing complex genomic datasets. By integrating public and private data in a secure environment, and by providing intuitive visualization and analysis tools, it accelerates discovery in cancer research and beyond. Its combination of flexibility, accessibility, and scientific depth has made it one of the most widely used platforms in modern genomics.