Projects

CB
Big Data
cBioPortal for Cancer Genomics

One of the most widely used platforms for visualizing and analyzing cancer genomics data. Fully open source, actively maintained by KSG @ DFCI, Memorial Sloan-Kettering Cancer Center, Princess Margaret Cancer Center, and The Hyve. We also maintain a local instance of cBioPortal @ DFCI, enabling researchers to mine all genomic data generated by the Profile project.

Project lead: Tali Mazor

Learn more →   cbioportal.org →
HT
Big Data
Human Tumor Atlas Network (HTAN)

Together with Sage Bionetworks, the Institute for Systems Biology, and Memorial Sloan Kettering Cancer Center, we serve as the Data Coordinating Center (DCC) for HTAN — a large NCI Moonshot project focused on longitudinal studies of pre-cancer, metastasis, and drug resistance with comprehensive clinical data collection over time.

Project lead: Alex Lash

Learn more →   humantumoratlas.org →
MG
Clinical
MatchMiner Genomics

An open source computational platform for matching patient-specific genomic profiles to precision cancer medicine clinical trials. Currently in active use within DFCI and available to both trial investigators and practicing oncologists.

Project lead: Tali Mazor

Learn more →   matchminer.org →
AI
Clinical
MatchMiner-AI

A next-generation trial matching platform that leverages large language models (LLMs) to match patients to all clinical trials — not just genomically driven ones — based on their full medical record. MatchMiner-AI processes unstructured clinical notes to extract key patient attributes and generate patient summaries for best-fit trial identification. Launched at DFCI in February 2026.

Project lead: Jen Altreuter

Learn more →
MD
Clinical
MatchMiner Dashboard

A pilot platform that centralizes and organizes real-time clinical trial data, including slot availability, accruals, study milestones, and AI-powered trial accrual forecasting.

Project lead: Erica Holdmore

KI
Infrastructure
KSG Infrastructure

Builds and maintains KSG infrastructure — servers, applications, dashboards, and cloud security — to ensure that systems are secure, reliable, and available. Supports the day-to-day operations of all KSG projects.

Project leads: James Lindsay, Jason Hansel

SP
Big Data
Spatial Profiling Data Management

An exploratory initiative focused on the storage, management, visualization, and standardization of spatial profiling data. We are actively investigating data standards, scalable storage solutions, and tools to make spatial omics data accessible and interpretable for researchers.

PR
Clinical
Profile and ImmunoProfile

Profile was a long-standing clinical next-generation sequencing (NGS) project between DFCI, Brigham and Women's Hospital, and Boston Children's Hospital. We developed clinical NGS pipelines and a cloud-based platform for running clinical NGS tests. We also supported ImmunoProfile, a research assay used to study and predict response to immunotherapy.