Data Science Zoominar: Teaching Data Science to the Masses

A conversation with Jeff Leek, PhD, Johns Hopkins University. Moderator: Rafael Irizarry. Registration required. https://dfci.zoom.us/webinar/register/WN_XscX-d21RqylhDCXzvP2-Q A recording of the talk is available on our YouTube channel.

Data Science Zoominar: COVID-19 Update

A conversation with Marc Lipsitch, PhD Harvard TH Chan School of Public Health Moderator: Rafael Irizarry Registration required: https://dfci.zoom.us/webinar/register/WN_F_ROPV-RRNyXGoXaoTWFNw A recording of the talk is available on our YouTube channel.

Data Science Zoominar: The Prevalence of Inappropriate Image Duplication in Biomedical Research Publications

A conversation with Elisabeth Bik, PhD, Microbiome & Science Integrity Consultant, Harbers Bik LLC Moderator: Rafael Irizarry, PhD We are pleased to announce a new weekly Zoominar for Data Science. Rather than a traditional seminar format, Rafael Irizarry will moderate a Q&A with invited speakers on various topics in data science. Join us for these […]

Data Science Zoominar: Massachusetts Data from the COVID-19 Response

A conversation with Gillian Haney, Director of Infectious Disease Surveillance and Informatics, Massachusetts Department of Public Health and Catherine (Katie) Brown, State Public Health Veterinarian, Massachusetts Department of Public Health Moderator: Rafael Irizarry We are pleased to announce a new weekly Zoominar for Data Science. Rather than a traditional seminar format, Rafael Irizarry will moderate […]

Data Science Zoominar: A Debate About the Severity of the COVID-19 Pandemic

A conversation with John P.A. Ioannidis, C. F. Rehnborg Professor In Disease Prevention In The School Of Medicine, Professor Of Medicine, Of Health Research And Policy (Epidemiology) And By Courtesy, Of Statistics And Of Biomedical Data Science, Stanford University Moderator: Rafael Irizarry We are pleased to announce a new weekly Zoominar for Data Science. Rather […]

Data Science Zoominar: The Latest on COVID-19 Testing: How Can Testing Help Us Reopen?

A conversation with Michael Mina, Assistant Professor Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard T.H. Chan School of Public Health Moderator: Rafael Irizarry We are pleased to announce a new weekly Zoominar for Data Science. Rather than a traditional seminar format, Rafael Irizarry will moderate a Q&A with invited speakers on various topics […]

Data Science Zoominar: A Comparison of COVID-19 Prediction Models

A conversation with Nicholas Reich, Associate Professor of Biostatistics, University of Massachusetts Amherst Moderator: Rafael Irizarry https://bit.ly/DSJuly21 We are pleased to announce a new weekly Zoominar for Data Science. Rather than a traditional seminar format, Rafael Irizarry will moderate a Q&A with invited speakers on various topics in data science. Join us for these interactive […]

Data Science Zoominar: Increasing Diversity in Data Science

A conversation with Emma Benn, DrPh Associate Professor, Center for Biostatistics and Department of Population Health Science and Policy Icahn School of Medicine at Mount Sinai Moderator: Rafael Irizarry A recording of the talk is available on our YouTube channel.

Training Sessions: InForm

Wednesday August 26 from 10:00am to 12:00PM Danny Quinn will demonstrate extracting DF/HCC clinical trials data from InForm (which is the electronic data capture system (eDC)) into SAS data sets. There will also be a discussion on the attributes of the data sets and various utilities which support the extraction process.

A New Hybrid Phase I-II-III Clinical Trial Paradigm

Frontiers in Biostatistics Seminar Tuesday September 15, 2020 at 1:00PM Eastern Time Peter F. Thall, PhD Department of Biostatistics University of Texas M.D. Anderson Cancer Center Abstract: Conventional evaluation of a new drug, 𝐴, is done in three phases. Phase I relies on toxicity to determine a “maximum tolerable dose” (MTD) of 𝐴, in phase […]

Cancer Development, Heterogeneity and Dynamics from Premalignancy to Drug Refractory Disease

Data Science Seminar September 22, 2020 2:00PM ET Ignaty Leshchiner, PhD Postdoctoral Fellow, Harvard Medical School/Brigham and Women's Hospital Zoom link: http://bit.ly/DSSept22 Abstract: Real-time study of tumor emergence and progression in patients will help predict and ultimately change the course of the patient's disease. This could be achieved by inferring genotypes of heterogeneous cell populations within […]

3D Spatial Organization Within Tumors

Data Science Seminar September 29, 2020 1:00PM ET Martin Aryee, PhD Assistant Professor of Pathology, Harvard Medical School Assistant Molecular Pathologist, Massachusetts General Hospital Zoom link: https://dfci.zoom.us/j/95524743149?pwd=SzN4cjJZUnhsNVl3dXNmZjZ1N3F4QT09 Abstract: The spatial organization of biological systems can impart additional functionality beyond that of the individual components. This is true at a range of scales – from cells […]

Marvin Zelen Symposium 2020

In December of 2019, we planned the next Marvin Zelen Symposium for March of 2020. At that point, the year ahead seemed to be a data spectacle waiting to happen. Think about all the events planned for 2020! Here’s what we wrote. The topic of this year's Marvin Zelen Symposium has to do with the […]

Constructing Confidence Interval for RMST under Group Sequential Setting

Frontiers in Biostatistics Seminar October 13, 2020 1:00PM Lu Tian, PhD Associate Professor of Biomedical Data Science in the School of Medicine Stanford University It is appealing to compared survival distributions based on restricted mean survival time (RMST), since it generates a clinically interpretable summary of the treatment effect, which can be estimated nonparametrically without […]

Computational Biology of DNA Repair in Cancer

Data Science Seminar October 15, 2020 1:00PM ET Dominik Glodzik, PhD Repare Therapeutics Zoom link: https://bit.ly/DSOct15 Abstract: Whole genome sequences contain within them signatures of mutational processes. In particular, some of the mutation signatures relate to impaired DNA-repair in cancer cells. Accurate measurement of mutation signatures reveals the role of DNA-repair deficiencies in etiology and […]

Data Science Zoominar: Data Science and Academic Leadership

Tuesday October 20, 2020 1:00PM ET Data Science and Academic Leadership A conversation with F. DuBois Bowman, PhD Dean of the University of Michigan School of Public Health Moderator: Rafael Irizarry Video available on our YouTube Channel.

Data Science Zoominar: Communicating Statistical Findings Effectively

Tuesday November 17, 2020 1:00PM ET Communicating Statistical Findings Effectively A conversation with Professor Sir David Spiegelhalter Chair, Winton Centre for Risk and Evidence Communication Centre for Mathematical Sciences Author: The Art of Statistics Moderator: Rafael Irizarry RSVP at https://bit.ly/DSNov17

Postdoctoral Open House

The Dana-Farber Cancer Institute Department of Data Science announces our first annual Postdoc Open House. Join us on Monday, December 7th at 1:00PM EST to explore postdoctoral opportunities in cancer research. Participating faculty include:  Rafael Irizarry, PhD, Professor and Department Chair X. Shirley Liu, PhD, Professor Giovanni Parmigiani, PhD, Professor Franziska Michor, PhD, Professor Lorenzo […]

DF/HCC Cancer Data Science Program &Harvard Chan Bioinformatics CoreJoint Symposium on scRNAseq Methodology

Monday, December 14, 3:00-4:30 PM ET RSVP https://bit.ly/CDSBioDec14 Speakers: Aedin Culhane, Senior Research Scientist, Dana-Farber Cancer Institute and Harvard T.H. Chan School of Public Health Isabella Grabski, PhD Student in Biostatistics, Harvard University Probabilistic gene barcodes identify cell-types in single-cell RNA-sequencing data Shannan Ho Sui, Senior Research Scientist, Harvard T.H. Chan School of Public Health […]

Data Science Zoominar: Data-Driven Policy in Puerto Rico

Tuesday December 15, 2020 1:00PM ET Data-Driven Policy in Puerto Rico A conversation with Arnaldo Cruz Director of Research and Policy of the Financial Oversight and Management Board of Puerto Rico Co-founder of ABRE Puerto Rico Moderator: Rafael Irizarry RSVP at https://bit.ly/DSDec15

Frontiers in Biostatistics: Statistical Modeling and Adjustment for Sampling Biases

Frontiers in Biostatistics Seminar January 12, 2021 1:00PM Jing Ning, PhD Associate Professor. Department of Biostatistics Division of Quantitative Sciences The University of Texas M.D. Anderson Cancer Center Register at: https://bit.ly/FIBJan12 Abstract: Bias sampling mechanisms are commonly encountered in applications where the subjects in a target population are not given an equal chance to be selected, […]

Data Science Zoominar: Vaccine Prioritization Strategies

Tuesday January 26, 2021 1:00PM ET A conversation with Daniel Larremore Assistant Professor, Department of Computer Science at University of Colorado-Boulder and BioFrontiers Institute and Kate Bubar Student, Department of Applied Mathematics, University of Colorado-Boulder Moderator: Rafael Irizarry RSVP at https://bit.ly/DSJan26 YouTube Link: https://www.youtube.com/watch?v=fJuHNNP8TLg

Frontiers in Biostatistics: Group Sequential Design Assuming Delayed Benefit

February 9, 2021 1:00PM ET Keaven Anderson, PhD Scientific AVP, Methodology Research, Biostatistics at Merck Group Sequential Design Assuming Delayed Benefit Abstract: We consider an asymptotic approach to design of group sequential trials with a potentially delayed effects. Logrank, weighted logrank tests and combination tests are of primary interest, but we also consider restricted mean […]

Data Science Zoominar: Diversity and Ethics in Genomics

Tuesday February 23, 2021 1:00PM Eastern Time A conversation with Keolu Fox, PhD Assistant Professor, Department of Anthropology, University of California, San Diego Moderator: Aedin Culhane YouTube Link: https://www.youtube.com/watch?v=VANlStOnFPY

Frontiers in Biostatistics: Distributed Statistical Learning and Inference in EHR and Other Healthcare Datasets

Frontiers in Biostatistics Seminar March 9, 2021 1:00PM Rui Duan, PhD Assistant Professor of Biostatistics Harvard TH Chan School of Public Health Distributed Statistical Learning and Inference in EHR and Other Healthcare Datasets Abstract: The growth of availability and variety of healthcare data sources has provided unique opportunities for data integration and evidence synthesis, which […]

Data Science Zoominar: The Importance of Representative Samples in Clinical Trials

Tuesday March 23, 2021 1:00PM Eastern Time A conversation with Timothy Rebbeck Vincent L. Gregory, Jr. Professor of Cancer Prevention, Epidemiology, Harvard T.H. Chan School Of Public Health Professor, Medical Oncology, Dana-Farber Cancer Institute Moderator: Rafael Irizarry YouTube Link: https://www.youtube.com/watch?v=Q4uBibxGf20

Frontiers in Biostatistics: Single-Cell RNA-Seq Data Analysis Via a Regularized Zero-Inflated Mixture Model Framework

Frontiers in Biostatistics Seminar May 11, 2021 1:00PM Jianhua Hu, PhD Professor, Biostatistics (in Medicine and in the Herbert Irving Comprehensive Cancer Center) Director, Cancer Biostatistics Program Columbia University Register at: http://bit.ly/FIBMay21 Abstract: Applications of single-cell RNA sequencing in various biomedical research areas have been blooming. This new technology provides unprecedented opportunities to study disease […]

Training Session: Efficient Phase I Clinical Trial Design

May 19, 2021 10:00am-12:00PM Eastern Time Fangxin Hong, PhD Senior Research Scientist Department of Data Science, Dana-Farber Cancer Institute Department of Biostatistics, Harvard T.H. Chan School of Public Health    

Data Science Seminar: Causal Inference Methods for Measures of Health Disparities

Thursday, October 28, 2021 11:00am Eastern Time Tengfei Li Georgetown University Causal Inference Methods for Measures of Health Disparities There is increased interest in the evaluation of health disparities between different socioeconomic groups using data from observational studies. However, in the absence of randomization, the results and conclusions may be limited to associations rather than […]

Postdoc Recruitment Day

The Dana-Farber Cancer Institute Department of Data Science announces its second annual Postdoc Recruitment Day to be held on Wednesday, November 3rd from 1-3pm EST. If you are interested in learning more about postdoctoral opportunities at Dana-Farber Cancer Institute and would like to learn about the research our faculty are conducting, please sign up for […]

Frontiers in Biostatistics: A Bayesian Phase I/II Trial Design for Immunotherapy

Tuesday, November 9, 2021 1:00pm Eastern Time Suyu Liu, PhD Associate Professor Department of Biostatistics The University of Texas MD Anderson Cancer Center A Bayesian Phase I/II Trial Design for Immunotherapy Immunotherapy is an innovative treatment approach that stimulates a patient's immune system to fight cancer. It demonstrates characteristics distinct from conventional chemotherapy and stands […]

Frontiers in Biostatistics: Cancer on the Way to Mars

Tuesday, December 7, 2021 1:00pm Eastern Time Giovanni Parmigiani, PhD Professor Harvard T.H. Chan School of Public Health Link to YouTube Video Cancer on the Way to Mars Links: https://www.nap.edu/download/26155 https://www.env.go.jp/en/chemi/rhm/basic-info/index.html https://www.nasa.gov/feature/goddard/real-martians-how-to-protect-astronauts-from-space-radiation-on-mars https://www.nasa.gov/sites/default/files/atoms/files/space_radiation_ebook.pdf https://standards.nasa.gov/standard/nasa/nasa-std-3001-vol-1 https://spaceradiation.larc.nasa.gov/nasapapers/2020/5008710.pdf

Frontiers in Biostatistics: Studies on COVID-19 and Cancer using National Real-World VA Data

Nathanael Fillmore is the Associate Director for Machine Learning and Advanced Analytics at the VA Boston Healthcare System’s Cooperative Studies Program Informatics Center. He leads a data science team focused on using machine learning and data science methods, in combination with the VA’s large clinical, genomic, and imaging databases, to generate knowledge and resources that […]

2022 DF/HCC Celebration of Early Career Investigators in Cancer Research

January 19 | 2022 | 1-4PM Eastern Time This annual symposium showcases the talent of early career investigators at the Dana-Farber/Harvard Cancer Center who work in population science, including epidemiology, biostatistics, outcomes, diversity, and cancer care delivery research, and early detection. This year, Ann Partridge, MD, MPH will be our keynote speaker. Dr. Partridge is […]

Data Science Seminar: Deciphering Tissue Microenvironment from Next Generation Sequencing Data

Friday February 4, 2022 1:00PM Eastern Time Register. Jian Hu PhD Candidate, Department of Biostatistics, Epidemiology and Informatics University of Pennsylvania ABSTRACT: The advent of high-throughput next-generation sequencing (NGS) technologies has transformed our understanding of cell biology and human disease. As NGS has been adopted earliest by the scientific community, its use has now become […]

Frontiers in Biostatistics: Early Phase Design Considerations for Oncology Drug Development in the Era of Immunotherapy and Targeted Agents

Tuesday, February 8, 2022 1:00pm Eastern Time YouTube Video Elizabeth Garrett-Mayer, PhD, FSCT Vice President Center for Research and Analytics (CENTRA) Dr. Garrett-Mayer joined ASCO in 2017 as CENTRA’s Division Director for Biostatistics and Research Data Governance and became CENTRA’s first Vice President in 2022. CENTRA leads ASCO’s research efforts, including the TAPUR Study, ASCO’s […]

Data Science Seminar: End-to-end AI for Screening Mammography

Tuesday February 15, 2022 1:00PM Eastern Time William Lotter, PhD Vice President of Machine Learning, RadNet, Inc. Chief Technology Officer & Co-Founder, DeepHealth, Inc. Register. Screening mammography has been estimated to reduce breast cancer mortality by 20-40%, but significant opportunities remain for improving access and overall quality. Artificial intelligence (AI) has the potential to deliver […]

Data Science Seminar: Spatial meshing for general Bayesian multivariate models

Thursday February 24, 2022 1:00PM Eastern Time Michele Peruzzi, PhD Postdoctoral Associate, Department of Statistical Science, Duke University Register. Abstract: In this talk, I will consider the problem of fitting Bayesian models with spatial random effects to large scale multivariate multi-type data from satellite imaging, land-based weather and air quality sensors, and citizen science, with […]

Data Science Seminar: From descriptive to predictive biology via single-cell multiomics

Monday February 28, 2022 1:00PM Eastern Time Genevieve Stein-O'Brien Instructor, Johns Hopkins University, School of Medicine Department of Oncology, Division of Biostatistics and Bioinformatics; Department of Neuroscience; and McKusick-Nathans Department of Genomic Medicine Assistant Director, Johns Hopkins University Single Cell Consortium Register. Abstract: As the single-cell field races to characterize each cell type, state, and […]

Frontiers in Biostatistics: Considerations for Extracting Real-World Evidence from Real-World Data

Tuesday, March 1, 2022 1:00pm Eastern Time Rebecca A. Hubbard, PhD Professor of Biostatistics University of Pennsylvania Perlman School of Medicine YouTube Video Abstract: Opportunities to use real-world data (RWD), including electronic health records (EHR) and medical claims data, have exploded over the past decade. The Covid-19 pandemic has provided a particularly dramatic illustration of […]

Data Science Seminar: Radiomics for Feature Extraction from Radiological Images

Friday, March 4, 2022 12:00PM Eastern Time Ani Eloyan, PhD Assistant Professor Department of Biostatistics, Brown University Register. Abstract: Cancer patients routinely undergo radiological evaluations where images of various modalities including computed tomography, positron emission tomography, and magnetic resonance images are collected for diagnosis and for evaluation of disease progression. Tumor characteristics, often referred to […]

Data Science Seminar: Engineering Protease Activity Sensors For Personalized Detection and Profiling of Cancer

Monday March 7th, 2022 1:00PM Eastern Time Ava Soleimany, PhD Senior Researcher, Biomedical Machine Learning Group at Microsoft Research, New England Abstract: Precision cancer medicine envisions a world where diagnostic and therapeutic opportunities are intelligently tailored to individual patient needs. Achieving this vision necessitates access to high quality, accurate, and individualized information about disease state. […]

2022 Marvin Zelen Symposum: Data Visualization

The growing availability of informative datasets and software tools has led to increased reliance on data visualizations across many industries, academia, and government. News organizations are increasingly embracing data journalism and including effective infographics as part of their reporting, while in research we increasingly rely on data visualization to assess data quality and describe and […]

Frontiers in Biostatistics: Tree-based Ensembling Strategies for Handling Heterogeneous Data

Maya Ramchandran Data Scientist, ZephyrAI Abstract: Adapting machine learning algorithms to better handle clustering or other partition structure within training data sets is important across a wide variety of biological applications. We first consider the task of learning prediction models when multiple training studies are available. We present a novel weighting approach  for constructing tree-based ensemble […]

Data Science Seminar: Identification of Novel Oncogenic and Neurodevelopmental Programs Using Bulk and Single-cell Sequencing Approaches

Jeremy M. Simon, Ph.D. (he/him/his) Associate Professor, Department of Genetics Co-Principal, Bioinformatics and Analytics Research Collaborative (BARC) Director, UNC Neuroscience Center Bioinformatics Core Carolina Institute for Developmental Disabilities University of North Carolina at Chapel Hill Register for Zoom Webinar. Establishing and maintaining proper transcriptional programs is central to both development and disease, and involves the […]

Training Session: Biomarkers in Cancer Research

Wednesday June 15, 2022 10:00-12:00pm Eastern Time Nabihah Tayob, PhD Assistant Professor Department of Data Science, Dana-Farber Cancer Institute Harvard Medical School Zoom registration: https://bit.ly/TSJune22  

It’s All Relative: Testing Differential Abundance in Compositional Microbiome Data

Frontiers in Biostatistics Seminar Series Tuesday September 13, 2022 1:00PM Eastern Time Yijuan Hu, Ph.D. Associate Professor Department of Biostatistics and Bioinformatics Rollins School of Public Health Emory University Register for Zoom link Abstract: Studies on the human microbiome have revealed that differences in microbial communities are associated with many human disorders such as inflammatory […]

Postdoc Open House

The Dana-Farber Cancer Institute Department of Data Science announces its third annual Postdoc Open House Day on Friday, October 14th, 12pm-7pm in person. If you are interested in learning more about postdoctoral opportunities at Dana-Farber Cancer Institute and would like to learn about the research our faculty are conducting, please sign up for this free […]