Frontiers in Biostatistics: Tree-based Ensembling Strategies for Handling Heterogeneous Data
Maya Ramchandran Data Scientist, ZephyrAI Abstract: Adapting machine learning algorithms to better handle clustering or other partition structure within training data sets is important across a wide variety of biological applications. We first consider the task of learning prediction models when multiple training studies are available. We present a novel weighting approach for constructing tree-based ensemble […]