Materials for a 1-hour mini-workshop on Empirical Bayes Normal Means (EBNM), Empirical Bayes Matrix Factorization (EBMF), and multi-block extensions.
Delivered Tue 24 March at Melbourne Integrative Genomics, University of Melbourne.
Let me know if you have any suggestions for improving the materials, or that you'd want to use the materials for some other purposes (e.g. a different dataset): Jiadong Mao's email.
| Material | Format | Description |
|---|---|---|
| Workshop slides | xaringan slides | 45 min presentation + 15 min hands-on |
| EBNMF handout | HTML document | Technical reference covering EBNM, prior families, flashier, and GBCD |
| DIVAS handout | HTML document | Multi-block decomposition via DIVAS |
- Empirical Bayes shrinkage -- connecting limma, edgeR, and ashr to the EBNM framework
- The EBNM problem -- prior families, the
ebnmR package, and adaptive shrinkage - EBMF and flashier -- matrix factorization with EB priors; automatic K selection
- GBCD -- Generalized Binary Covariance Decomposition for shared/context-specific programs
- DIVAS -- Data Integration Via Analysis of Subspaces for multi-block data
The workshop uses a COVID-19 multiomics cohort (Su et al., Cell 2020) as a running example: 120 patients, Olink proteomics at two time points, severity scores 1--7.
EBNMF.md-- handout source (Markdown)DIVAS.md-- DIVAS handout sourceslides_workshop.Rmd-- slides source (xaringan/R Markdown)workshopCode.R-- hands-on R code for participantsEBNMF_guide.md-- flashier + GBCD API referenceDIVAS_guide.md-- DIVAS API reference
ebnm-- Empirical Bayes Normal Meansflashier-- Empirical Bayes Matrix FactorizationfastTopics-- Topic models / Poisson NMFDIVAS-- Multi-block decomposition
- Willwerscheid, Carbonetto & Stephens (2025). ebnm: an R package for solving the empirical Bayes normal means problem using a variety of prior families. JSS.
- Liu, Carbonetto et al. (2025). Dissecting tumor transcriptional heterogeneity from multi-tumor single-cell RNA-seq data. Nature Genetics.
- Carbonetto, Sarkar, Wang & Stephens (2021). Non-negative matrix factorization algorithms greatly improve topic model fits. arXiv.
- Carbonetto, Luo, Sarkar et al. (2023). GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership. Genome Biology.
- Sun, Marron, Le Cao & Mao (2026). DIVAS: Data Integration Via Analysis of Subspaces. bioRxiv.