Events that appear on this page are in the Sponsoring Department of "Biostatistics" with "Dissertation Defenses" as its Category.
To add or edit events on this page, go to http://publichealth.pitt.edu/submit
Zhe Sun of the Department of Biostatistics defends her dissertation on “Novel Statistical Methods in Analyzing Single Cell Sequencing Data”.
Committee Chairpersons: Ying Ding, PhD, Department of Biostatistics and Wei Chen, PhD, Department of Pediatrics
Kong Chen, PhD, Department of Medicine
Ming Hu, PhD, Department of Quantitative Health Sciences, Cleveland Clinic
Yongseok Park, PhD, Department of Biostatistics
Graduate faculty of the University and all other interested parties are invited to attend
Understanding of biological systems requires the knowledge of their individual components. Single cell RNA sequencing (scRNA-Seq) becomes a revolutionary tool to investigate cell-to-cell transcriptomic heterogeneity, which cannot be obtained in population-averaged measurements such as the bulk RNA-Seq. The newly developed droplet-based system enables parallel processing with digital counting of thousands of single cells in a single experiment, leading to the discovery of novel cell types which facilitates newly biological discoveries. This dissertation focuses on developing novel statistical methods for analyzing droplet-based scRNA-Seq data, which includes clustering methods to identify cell types from single or multiple individuals, and a joint clustering approach to simultaneously analyze paired data from scRNA-Seq and Cellular Indexing of Transcriptomes and Epitopes by sequencing (CITE-Seq), a state-of-art technology that allows the detection of cell surface proteins and transcriptome profiling within the same cell simultaneously.
In the first part of this dissertation, I developed DIMM-SC, a Dirichlet mixture model which explicitly models the raw UMI count for clustering droplet-based scRNA-Seq data and produces cluster membership with uncertainties. Both simulation studies and real data applications demonstrated that overall, DIMM-SC achieves substantially improved clustering accuracy and much lower clustering variability compared to other existing clustering methods. In the second part, I developed BAMM-SC, a novel Bayesian hierarchical Dirichlet mixture model to cluster droplet-based scRNA-Seq data from population studies. BAMM-SC takes raw count data as input and accounts for data heterogeneity and batch effect among multiple individuals in a unified Bayesian hierarchical model framework. Extensive simulation studies and applications to multiple in house experimental scRNA-seq datasets using blood, lung and skin cells from humans or mice demonstrated that BAMM-SC outperformed existing clustering methods with considerable improved clustering accuracy, particularly in the presence of heterogeneity among individuals. In the third part, I developed RE-DIMM-SC, a novel random effects model that jointly cluster the paired data from scRNA-seq and CITE-Seq simultaneously. Simulations and analysis of in-house real data sets were performed, which successfully demonstrated the validity and advantages of our method in helping people understand the heterogeneity and dynamics of various cell populations in complex multicellular tissue or organs.
PUBLIC HEALTH SIGNIFICANCE: Recent droplet-based single cell sequencing technology and its extensions have brought revolutionary insights to the understanding of cell heterogeneity and molecular processes at single cell resolution. I believe the proposed statistical approaches in this thesis for single cell data will improve the identification and characterization of cell subtypes from heterogeneous tissues, which is essential to fully understand cell identity and cell function.
Last Updated On Wednesday, December 4, 2019 by Valenti, Renee Nerozzi
Created On Monday, July 8, 2019
Md Tanbin Rahman of the Department of Biostatistics defends his dissertation on "Classification and Clustering for RNA-seq Data with Variable Selection".
Shu Wang of the Department of Biostatistics defends her dissertation on "Clustering Methods with Variable Selection for Data with Mixed Variable Types of Limits of Detection".
Di Zhang of the Department of Biostatistics defends her dissertation on “Inference on Win Ratio for Clustered Semi-Competing Risk Data”.
Li Zhu of the Department of Biostatistics defends her dissertation on "Bayesian variable selection model and differential co-expression network analysis for multi-omics data integration".
Xingyuan Li of the Department of Biostatistics defends her dissertation on "Modeling Exposure-Time-Response Association in the Presence of Competing Risks".
Christopher Keener of the Department of Biostatistics defends his dissertation on "Power and Sample Size Determination for Stepped Wedge Cluster Randomized Trials".
Joanne Beer of the Department of Biostatistics defends her dissertation on "Predicting Clinical Variables from Neuroimages Using Fused Sparse Group Lasso".
John Pleis of the Department of Biostatistics defends his dissertation on "Mixtures of Discrete and Continuous Variables: Considerations for Dimension Reduction".
Zhou "Ark" Fang of the Department of Biostatistics defends his dissertation on "Integration and Missing Data Handling in Multiple Omics Studies".
The schedule of dissertation defenses in the Department of Biostatistics is maintained by...
Share the details for your defense to ensure it is displayed here as well as on the department calendar, Pitt Public Health's calendar, Weekly Update, and LCD screens.
Follow the links below to view defenses in each department: