Directory Calendar News Careers Alumni Giving

Biostatistics Dissertation Defenses

Biostatistics Dissertations


Events that appear on this page are in the Sponsoring Department of "Biostatistics" with "Dissertation Defenses" as its Category.

To add or edit events on this page, go to


Biostatistics Dissertation Defense
Xingyuan Li - TBA Biostatistics Dissertation Defense
Xingyuan Li - TBA
Fri 1/18/2019 1:00PM - 3:00PM
7139 Public Health, Peterson Seminar Room

Xingyuan Li of the Department of Biostatistics defends her dissertation on "TBA".

Fri 1/18/2019


Biostatistics Dissertation Defense

Chien-Wei Lin: "Power calculation and study design in RNA-Seq and Methyl-Seq"

Friday 4/14 3:00PM - 5:00PM
7139 Public Health, Peterson Seminar Room
Chien-Wei Lin of the Department of Biostatistics defends his dissertation on "Power calculation and study design in RNA-Seq and Methyl-Seq"

Graduate faculty of the University and all other interested parties are invited to attend.

Next generation sequencing (NGS) technology has emerged as a powerful tool in characterizing genomic profiles. Among several applications, RNA sequencing (RNA-Seq) and Methylation sequencing (Methyl-Seq) have gradually become standard tools for transcriptomic and epigenetic monitoring respectively. Although the costs of NGS experiments have constantly decreased, the high cost and bioinformatic complexity remain obstacles for many biomedical projects. Unlike earlier microarray technologies, modeling of NGS data should consider discrete count data. In addition to sample size, sequencing depth is also directly related to experimental costs. Consequently, given a total budget and a pre-specified unit experimental cost, the study design issue in RNA-Seq/Methyl-Seq is a multi-dimensional constrained optimization problem rather than a one-dimensional sample size calculation in a traditional hypothesis setting. In the first part of this dissertation, we proposed a statistical framework, namely “RNASeqDesign”, to utilize pilot data for power calculation and study design of RNA-Seq experiments. The approach was based on a mixture model fitting of the p-value distribution from pilot data and a parametric bootstrap procedure to infer genome-wide power for optimal sample size and sequencing depth. We further illustrated five practical study design tasks for practitioners. We performed simulations and real data applications to evaluate performance and compare to existing methods.
In the second part of this dissertation, we proposed another statistical framework, namely “MethylSeqDesign”, specifically for Methyl-Seq data. There were mainly two challenges. Firstly, the statistical modeling for Methyl-Seq data required a powerful statistical test using beta-binomial model for conducting power calculation. Secondly, there is an extremely high number of CpG sites (about 30M) in the human genome, which results in many CpG sites with very shallow coverage. We focused on a region-/capture-based method which produced more counts in a region/window such that power calculation became feasible.
Public health significance: As sequencing costs keep dropping, RNA-Seq and Methyl-Seq experiments will become more prevalent and more projects with large sample size will be expected. We believe our work will provide practical guidance for future study design to understand disease mechanism and improve disease diagnosis and treatment.

Last Updated On Friday, July 07, 2017 by Valenti, Renee Nerozzi
Created On Tuesday, March 07, 2017

Questions and Submissions

The schedule of dissertation defenses in the Department of Biostatistics is maintained by:

Renee Valenti


Share the details for your defense to ensure it is displayed here as well as on the department calendar, Pitt Public Health's calendar, Weekly Update, and LCD screens.

Defenses by department

© 2018 by University of Pittsburgh Graduate School of Public Health

Login  |  Sitemap