Mingyao Li, PhD, Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania
Peter Mueller, PhD, Department of Mathematics, Department of Statistics and Data Sciences, University of Texas at Austin
Lu Mao, PhD, Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison
Snehalata Huzurbazar, PhD, Department of Biostatistics, West Virginia University
Meetings of the Eastern North American Region of the International Biometric Society (a.k.a. "ENAR meetings") are held in late March or early April each year and reflect the broad interests of the Society, including both quantitative techniques and application areas. Faculty and student presenters from the Department of Biostatistics regularly participate, giving invited talks, contributed talks, and poster presentations.
The Joint Statistical Meetings, known simply as "JSM", is the largest gathering of statisticians held annually in North America. Faculty and student presenters from the Department of Biostatistics regularly participate, giving invited talks, contributed talks, and poster presentations. Our students often receive top awards and participate in the affiliated career marketplace at the event.
Inference on Treatment Effects after Model Selection
Inferring cause-effect relationships between variables is of primary importance in many sciences. In this talk, I will discuss two approaches for making valid inference on treatment effects when a large number of covariates are present. The first approach is to perform model selection and then to deliver inference based on the selected model. If the inference ignores the randomness of the model selection process, the estimates of the parameters of interest can be severely biased. While the estimation bias in an under-fitted model is well understood, I will address a lesser-known bias that arises from an over-fitted model. The over-fitting bias can be eliminated through data splitting at the cost of statistical efficiency, and I will propose a repeated data splitting approach to mitigate the efficiency loss. The second approach concerns the existing methods for debiased inference. I will show that the debiasing approach is an extension of OLS to high dimensions, and that a careful bias analysis leads to an improvement that further controls the bias. The comparison between these two approaches provides insights into their intrinsic bias-variance trade-off, and I will show that the debiasing approach may lose efficiency in observational studies.
Last Updated On Monday, January 07, 2019 by Tang, Lu
Created On Friday, January 04, 2019