数学科学研究所
Insitute of Mathematical Science

Applied Mathematical Seminar 54: Distribution-free prediction bands for clustered data with missing responses

Seminar| Institute of Mathematical Sciences

Time: WednesdayDecember 13th, 2023 , 16:00-17:00

LocationRS518, IMS

Speaker: Yanlin Tang, East China Normal University

AbstractExisting methods for missing clustered data often rely on strong model assumptions and are therefore prone to model misspecification. We construct prediction bands for the whole trajectories of new subjects based on the conformal inference, yielding covariate-dependent prediction bands with coverage guarantees in finite samples, without making any assumptions about model specification and within-cluster dependency structure. We first reduce the clustered data into independent cross-sectional data by subsampling, then propose three weighted conformal methods to produce prediction regions. To make use of the correlation information of the clustered data, we repeat the subsampling and conformal inference, to produce an integrated prediction region by combining the dependent p-values. Among the three proposed methods, the weighted CD-split method yields the smallest prediction region by converging to the highest density set, and provides asymptotic conditional coverage guarantees for each given subject. Simulations show that our methods have excellent finite-sample behavior under different complex error distributions compared to other alternatives. The practical use is demonstrated in the motivating serum cholesterol data and CD4+ cell data sets.


地址:上海市浦东新区华夏中路393号
邮编:201210
上海市徐汇区岳阳路319号8号楼
200031(岳阳路校区)