意昂官网
Insitute of Mathematical Science

Applied Mathematical Seminar23: Optimal Subsampling via Predictive Inference

Seminar| Institute of Mathematical Sciences

Time:Friday, March 24th, 2023, 16:00-17:00
Location:RS408, IMS
Speaker: Haojie Ren, SJTU
AbstractIn the big data era, subsampling or sub-data selection techniques are often adopted to extract a fraction of informative individuals from the massive data. Existing subsampling algorithms focus mainly on obtaining a representative subset to achieve the best estimation accuracy under a given class of models. In this work, we consider a semi-supervised setting wherein a small or moderate sized labeled data is available in addition to a much larger sized unlabeled data. The goal is to sample from the unlabeled data with a given budget to obtain informative individuals that are characterized by their unobserved responses. We propose an optimal subsampling procedure that is able to maximize the diversity of the selected subsample and control the false selection rate (FSR) simultaneously, allowing us to explore reliable information as much as possible. The key ingredients of our method are the use of predictive inference for quantifying the uncertainty of response predictions and a reformulation of the objective into a constrained optimization problem. We show that the proposed method is asymptotically optimal in the sense that the diversity of the subsample converges to its oracle counterpart with FSR control. Numerical simulations and a real-data example validate the superior performance of the proposed strategy.





地址:上海市浦东新区华夏中路393号
邮编:201210
上海市徐汇区岳阳路319号8号楼
意昂 -【首页推荐】每天更新,游戏不断!

Copyright © 意昂平台 版权所有 沪ICP备13001436号 沪公网安备31011502006855号

意昂专业提供:意昂意昂平台意昂官网等服务,提供最新官网平台、地址、注册、登陆、登录、入口、全站、网站、网页、网址、娱乐、手机版、app、下载、欧洲杯、欧冠、nba、世界杯、英超等,界面美观优质完美,安全稳定,服务一流,意昂欢迎您。