首页|期刊导航|统计与决策|超高维数据下部分线性可加分位数回归模型的变量选择

超高维数据下部分线性可加分位数回归模型的变量选择

白永昕钱曼玲田茂再

统计与决策2024，Vol.40Issue(9)：43-48,6.

统计与决策2024，Vol.40Issue(9)：43-48,6.DOI:10.13546/j.cnki.tjyjc.2024.09.007

超高维数据下部分线性可加分位数回归模型的变量选择

Variable Selection for Partial Linear Additive Quantile Regression Model Under Ultra-high-dimensional Data

白永昕 ¹钱曼玲 ²田茂再³

作者信息

1. 北京信息科技大学理学院,北京 100192
2. 墨尔本大学数学与统计学院,澳大利亚墨尔本 3010
3. 中国人民大学应用统计科学研究中心,北京 100872||新疆财经大学统计与信息学院,乌鲁木齐 830012||昌吉大学数学与数据科学学院,湖南昌吉 831100
折叠

摘要

Abstract

In ultrahigh dimensional data,on the one hand,the dimensionality of covariates may be much larger than the sam-ple size,even growing exponentially with the sample size;on the other hand,ultrahigh dimensional data are typically heteroge-neous,where the influence of covariates on the center of the conditional distribution may differ greatly from their influence on the tails,leading to complex situations such as heavy tails and outliers.This paper investigates variable selection and robust estima-tion of partial linear additive quantile regression models under the condition of divergence of covariate dimension and ultrahigh di-mension.Firstly,in order to achieve model sparsity and nonparametric smoothness,a non-convex Atan double penalty is intro-duced,and the proposed optimization problem is solved by using a quantile iterative coordinate descent algorithm;the theoretical properties of the proposed double penalty estimator are demonstrated under the selection of appropriate regularization parameters.Subsequently,the performance of the proposed method is verified through simulation studies.The simulations results indicate that the proposed method outperforms other penalty methods,especially in the case of data with heavy tails.Finally,the practicality of the proposed method is verified through empirical analysis of blood sample data from cancer screening patients.

关键词

超高维数据/分位数回归/部分线性可加/变量选择/Atan双惩罚

Key words

ultrahigh dimensional data/quantile regression/partial linear additivity/variable selection/Atan double penalty

分类

数理科学

引用本文复制引用

白永昕,钱曼玲,田茂再..超高维数据下部分线性可加分位数回归模型的变量选择[J].统计与决策,2024,40(9):43-48,6.

基金项目

北京市自然科学基金资助项目(1242005) （1242005）

北京信息科技大学校科研基金资助项目(2022XJJ31) （2022XJJ31）

统计与决策

OA北大核心CHSSCDCSSCICSTPCD

ISSN：1002-6487

访问量5

下载量0

段落导航