现代信息科技2024,Vol.8Issue(7):128-135,140,9.DOI:10.19850/j.cnki.2096-4706.2024.07.027
基于LightGBM模型的中国成人吸烟行为研究
Study of Adult Smoking Behavior in China Based on the LightGBM Model
摘要
Abstract
Using the adult tobacco survey data conducted by the World Health Organization in China in 2018,this study explores the influencing factors of adult smoking behavior.Firstly,perform data cleaning on the original data,including removing irrelevant variables,combining new variables,and other steps.Secondly,feature selection is performed on the processed dataset by combining Chi-square test,analysis of variance,and Maximal Information Coefficient(MIC).Then,it conducts modeling based on XGBoost and LightGBM algorithms,sorting and analyzing the factors affecting adult smoking behavior.Finally,based on the well performing LightGBM model,variable combination modeling is performed to further explore the characteristics of smokers.Through modeling and analysis,it is identified that adult gender,tobacco environment,attitude towards value-added tax,low tar smoke awareness,educational background,and age importance have a varying impact from strong to weak on smoking behavior.关键词
LightGBM/XGBoost/吸烟行为Key words
LightGBM/XGBoost/smoking behavior分类
信息技术与安全科学引用本文复制引用
刘忠华,卢鑫,梅文强,赵旻,胡彬彬,张轲,殷红慧..基于LightGBM模型的中国成人吸烟行为研究[J].现代信息科技,2024,8(7):128-135,140,9.基金项目
云南省烟草公司文山州公司科技计划一般项目(20235326002) (20235326002)