软件导刊2025,Vol.24Issue(3):92-98,7.DOI:10.11907/rjdk.241201
基于LDA与snowNLP的学术造假事件主题挖掘与情感分析
Topic Mining and Sentiment Analysis of Academic Misconduct Events Based on LDA and snowNLP
摘要
Abstract
This study conducts topic mining and sentiment analysis on Bilibili video comments related to the event of students from Huazhong Agricultural University jointly reporting Professor Huang Ruofei for academic misconduct.Data collection involves scraping 4 405 comments from Bilibili using Python's Requests library,followed by text preprocessing steps such as segmentation with jieba,stop word removal,cus-tom dictionary creation,and TF-IDF text vectorization.The Latent Dirichlet Allocation(LDA)topic model is utilized to model the comments,determining the optimal number of topics as through perplexity and pyLDAvis visualization results,and identifying hot topics.The results of topic mining are categorized into three datasets,and sentiment analysis is performed using the snowNLP library to classify each comment into positive,negative,or neutral sentiment categories.Statistical analysis and visualization showcase the distribution of different sentiment catego-ries and tendencies.The study reveals multiple hot topics including justice and effort,public concern over academic misconduct,and rationali-ty and courage.The sentiment analysis uncovers diverse sentiment tendencies among the public,with the majority supporting and praising the students' reporting behavior while expressing concerns and anger about the academic environment.However,the study acknowledges certain limitations and suggests future research expand the scope by collecting and analyzing data from more social media platforms to obtain a more comprehensive understanding of public opinions and sentiment feedback.Additionally,integrating quantitative and qualitative methods could further explore the underlying reasons and mechanisms behind academic misconduct events.关键词
学术造假/LDA/snowNLP/主题挖掘/情感分析Key words
academic misconduct/LDA/snowNLP/topic mining/sentiment analysis分类
信息技术与安全科学引用本文复制引用
王智迪..基于LDA与snowNLP的学术造假事件主题挖掘与情感分析[J].软件导刊,2025,24(3):92-98,7.基金项目
吉林省教育厅高校思想政治工作专项(JJKH20241447SZ) (JJKH20241447SZ)