基于预训练模型的仇恨言论检测

Hate speech detection based on pre-trained models

导出

摘要为准确检测和识别仇恨言论,通过微调大语言模型对数据集样本进行扩充与平衡,并基于预训练模型RoBERTa构建RoBERTa-Attention-GRU-TextCNN模型,将深度学习强大的特征捕获和提取能力应用到文本序列数据的分析、挖掘中。首先通过RoBERTa模型对文本数据进行特征提取;然后利用自注意机制获取单词间的依赖关系;最后将获取到的特征矩阵输入到GRU-TextCNN层中以捕捉更深层次的语义信息和局部特征。使用TweetEval提供的2个公开的数据集来评估模型效果,实验结果表明,该模型相较于传统的仇恨言论检测模型具有更好的检测效果。 To accurately detect and identify hate speech,the dataset samples are expanded and balanced by fine-tuning the large language model.The RoBERTa-Attention-GRU-TextCNN model is constructed based on the pre-training model RoBERTa,leveraging the powerful feature capture and extraction capabilities of deep learning for the analysis and mining of text sequence data.Firstly,the RoBERTa model is used to extract features from the text data;then,the self-attention mechanism is used to obtain the dependencies between words;finally,the acquired feature matrix is input into the GRU-TextCNN layer to capture deeper semantic information and local features.Two publicly available datasets provided by TweetEval are used to evaluate the model effect,and the experimental results show that the model has a better detection effect compared to the traditional hate speech detection model.

作者林原张亚于蒙许侃林鸿飞 LIN Yuan;ZHANG Ya;YU Meng;XU Kan;LIN Hongfei(School of Public Administration and Policy,Dalian University of Technology,Dalian 116024,Liaoning,China;Faculty of Electronic Information and Electrical Engineering,Dalian University of Technology,Dalian 116024,Liaoning,China)

机构地区大连理工大学公共管理学院大连理工大学电子信息与电气工程学部

出处《山东大学学报(理学版)》北大核心 2026年第3期44-53,共10页 Journal of Shandong University(Natural Science)

基金国家自然科学基金资助项目(61976036) 国家社会科学基金资助项目(20BTQ074)。

关键词大语言模型仇恨检测 RoBERTa 预训练模型 RoBERTa-Attention-GRU-TextCNN large language model hate detection RoBERTa pre-trained model RoBERTa-Attention-GRU-TextCNN

分类号 TP391 [自动化与计算机技术—计算机应用技术]