Enhancing imbalanced text classification: an overlap-based refinement approach
Authors: Sihem Nouas, Lamia Oukid, Fatima Boumahdi. Data Science and Management, 2025, Issue 4, pp. 474-484 (11 pages)
The inherent class imbalance within textual data poses a significant challenge for machine learning-based techniques, as the available data often fails to adequately represent all classes. This scarcity of instances becomes even more challenging when the classes have overlapping regions. To address these limitations, this study introduces a refinement model for textual data classification with imbalanced datasets. The proposed approach, refined classification using overlap data with bagging and genetic algorithms (ReCO-BGA), aims to refine the classification predictions through a two-tier classification process. First, a bagging model is employed, incorporating three distinct classes: majority, minority, and an additional extracted class specifically for overlapping instances. Second, the instances predicted as overlap are rectified using a genetic-based oversampling technique. To evaluate the performance of ReCO-BGA, we conducted several experiments focusing on two practical use cases: hate speech detection and sentiment analysis. The results demonstrated the effectiveness of the proposed method and showed that it outperforms state-of-the-art methods.
Keywords: Overlap; Oversampling; Imbalanced datasets; Text classification; Ensemble learning
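
The two-tier process described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how overlap instances are extracted, which base learner the bagging model uses, or how the genetic oversampling is configured. The sketch assumes documents are already encoded as numeric feature vectors, flags an instance as overlap when its nearest neighbour carries the other label, uses a toy nearest-centroid base learner, and replaces a full genetic algorithm with crossover and mutation only (no fitness-based selection). All function and class names are hypothetical.

```python
import random
from collections import Counter

def dist2(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

class NearestCentroid:
    """Toy base learner (assumption): predict the label of the closest class centroid."""
    def fit(self, X, y):
        groups = {}
        for xi, yi in zip(X, y):
            groups.setdefault(yi, []).append(xi)
        self.centroids = {lab: [sum(c) / len(pts) for c in zip(*pts)]
                          for lab, pts in groups.items()}
        return self

    def predict_one(self, x):
        return min(self.centroids, key=lambda lab: dist2(x, self.centroids[lab]))

class Bagging:
    """Majority vote over base learners trained on bootstrap resamples."""
    def __init__(self, n_estimators=15, seed=0):
        self.n_estimators = n_estimators
        self.rng = random.Random(seed)

    def fit(self, X, y):
        idx = range(len(X))
        self.models = []
        for _ in range(self.n_estimators):
            sample = [self.rng.choice(idx) for _ in idx]  # draw with replacement
            self.models.append(NearestCentroid().fit([X[i] for i in sample],
                                                     [y[i] for i in sample]))
        return self

    def predict_one(self, x):
        votes = Counter(m.predict_one(x) for m in self.models)
        return votes.most_common(1)[0][0]

def mark_overlap(X, y):
    """Assumed heuristic: relabel a point as 'overlap' if its nearest neighbour has another label."""
    labels = []
    for i, xi in enumerate(X):
        j = min((k for k in range(len(X)) if k != i), key=lambda k: dist2(xi, X[k]))
        labels.append("overlap" if y[j] != y[i] else y[i])
    return labels

def genetic_oversample(minority, target_size, rng, mutation=0.05):
    """Simplified stand-in for genetic oversampling: uniform crossover plus Gaussian mutation."""
    synthetic = list(minority)
    while len(synthetic) < target_size:
        p1, p2 = rng.sample(minority, 2)  # needs at least two minority instances
        synthetic.append([rng.choice(pair) + rng.gauss(0, mutation) for pair in zip(p1, p2)])
    return synthetic

def fit_two_tier(X, y, seed=0):
    """Tier 1: three-class bagging. Tier 2: binary bagging on oversampled data."""
    rng = random.Random(seed)
    tier1 = Bagging(seed=seed).fit(X, mark_overlap(X, y))
    majority = [x for x, lab in zip(X, y) if lab == "majority"]
    minority = [x for x, lab in zip(X, y) if lab == "minority"]
    grown = genetic_oversample(minority, len(majority), rng)
    tier2 = Bagging(seed=seed + 1).fit(
        majority + grown, ["majority"] * len(majority) + ["minority"] * len(grown))
    return tier1, tier2

def predict_two_tier(tier1, tier2, x):
    """Route instances predicted as 'overlap' to the second tier for a final label."""
    first = tier1.predict_one(x)
    return tier2.predict_one(x) if first == "overlap" else first

# Toy data: eight majority points, three minority points, one close cross-class pair.
X = [[0.0, 0.0], [0.2, 0.1], [0.1, 0.3], [0.4, 0.2], [0.3, 0.5], [0.5, 0.4],
     [1.4, 1.5], [0.2, 0.4], [3.0, 3.0], [2.8, 3.1], [1.5, 1.6]]
y = ["majority"] * 8 + ["minority"] * 3
tier1, tier2 = fit_two_tier(X, y)
print(predict_two_tier(tier1, tier2, [1.45, 1.55]))
```

Whatever the first tier decides, the final prediction is always one of the two original classes, because the overlap label only acts as a routing signal to the second tier. In practice the feature vectors would come from a text representation such as TF-IDF or embeddings, which is outside this sketch.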