Detecting geomagnetic anomalies preceding earthquakes is a challenging yet promising area of research that has gained increasing attention in recent years.This study introduces a novel reconstruction-based modeling ap...Detecting geomagnetic anomalies preceding earthquakes is a challenging yet promising area of research that has gained increasing attention in recent years.This study introduces a novel reconstruction-based modeling approach enhanced by negative learning,employing a Bidirectional Long Short-Term Memory(BiLSTM)network explicitly trained to accurately reconstruct non-seismic geomagnetic signals while intentionally amplifying reconstruction errors for seismic signals.By penalizing the model for accurately reconstructing seismic anomalies,the negative learning approach effectively magnifies the differences between normal and anomalous data.This strategic differentiation enhances the sensitivity of the BiLSTM network,enabling improved detection of subtle geomagnetic anomalies that may serve as earthquake precursors.Experimental validation clearly demonstrated statistically significant higher reconstruction errors for seismic signals compared to non-seismic signals,confirmed through the Mann-Whitney U test with a p-value of 0.0035 for Root Mean Square Error(RMSE).These results provide compelling evidence of the enhanced anomaly detection capability achieved through negative learning.Unlike traditional classification-based methods,negative learning explicitly encourages sensitivity to subtle precursor signals embedded within complex geomagnetic data,establishing a robust basis for further development of reliable earthquake prediction methods.展开更多
针对信息安全课程知识推荐存在的多源行为融合不足、偏好适配针对性弱等问题,提出基于双向长短期记忆-多头注意力-学生多源行为数据融合(bidirectional long short-term memory-multi-head attention-fusion of student multi-source be...针对信息安全课程知识推荐存在的多源行为融合不足、偏好适配针对性弱等问题,提出基于双向长短期记忆-多头注意力-学生多源行为数据融合(bidirectional long short-term memory-multi-head attention-fusion of student multi-source behavior data,BiLSTM-MA-FSBD)的知识推荐方法。首先,整合学生多源行为数据,提取核心行为特征,构建涵盖动态时序与静态关联的融合特征体系;然后,设计BiLSTM网络对行为序列依赖关系进行编码,利用MA机制自适应分配行为权重,实现学习偏好的精准推断;最后,构建3层级信息安全知识图谱,量化知识点依赖关系,结合偏好匹配度进行个性化推荐。结果表明,BiLSTM-MA-FSBD方法的推荐精确率比协同过滤(collaborative filtering,CF)方法提高了26.2个百分点。该方法可以有效适配信息安全课程的教学特性与学生个性化学习需求,为解决课程知识的精准推荐问题提供了切实可行的技术方案。展开更多
股票市场的不确定性和复杂性使得股票预测成为一项具有挑战性的任务。鉴于金融文本在股票预测中的潜在价值,采用词典法和BERT双向长短期记忆模型(bidirectional encoder representations from transformers-bidirectional long short-te...股票市场的不确定性和复杂性使得股票预测成为一项具有挑战性的任务。鉴于金融文本在股票预测中的潜在价值,采用词典法和BERT双向长短期记忆模型(bidirectional encoder representations from transformers-bidirectional long short-term memory,BERT-BiLSTM)对在线财经新闻提取情感特征,构建了融合情感特征和股票交易特征的股指预测模型。实验对比了融合情感特征前后模型的预测能力,并探讨了不同模型、不同时间周期下预测能力的差异。实验结果表明,融合词典法和深度学习技术提取的情感特征均能提升各模型股指预测的准确率。LSTM模型相较其他实验模型在融合情感特征前后的股指预测上均表现较好。进一步的时间跨度分析表明,股指预测模型在较短的时间跨度上对股票指数涨跌的预测能力更强。为验证股指预测模型的实际价值,对沪深300指数的牛熊市和震荡市进行回测分析,结合LSTM模型和深度Q网络(deep Q-network,DQN)原理,对比了传统均线策略以及结合DQN强化学习算法后股指回测差异。回测结果表明,相比于单一的传统交易策略,结合传统交易策略和深度学习方法的股票指数预测模型在牛熊市及震荡市中均保证了正的夏普比例和累积收益率,并有效控制了最大回撤,显示出更强的市场适应性和盈利能力。展开更多
基金funded by the Ministry of Higher Education through Universiti Putra Malaysia(UPM)under Grant FRGS/1/2023/STG07/UPM/02/4.
文摘Detecting geomagnetic anomalies preceding earthquakes is a challenging yet promising area of research that has gained increasing attention in recent years.This study introduces a novel reconstruction-based modeling approach enhanced by negative learning,employing a Bidirectional Long Short-Term Memory(BiLSTM)network explicitly trained to accurately reconstruct non-seismic geomagnetic signals while intentionally amplifying reconstruction errors for seismic signals.By penalizing the model for accurately reconstructing seismic anomalies,the negative learning approach effectively magnifies the differences between normal and anomalous data.This strategic differentiation enhances the sensitivity of the BiLSTM network,enabling improved detection of subtle geomagnetic anomalies that may serve as earthquake precursors.Experimental validation clearly demonstrated statistically significant higher reconstruction errors for seismic signals compared to non-seismic signals,confirmed through the Mann-Whitney U test with a p-value of 0.0035 for Root Mean Square Error(RMSE).These results provide compelling evidence of the enhanced anomaly detection capability achieved through negative learning.Unlike traditional classification-based methods,negative learning explicitly encourages sensitivity to subtle precursor signals embedded within complex geomagnetic data,establishing a robust basis for further development of reliable earthquake prediction methods.
文摘股票市场的不确定性和复杂性使得股票预测成为一项具有挑战性的任务。鉴于金融文本在股票预测中的潜在价值,采用词典法和BERT双向长短期记忆模型(bidirectional encoder representations from transformers-bidirectional long short-term memory,BERT-BiLSTM)对在线财经新闻提取情感特征,构建了融合情感特征和股票交易特征的股指预测模型。实验对比了融合情感特征前后模型的预测能力,并探讨了不同模型、不同时间周期下预测能力的差异。实验结果表明,融合词典法和深度学习技术提取的情感特征均能提升各模型股指预测的准确率。LSTM模型相较其他实验模型在融合情感特征前后的股指预测上均表现较好。进一步的时间跨度分析表明,股指预测模型在较短的时间跨度上对股票指数涨跌的预测能力更强。为验证股指预测模型的实际价值,对沪深300指数的牛熊市和震荡市进行回测分析,结合LSTM模型和深度Q网络(deep Q-network,DQN)原理,对比了传统均线策略以及结合DQN强化学习算法后股指回测差异。回测结果表明,相比于单一的传统交易策略,结合传统交易策略和深度学习方法的股票指数预测模型在牛熊市及震荡市中均保证了正的夏普比例和累积收益率,并有效控制了最大回撤,显示出更强的市场适应性和盈利能力。
文摘针对现有的中文命名实体识别算法没有充分考虑实体识别任务的数据特征,存在中文样本数据的类别不平衡、训练数据中的噪声太大和每次模型生成数据的分布差异较大的问题,提出了一种以BERT-BiLSTM-CRF(Bidirectional Encoder Representations from Transformers-Bidirectional Long Short-Term Memory-Conditional Random Field)为基线改进的中文命名实体识别模型。首先在BERT-BiLSTM-CRF模型上结合P-Tuning v2技术,精确提取数据特征,然后使用3个损失函数包括聚焦损失(Focal Loss)、标签平滑(Label Smoothing)和KL Loss(Kullback-Leibler divergence loss)作为正则项参与损失计算。实验结果表明,改进的模型在Weibo、Resume和MSRA(Microsoft Research Asia)数据集上的F 1得分分别为71.13%、96.31%、95.90%,验证了所提算法具有更好的性能,并且在不同的下游任务中,所提算法易于与其他的神经网络结合与扩展。