期刊文献+
共找到3篇文章
< 1 >
每页显示 20 50 100
Deep reinforcement learning:a survey 被引量:28
1
作者 Hao-nan WANG Ning LIU +4 位作者 Yi-yun ZHANG da-wei feng feng HUANG Dong-sheng LI Yi-ming ZHANG 《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2020年第12期1726-1744,共19页
Deep reinforcement learning(RL)has become one of the most popular topics in artificial intelligence research.It has been widely used in various fields,such as end-to-end control,robotic control,recommendation systems,... Deep reinforcement learning(RL)has become one of the most popular topics in artificial intelligence research.It has been widely used in various fields,such as end-to-end control,robotic control,recommendation systems,and natural language dialogue systems.In this survey,we systematically categorize the deep RL algorithms and applications,and provide a detailed review over existing deep RL algorithms by dividing them into modelbased methods,model-free methods,and advanced RL methods.We thoroughly analyze the advances including exploration,inverse RL,and transfer RL.Finally,we outline the current representative applications,and analyze four open problems for future research. 展开更多
关键词 Reinforcement learning Deep reinforcement learning Reinforcement learning applications
原文传递
Learning to select pseudo labels: a semi-supervisedmethod for named entity recognition 被引量:5
2
作者 Zhen-zhen LI da-wei feng +1 位作者 Dong-sheng LI Xi-cheng LU 《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2020年第6期903-916,共14页
Deep learning models have achieved state-of-the-art performance in named entity recognition(NER);the good performance,however,relies heavily on substantial amounts of labeled data.In some specific areas such as medica... Deep learning models have achieved state-of-the-art performance in named entity recognition(NER);the good performance,however,relies heavily on substantial amounts of labeled data.In some specific areas such as medical,financial,and military domains,labeled data is very scarce,while unlabeled data is readily available.Previous studies have used unlabeled data to enrich word representations,but a large amount of entity information in unlabeled data is neglected,which may be beneficial to the NER task.In this study,we propose a semi-supervised method for NER tasks,which learns to create high-quality labeled data by applying a pre-trained module to filter out erroneous pseudo labels.Pseudo labels are automatically generated for unlabeled data and used as if they were true labels.Our semi-supervised framework includes three steps:constructing an optimal single neural model for a specific NER task,learning a module that evaluates pseudo labels,and creating new labeled data and improving the NER model iteratively.Experimental results on two English NER tasks and one Chinese clinical NER task demonstrate that our method further improves the performance of the best single neural model.Even when we use only pre-trained static word embeddings and do not rely on any external knowledge,our method achieves comparable performance to those state-of-the-art models on the CoNLL-2003 and OntoNotes 5.0 English NER tasks. 展开更多
关键词 Named entity recognition Unlabeled data Deep learning Semi-supervised method
原文传递
Mini-batch cutting plane method for regularized risk minimization
3
作者 Meng-long LU Lin-bo QIAO +2 位作者 da-wei feng Dong-sheng LI Xi-cheng LU 《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2019年第11期1551-1563,共13页
Although concern has been recently expressed with regard to the solution to the non-convex problem, convex optimization is still important in machine learning, especially when the situation requires an interpretable m... Although concern has been recently expressed with regard to the solution to the non-convex problem, convex optimization is still important in machine learning, especially when the situation requires an interpretable model. Solution to the convex problem is a global minimum, and the final model can be explained mathematically. Typically, the convex problem is re-casted as a regularized risk minimization problem to prevent overfitting. The cutting plane method (CPM) is one of the best solvers for the convex problem, irrespective of whether the objective function is differentiable or not. However, CPM and its variants fail to adequately address large-scale data-intensive cases because these algorithms access the entire dataset in each iteration, which substantially increases the computational burden and memory cost. To alleviate this problem, we propose a novel algorithm named the mini-batch cutting plane method (MBCPM), which iterates with estimated cutting planes calculated on a small batch of sampled data and is capable of handling large-scale problems. Furthermore, the proposed MBCPM adopts a "sink" operation that detects and adjusts noisy estimations to guarantee convergence. Numerical experiments on extensive real-world datasets demonstrate the effectiveness of MBCPM, which is superior to the bundle methods for regularized risk minimization as well as popular stochastic gradient descent methods in terms of convergence speed. 展开更多
关键词 Machine learning Optimization methods Gradient methods Cutting plane method
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部