11 articles found
Efficient Temporal Difference Learning with Adaptive λ
1
Authors: 毕金波, 吴沧浦. Journal of Beijing Institute of Technology (EI, CAS), 1999, No. 3, pp. 251-257 (7 pages)
Aim: To find a more efficient learning method based on temporal difference learning for delayed reinforcement learning tasks. Methods: A Q learning algorithm based on truncated TD(λ), with adaptive schemes for selecting the λ value and addressed to absorbing Markov decision processes, was presented and implemented on computers. Results and Conclusion: Simulations on shortest path searching problems show that using adaptive λ in Q learning based on TTD(λ) can speed up its convergence.
Keywords: dynamic programming; delayed reinforcement learning; absorbing Markov decision processes; temporal difference learning; Q learning
Learning style and cultural differences
2
Author: 胡家浩. Sino-US English Teaching, 2007, No. 8, pp. 5-7 (3 pages)
Learning style is the most important variable that affects the success of English learning. It can both bring students' learning strengths into full play and make up for their weaknesses. The formation of learning style is related to external factors, including culture. Chinese culture differs greatly from American culture, and given these distinct cultural differences, the learning styles of Chinese and American students show clear differences.
Keywords: learning style; Chinese culture and American culture; learning style difference
INDIVIDUAL DIFFERENCES IN FOREIGN LANGUAGE TEACHING AND LEARNING (cited by: 4)
3
Author: Song Wenwei. 外语与外语教学 (CSSCI, PKU Core), 1993, No. 1, pp. 26-30 (5 pages)
Individual differences in foreign language learning have long been the concern of linguists and language teachers. Research on this subject has been carried out in schools, universities and other educational institutions, and great achievements have been made. As it is, there are many individual differences which affect the learning of foreign languages, such as intelligence, aptitude, motivation, personality, attitude,
Keywords: INDIVIDUAL DIFFERENCES IN FOREIGN LANGUAGE TEACHING AND LEARNING
Incremental Multi Step R Learning
4
Authors: 胡光华, 吴沧浦. Journal of Beijing Institute of Technology (EI, CAS), 1999, No. 3, pp. 245-250 (6 pages)
Aim: To investigate the model-free multi-step average reward reinforcement learning algorithm. Methods: By combining the R learning algorithms with the temporal difference learning (TD(λ) learning) algorithms for average reward problems, a novel incremental algorithm, called R(λ) learning, was proposed. Results and Conclusion: The proposed algorithm is a natural extension of Q(λ) learning, the multi-step discounted reward reinforcement learning algorithm, to the average reward case. Simulation results show that R(λ) learning with intermediate λ values yields a significant performance improvement over simple R learning.
Keywords: reinforcement learning; average reward; R learning; Markov decision processes; temporal difference learning
An Adaptive Strategy via Reinforcement Learning for the Prisoner's Dilemma Game (cited by: 9)
5
Authors: Lei Xue, Changyin Sun, Donald Wunsch, Yingjiang Zhou, Fang Yu. IEEE/CAA Journal of Automatica Sinica (SCIE, EI, CSCD), 2018, No. 1, pp. 301-310 (10 pages)
The iterated prisoner's dilemma (IPD) is an ideal model for analyzing interactions between agents in complex networks. It has attracted wide interest in the development of novel strategies since the success of tit-for-tat in Axelrod's tournament. This paper studies a new adaptive strategy for the IPD in different complex networks, where agents can learn and adapt their strategies through a reinforcement learning method. A temporal difference learning method is applied in designing the adaptive strategy to optimize the decision-making process of the agents. Previous studies indicated that mutual cooperation rarely emerges in the IPD. Therefore, three examples based on a square lattice network and a scale-free network are provided to show two features of the adaptive strategy. First, mutual cooperation can be achieved by a group of adaptive agents in the scale-free network, and once evolution has converged to mutual cooperation, it is unlikely to shift. Second, the adaptive strategy can earn a better payoff than other strategies in the square network. The analytical properties are discussed to verify the evolutionary stability of the adaptive strategy.
Keywords: complex network; prisoner's dilemma; reinforcement learning; temporal differences learning
Learner Beliefs of Language Learning Revisited (cited by: 2)
6
Author: Zhuangwei Huang. Sino-US English Teaching, 2006, No. 3, pp. 62-67 (6 pages)
Learner beliefs about language learning are of critical importance to the success or failure of any student's efforts to master a foreign language. In this paper, some recent research on learner beliefs about language learning is reviewed and summarized. Learner beliefs in the Chinese context are also discussed. It is suggested that it is premature to determine learner beliefs in the Chinese context and to conclude that cultural differences have a great influence on learner beliefs. More research needs to be conducted using a wider variety of research methodologies.
Keywords: learner beliefs about language learning; gap; cultural differences
Adaptive learning rate GMM for moving object detection in outdoor surveillance for sudden illumination changes (cited by: 1)
7
Authors: HOCINE Labidi, 曹伟, 丁庸, 张笈, 罗森林. Journal of Beijing Institute of Technology (EI, CAS), 2016, No. 1, pp. 145-151 (7 pages)
A dynamic learning rate Gaussian mixture model (GMM) algorithm is proposed to address the slow adaptation of GMM in moving object detection for outdoor surveillance, especially in the presence of sudden illumination changes. The GMM is mostly used for detecting objects in complex scenes for intelligent monitoring systems. To solve this problem, a mixture Gaussian model is built for each pixel in the video frame, and the learning rate of the GMM is dynamically adjusted according to the scene change measured from the frame difference. The experiments show that the proposed method, with an adaptive GMM learning rate, gives good results compared with the GMM method with a fixed learning rate. The method was tested on a certain dataset, and tests under sudden natural light changes show that it has better accuracy and a lower false alarm rate.
Keywords: object detection; background modeling; Gaussian mixture model (GMM); learning rate; frame difference
The Study of Language Learning Strategies of Non-English Majors (cited by: 1)
8
Author: Cong Zhang. Sino-US English Teaching, 2005, No. 5, pp. 36-41 (6 pages)
This paper explores, from an educational and psychological point of view, EFL college students' language learning strategies in the Chinese context. The subjects are 106 non-English majors from Hohai University at its Changzhou Campus. Two questionnaires are used to investigate the learners' language learning strategies. The study finds that students use compensation strategies most frequently, metacognitive strategies less often, and social strategies least. The findings also indicate that male and female students, and students of arts and of science and engineering, emphasize different strategies.
Keywords: language learning strategies; strategy classification; major differences; sex differences
Differences Between Children's First Language Acquisition and Adults' Second Language Acquisition
9
Author: 高嘉欣. 海外英语, 2021, No. 19, pp. 285-287 (3 pages)
There is an apparent contrast between children's first language acquisition and adults' second language acquisition, which is mainly manifested in three aspects: age difference, difference in learning process, and motivation difference. This paper analyzes these three differences in detail and draws on the results to derive pedagogical implications for second language teaching in light of the current situation.
Keywords: children's first language acquisition; adults' second language acquisition; age difference; difference in learning process; motivation difference
Computational Intelligence and Games: Challenges and Opportunities (cited by: 1)
10
Author: Simon M. Lucas. International Journal of Automation and Computing (EI), 2008, No. 1, pp. 45-57 (13 pages)
The last few decades have seen a phenomenal increase in the quality, diversity and pervasiveness of computer games. The worldwide computer games market is estimated to be worth around USD 21bn annually, and is predicted to continue to grow rapidly. This paper reviews some of the recent developments in applying computational intelligence (CI) methods to games, points out some of the potential pitfalls, and suggests some fruitful directions for future research.
Keywords: games; machine learning; evolution; temporal difference learning (TDL); neural networks; n-tuple systems
Efficient policy evaluation by matrix sketching
11
Authors: Cheng CHEN, Weinan ZHANG, Yong YU. Frontiers of Computer Science (SCIE, EI, CSCD), 2022, No. 5, pp. 97-105 (9 pages)
In reinforcement learning, policy evaluation aims to predict the long-term values of a state under a certain policy. Since high-dimensional representations are becoming more and more common in reinforcement learning, reducing the computational cost has become a significant problem for policy evaluation. Many recent works focus on adopting matrix sketching methods to accelerate least-squares temporal difference (TD) algorithms and quasi-Newton temporal difference algorithms. Among these sketching methods, the truncated incremental SVD shows better performance because it is stable and efficient. However, the convergence properties of the incremental SVD are still open. In this paper, we first show that the conventional incremental SVD algorithms could have enormous approximation errors in the worst case. Then we propose a variant of incremental SVD with better theoretical guarantees by shrinking the singular values periodically. Moreover, we employ our improved incremental SVD to accelerate least-squares TD and quasi-Newton TD algorithms. The experimental results verify the correctness and effectiveness of our methods.
Keywords: temporal difference learning; policy evaluation; matrix sketching