4 articles found
Word Embeddings and Semantic Spaces in Natural Language Processing (cited by: 2)
1
Author: Peter J. Worth. International Journal of Intelligence Science, 2023, No. 1, pp. 1-21 (21 pages)
One of the critical hurdles, and breakthroughs, in the field of Natural Language Processing (NLP) in the last two decades has been the development of techniques for text representation that solve the so-called curse of dimensionality, a problem that plagues NLP in general because the feature set for learning starts as a function of the size of the language in question, typically upwards of hundreds of thousands of terms. As such, much of the research and development in NLP in the last two decades has gone into finding and optimizing solutions to this problem, which is effectively one of feature selection. This paper looks at the development of these techniques, which leverage a variety of statistical methods resting on linguistic theories advanced in the middle of the last century, namely the distributional hypothesis, which suggests that words found in similar contexts generally have similar meanings. In this survey paper we look at the development of some of the most popular of these techniques from a mathematical as well as a data structure perspective, from Latent Semantic Analysis to Vector Space Models to their more modern variants, which are typically referred to as word embeddings. In this review of algorithms such as Word2Vec, GloVe, ELMo and BERT, we explore the idea of semantic spaces more generally, beyond their applicability to NLP.
Keywords: Natural Language Processing; Vector Space Models; semantic spaces; Word Embeddings; Representation Learning; Text Vectorization; Machine Learning; Deep Learning
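As a rough illustration of the distributional hypothesis described in the abstract above, the following minimal sketch builds count-based context vectors and compares them with cosine similarity. It is not taken from the paper: the toy corpus, the window size of 2, and the helper names `cooccurrence_vectors` and `cosine` are all assumptions made for illustration.

```python
from collections import Counter, defaultdict
from math import sqrt


def cooccurrence_vectors(sentences, window=2):
    """Map each word to a Counter of the words seen within `window` positions of it."""
    vectors = defaultdict(Counter)
    for sentence in sentences:
        tokens = sentence.lower().split()
        for i, word in enumerate(tokens):
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:  # a word is not its own context
                    vectors[word][tokens[j]] += 1
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse count vectors stored as Counters."""
    dot = sum(count * v[key] for key, count in u.items() if key in v)
    norm_u = sqrt(sum(x * x for x in u.values()))
    norm_v = sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Toy corpus, invented for this example (not from the paper).
corpus = [
    "the cat drinks milk",
    "the dog drinks water",
    "the cat chases the mouse",
    "the dog chases the ball",
    "stocks fell on the market",
]

vectors = cooccurrence_vectors(corpus)
sim_cat_dog = cosine(vectors["cat"], vectors["dog"])
sim_cat_market = cosine(vectors["cat"], vectors["market"])

# "cat" and "dog" share more contexts than "cat" and "market" do.
print(sim_cat_dog > sim_cat_market)
```

Raw counts like these are dominated by frequent words such as "the"; the methods the survey covers, from Latent Semantic Analysis to learned embeddings, can be read as ways of reweighting and compressing such vectors into a lower-dimensional semantic space.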
Explanatory Multi-Scale Adversarial Semantic Embedding Space Learning for Zero-Shot Recognition
2
Author: Huiting Li. Open Journal of Applied Sciences, 2022, No. 3, pp. 317-335 (19 pages)
The goal of zero-shot recognition is to classify classes that were never seen during training, which requires building a bridge between seen and unseen classes through a semantic embedding space. Semantic embedding space learning therefore plays an important role in zero-shot recognition. In existing work, the semantic embedding space is mainly defined by user-defined attribute vectors. However, the discriminative information contained in a user-defined attribute vector is limited. In this paper, we propose to learn an extra latent attribute space automatically to produce a more generalized and discriminative semantic embedding space. To prevent the bias problem, both the user-defined attribute vector and the latent attribute space are optimized by adversarial learning with auto-encoders. We also propose to reconstruct semantic patterns produced by explanatory graphs, which makes the semantic embedding space more sensitive to useful semantic information and less sensitive to useless information. The proposed method is evaluated on the AwA2 and CUB datasets. The results show that it achieves superior performance.
Keywords: Zero-Shot Recognition; semantic Embedding Space; Adversarial Learning; Explanatory Graph
Learning the spatial co-occurrence for browsing interests extraction of domain users on public map service platforms (cited by: 3)
3
Authors: Guangsheng Dong, Rui Li, Huayi Wu, Wei Huang, Hongping Zhang, Vincent Tao, Quan Liu. Geo-Spatial Information Science (CSCD), 2024, No. 2, pp. 455-474 (20 pages)
Public Map Service Platforms (PMSPs) provide embedded map services in domains such as forests and rivers. Users from different domains (Domain Users) prefer specific spatial features, and extracting the Browsing Interests of Domain Users (BIDUs) can help elucidate users' access intentions and provide suitable recommendations. Previous research has found that the access frequency of spatial features is an indicator of users' browsing interests; however, high-frequency spatial features are sparsely distributed, resulting in inaccurate extraction of browsing interests. Our objective is to model the spatial co-occurrence of spatial features and employ BIDU extraction to address this limitation. First, to extract spatial features in tiles, we proposed a k-nearest neighbor method for Point-of-Interest (POI) extraction and a template-based method for Land Uses/Land Covers extraction. Then, we developed a word2vec model to construct a POI semantic space that quantifies spatial co-occurrence, and employed multi-domain user classification to verify its effectiveness. Finally, a combined word2vec and singular value decomposition model is proposed to perform topic extraction as a representation of BIDUs. Compared with the baseline models, the proposed model integrates spatial co-occurrence from massive POIs to achieve high-accuracy BIDU extraction. Our findings can help construct domain user profiles and support the development of intelligent PMSPs.
Keywords: Browsing interest extraction; spatial co-occurrence; Point-of-Interest (POI) semantic space; word2vec; Public Map Service Platform (PMSP)
Semantic-Oriented Knowledge Transfer for Review Rating (cited by: 1)
4
Authors: 王波, 张宁, 林泉, 陈松灿, 李玉华. Tsinghua Science and Technology (SCIE, EI, CAS), 2010, No. 6, pp. 633-641 (9 pages)
With the rapid development of Web 2.0, more and more people are sharing their opinions about online products, so a large amount of product review data is available. However, it is difficult to compare products directly using ratings, because many ratings are based on different scales or are missing altogether. This paper addresses the following question: given textual reviews, how can we automatically determine the semantic orientations of reviewers and then rank different items? Because many reviews lack ratings, it is difficult to collect sufficient rating data for certain categories of products (e.g., movies), but it is easier to find rating data in a different but related category (e.g., books). We refer to this problem as transfer rating, and try to train a better ranking model for items in the category of interest with the help of rating data from the related category. Specifically, we developed a ranking-oriented method called TRate for determining semantic orientations and ranking different items, and formulated it as a regularized algorithm for rating knowledge transfer that bridges the two related categories via a shared latent semantic space. Tests on the Epinion dataset verified its effectiveness.
Keywords: review rating; latent semantic space; transfer rating