期刊文献+
共找到2,191篇文章
< 1 2 110 >
每页显示 20 50 100
Semantic-based query processing for relational data integration 被引量:1
1
作者 苗壮 张亚非 +2 位作者 王进鹏 陆建江 周波 《Journal of Southeast University(English Edition)》 EI CAS 2011年第1期22-25,共4页
To solve the query processing correctness problem for semantic-based relational data integration,the semantics of SAPRQL(simple protocol and RDF query language) queries is defined.In the course of query rewriting,al... To solve the query processing correctness problem for semantic-based relational data integration,the semantics of SAPRQL(simple protocol and RDF query language) queries is defined.In the course of query rewriting,all relative tables are found and decomposed into minimal connectable units.Minimal connectable units are joined according to semantic queries to produce the semantically correct query plans.Algorithms for query rewriting and transforming are presented.Computational complexity of the algorithms is discussed.Under the worst case,the query decomposing algorithm can be finished in O(n2) time and the query rewriting algorithm requires O(nm) time.And the performance of the algorithms is verified by experiments,and experimental results show that when the length of query is less than 8,the query processing algorithms can provide satisfactory performance. 展开更多
关键词 data integration relational database simple protocol and RDF query language(SPARQL) minimal connectable unit query processing
在线阅读 下载PDF
SDN Orchestration for Dynamic End-to-End Control of Data Center Multi-Domain Optical Networking 被引量:3
2
作者 LIU Lei 《China Communications》 SCIE CSCD 2015年第8期10-21,共12页
New and emerging use cases, such as the interconnection of geographically distributed data centers(DCs), are drawing attention to the requirement for dynamic end-to-end service provisioning, spanning multiple and hete... New and emerging use cases, such as the interconnection of geographically distributed data centers(DCs), are drawing attention to the requirement for dynamic end-to-end service provisioning, spanning multiple and heterogeneous optical network domains. This heterogeneity is, not only due to the diverse data transmission and switching technologies, but also due to the different options of control plane techniques. In light of this, the problem of heterogeneous control plane interworking needs to be solved, and in particular, the solution must address the specific issues of multi-domain networks, such as limited domain topology visibility, given the scalability and confidentiality constraints. In this article, some of the recent activities regarding the Software-Defined Networking(SDN) orchestration are reviewed to address such a multi-domain control plane interworking problem. Specifically, three different models, including the single SDN controller model, multiple SDN controllers in mesh, and multiple SDN controllers in a hierarchical setting, are presented for the DC interconnection network with multiple SDN/Open Flow domains or multiple Open Flow/Generalized Multi-Protocol Label Switching( GMPLS) heterogeneous domains. I n addition, two concrete implementations of the orchestration architectures are detailed, showing the overall feasibility and procedures of SDN orchestration for the end-to-endservice provisioning in multi-domain data center optical networks. 展开更多
关键词 software-defined networking(SDN) generalized multi-protocol labelswitching (GMPLS) path computationelement (PCE) data center ORCHESTRATION multi-domain optical network
在线阅读 下载PDF
Cost-Aware Multi-Domain Virtual Data Center Embedding 被引量:1
3
作者 Xiao Ma Zhongbao Zhang Sen Su 《China Communications》 SCIE CSCD 2018年第12期190-207,共18页
Virtual data center is a new form of cloud computing concept applied to data center. As one of the most important challenges, virtual data center embedding problem has attracted much attention from researchers. In dat... Virtual data center is a new form of cloud computing concept applied to data center. As one of the most important challenges, virtual data center embedding problem has attracted much attention from researchers. In data centers, energy issue is very important for the reality that data center energy consumption has increased by dozens of times in the last decade. In this paper, we are concerned about the cost-aware multi-domain virtual data center embedding problem. In order to solve this problem, this paper first addresses the energy consumption model. The model includes the energy consumption model of the virtual machine node and the virtual switch node, to quantify the energy consumption in the virtual data center embedding process. Based on the energy consumption model above, this paper presents a heuristic algorithm for cost-aware multi-domain virtual data center embedding. The algorithm consists of two steps: inter-domain embedding and intra-domain embedding. Inter-domain virtual data center embedding refers to dividing virtual data center requests into several slices to select the appropriate single data center. Intra-domain virtual data center refers to embedding virtual data center requests in each data center. We first propose an inter-domain virtual data center embedding algorithm based on label propagation to select the appropriate single data center. We then propose a cost-aware virtual data center embedding algorithm to perform the intra-domain data center embedding. Extensive simulation results show that our proposed algorithm in this paper can effectively reduce the energy consumption while ensuring the success ratio of embedding. 展开更多
关键词 virtual data CENTER EMBEDDING multi-domain cost-aware LABEL PROPAGATION
在线阅读 下载PDF
Dynamic Routing of Multiple QoS-Required Flows in Cloud-Edge Autonomous Multi-Domain Data Center Networks 被引量:1
4
作者 Shiyan Zhang Ruohan Xu +3 位作者 Zhangbo Xu Cenhua Yu Yuyang Jiang Yuting Zhao 《Computers, Materials & Continua》 SCIE EI 2024年第2期2287-2308,共22页
The 6th generation mobile networks(6G)network is a kind of multi-network interconnection and multi-scenario coexistence network,where multiple network domains break the original fixed boundaries to form connections an... The 6th generation mobile networks(6G)network is a kind of multi-network interconnection and multi-scenario coexistence network,where multiple network domains break the original fixed boundaries to form connections and convergence.In this paper,with the optimization objective of maximizing network utility while ensuring flows performance-centric weighted fairness,this paper designs a reinforcement learning-based cloud-edge autonomous multi-domain data center network architecture that achieves single-domain autonomy and multi-domain collaboration.Due to the conflict between the utility of different flows,the bandwidth fairness allocation problem for various types of flows is formulated by considering different defined reward functions.Regarding the tradeoff between fairness and utility,this paper deals with the corresponding reward functions for the cases where the flows undergo abrupt changes and smooth changes in the flows.In addition,to accommodate the Quality of Service(QoS)requirements for multiple types of flows,this paper proposes a multi-domain autonomous routing algorithm called LSTM+MADDPG.Introducing a Long Short-Term Memory(LSTM)layer in the actor and critic networks,more information about temporal continuity is added,further enhancing the adaptive ability changes in the dynamic network environment.The LSTM+MADDPG algorithm is compared with the latest reinforcement learning algorithm by conducting experiments on real network topology and traffic traces,and the experimental results show that LSTM+MADDPG improves the delay convergence speed by 14.6%and delays the start moment of packet loss by 18.2%compared with other algorithms. 展开更多
关键词 multi-domain data center networks AUTONOMOUS ROUTING
在线阅读 下载PDF
Design and development of real-time query platform for big data based on hadoop 被引量:1
5
作者 刘小利 Xu Pandeng +1 位作者 Liu Mingliang Zhu Guobin 《High Technology Letters》 EI CAS 2015年第2期231-238,共8页
This paper designs and develops a framework on a distributed computing platform for massive multi-source spatial data using a column-oriented database(HBase).This platform consists of four layers including ETL(extract... This paper designs and develops a framework on a distributed computing platform for massive multi-source spatial data using a column-oriented database(HBase).This platform consists of four layers including ETL(extraction transformation loading) tier,data processing tier,data storage tier and data display tier,achieving long-term store,real-time analysis and inquiry for massive data.Finally,a real dataset cluster is simulated,which are made up of 39 nodes including 2 master nodes and 37 data nodes,and performing function tests of data importing module and real-time query module,and performance tests of HDFS's I/O,the MapReduce cluster,batch-loading and real-time query of massive data.The test results indicate that this platform achieves high performance in terms of response time and linear scalability. 展开更多
关键词 big data massive data storage real-time query HADOOP distributed computing
在线阅读 下载PDF
Multidimensional Data Querying on Tree-Structured Overlay
6
作者 XU Lizhen WANG Shiyuan 《Wuhan University Journal of Natural Sciences》 CAS 2006年第5期1367-1372,共6页
Multidimensional data query has been gaining much interest in database research communities in recent years, yet many of the existing studies focus mainly on ten tralized systems. A solution to querying in Peer-to-Pee... Multidimensional data query has been gaining much interest in database research communities in recent years, yet many of the existing studies focus mainly on ten tralized systems. A solution to querying in Peer-to-Peer(P2P) environment was proposed to achieve both low processing cost in terms of the number of peers accessed and search messages and balanced query loads among peers. The system is based on a balanced tree structured P2P network. By partitioning the query space intelligently, the amount of query forwarding is effectively controlled, and the number of peers involved and search messages are also limited. Dynamic load balancing can be achieved during space partitioning and query resolving. Extensive experiments confirm the effectiveness and scalability of our algorithms on P2P networks. 展开更多
关键词 range query skyline query P2P indexing multi-dimensional data partition
在线阅读 下载PDF
Linked-Tree: An Aggregate Query Algorithm Based on Sliding Window over Data Stream
7
作者 YU Yaxin WANG Guoren +1 位作者 SU Dong ZHU Xinhua 《Wuhan University Journal of Natural Sciences》 CAS 2006年第5期1114-1119,共6页
How to process aggregate queries over data streams efficiently and effectively have been becoming hot re search topics in both academic community and industrial community. Aiming at the issues, a novel Linked-tree alg... How to process aggregate queries over data streams efficiently and effectively have been becoming hot re search topics in both academic community and industrial community. Aiming at the issues, a novel Linked-tree algorithm based on sliding window is proposed in this paper. Due to the proposal of concept area, the Linked-tree algorithm reuses many primary results in last window and then avoids lots of unnecessary repeated comparison operations between two successive windows. As a result, execution efficiency of MAX query is improved dramatically. In addition, since the size of memory is relevant to the number of areas but irrelevant to the size of sliding window, memory is economized greatly. The extensive experimental results show that the performance of Linked-tree algorithm has significant improvement gains over the traditional SC (Simple Compared) algorithm and Ranked-tree algorithm. 展开更多
关键词 data streams sliding window aggregate query area HOP
在线阅读 下载PDF
Supporting Various Top-k Queries over Uncertain Datasets
8
作者 LI Wenfeng FU Zufa +2 位作者 WANG Liwei LI Deyi PENG Zhiyong 《Wuhan University Journal of Natural Sciences》 CAS 2014年第1期84-92,共9页
There have been many researches and semantics in answering top-k queries on uncertain data in various applications. However, most of these semantics must consume much of their time in computing position probability. O... There have been many researches and semantics in answering top-k queries on uncertain data in various applications. However, most of these semantics must consume much of their time in computing position probability. Our approach to support various top-k queries is based on position probability distribution (PPD) sharing. In this paper, a PPD-tree structure and several basic operations on it are proposed to support various top-k queries. In addition, we proposed an approximation method to improve the efficiency of PPD generation. We also verify the effectiveness and efficiency of our approach by both theoretical analysis and experiments. 展开更多
关键词 top-k queries uncertain data position probability distribution
原文传递
A Shallow Parsing Approach to Natural Language Queries of a Database
9
作者 Richard Skeggs Stasha Lauria 《Journal of Software Engineering and Applications》 2019年第9期365-382,共18页
The performance and reliability of converting natural language into structured query language can be problematic in handling nuances that are prevalent in natural language. Relational databases are not designed to und... The performance and reliability of converting natural language into structured query language can be problematic in handling nuances that are prevalent in natural language. Relational databases are not designed to understand language nuance, therefore the question why we must handle nuance has to be asked. This paper is looking at an alternative solution for the conversion of a Natural Language Query into a Structured Query Language (SQL) capable of being used to search a relational database. The process uses the natural language concept, Part of Speech to identify words that can be used to identify database tables and table columns. The use of Open NLP based grammar files, as well as additional configuration files, assist in the translation from natural language to query language. Having identified which tables and which columns contain the pertinent data the next step is to create the SQL statement. 展开更多
关键词 NLIDB NATURAL LANGUAGE Processing dataBASE query data MINING
在线阅读 下载PDF
Management and instant query of distributed oil and gas production dynamic data
10
作者 WANG Hongliang MU Longxin +2 位作者 SHI Fugeng LIU Kaiming QIAN Yurong 《Petroleum Exploration and Development》 2019年第5期1014-1021,共8页
The multidimensional analysis engine data management platform is constructed using big data distributed storage and parallel computing,data warehouse modeling technology,realizing the optimal management and instant qu... The multidimensional analysis engine data management platform is constructed using big data distributed storage and parallel computing,data warehouse modeling technology,realizing the optimal management and instant query of distributed oil and gas production dynamic big data.The centralized management and quick response of the production data of more than 36×10^4 oil,gas and water wells is realized.Multidimensional analysis subject model of oil,gas and water well production is built to pretreat the relevant data.At the level of China National Petroleum Corporation(CNPC),the rapid analysis and applications such as oil and gas production tracking,early production warning of key oilfields,analysis of low production wells and long shutdown wells,classification of reservoir development laws have been realized,and the processing time has been shortened from 1 d to 5 s.The basic unit of oil and gas production analysis is refined from oilfield to single well,making the production management more detailed.The process can be traced step by step according to CNPC,oil field company,field,block and single well,and the oil and gas production performance of each unit can be mastered in real time. 展开更多
关键词 PRODUCTION performance big data parallel computation MULTIDIMENSIONAL analysis optimal MANAGEMENT INSTANT query early PRODUCTION WARNING
在线阅读 下载PDF
Visual Composition of Complex Queries on an Integrative Genomic and Proteomic Data Warehouse
11
作者 Francesco Pessina Marco Masseroli Arif Canakoglu 《Engineering(科研)》 2013年第10期94-98,共5页
Biomedical questions are usually complex and regard several different life science aspects. Numerous valuable and he- terogeneous data are increasingly available to answer such questions. Yet, they are dispersedly sto... Biomedical questions are usually complex and regard several different life science aspects. Numerous valuable and he- terogeneous data are increasingly available to answer such questions. Yet, they are dispersedly stored and difficult to be queried comprehensively. We created a Genomic and Proteomic Data Warehouse (GPDW) that integrates data provided by some of the main bioinformatics databases. It adopts a modular integrated data schema and several metadata to describe the integrated data, their sources and their location in the GPDW. Here, we present the Web application that we developed to enable any user to easily compose queries, although complex, on all data integrated in the GPDW. It is publicly available at http://www.bioinformatics.dei.polimi.it/GPKB/. Through a visual interface, the user is only required to select the types of data to be included in the query and the conditions on their values to be retrieved. Then, the Web application leverages the metadata and modular schema of the GPDW to automatically compose an efficient SQL query, run it on the GPDW and show the extracted requested data, enriched with links to external data sources. Performed tests demonstrated efficiency and usability of the developed Web application, and showed its and GPDW relevance in supporting answering biomedical questions, also difficult. 展开更多
关键词 SQL query COMPOSITION VISUAL Interface Integrated data Extraction data Warehousing Bioinformatics database
暂未订购
A Systematic Review of Automated Classification for Simple and Complex Query SQL on NoSQL Database
12
作者 Nurhadi Rabiah Abdul Kadir +1 位作者 Ely Salwana Mat Surin Mahidur R.Sarker 《Computer Systems Science & Engineering》 2024年第6期1405-1435,共31页
A data lake(DL),abbreviated as DL,denotes a vast reservoir or repository of data.It accumulates substantial volumes of data and employs advanced analytics to correlate data from diverse origins containing various form... A data lake(DL),abbreviated as DL,denotes a vast reservoir or repository of data.It accumulates substantial volumes of data and employs advanced analytics to correlate data from diverse origins containing various forms of semi-structured,structured,and unstructured information.These systems use a flat architecture and run different types of data analytics.NoSQL databases are nontabular and store data in a different manner than the relational table.NoSQL databases come in various forms,including key-value pairs,documents,wide columns,and graphs,each based on its data model.They offer simpler scalability and generally outperform traditional relational databases.While NoSQL databases can store diverse data types,they lack full support for atomicity,consistency,isolation,and durability features found in relational databases.Consequently,employing machine learning approaches becomes necessary to categorize complex structured query language(SQL)queries.Results indicate that the most frequently used automatic classification technique in processing SQL queries on NoSQL databases is machine learning-based classification.Overall,this study provides an overview of the automatic classification techniques used in processing SQL queries on NoSQL databases.Understanding these techniques can aid in the development of effective and efficient NoSQL database applications. 展开更多
关键词 NoSQL database data lake machine learning ACID complex query smart city
在线阅读 下载PDF
Semantic Analysis of Natural Language Queries for an Object Oriented Database
13
作者 Bentamar Hemerelain Hafida Belbachir 《Journal of Software Engineering and Applications》 2010年第11期1047-1053,共7页
This paper presents the semantic analysis of queries written in natural language (French) and dedicated to the object oriented data bases. The studied queries include one or two nominal groups (NG) articulating around... This paper presents the semantic analysis of queries written in natural language (French) and dedicated to the object oriented data bases. The studied queries include one or two nominal groups (NG) articulating around a verb. A NG consists of one or several keywords (application dependent noun or value). Simple semantic filters are defined for identifying these keywords which can be of semantic value: class, simple attribute, composed attribute, key value or not key value. Coherence rules and coherence constraints are introduced, to check the validity of the co-occurrence of two consecutive nouns in complex NG. If a query is constituted of a single NG, no further analysis is required. Otherwise, if a query covers two valid NG, it is a subject of studying the semantic coherence of the verb and both NG which are attached to it. 展开更多
关键词 query NOMINAL Group Natural Language OBJECT Oriented data Base SEMANTIC Validation
暂未订购
Desktop Data Driven Approach to Personalize Query Recommendation
14
作者 Xiao-yun Li Ying Yu 《国际计算机前沿大会会议论文集》 2017年第1期25-27,共3页
Query recommendation is an effective method to help users describe their search intentions.In a personalized system,cold-start and the data sparsity were unavoidable,which directly lead to deficient performance of per... Query recommendation is an effective method to help users describe their search intentions.In a personalized system,cold-start and the data sparsity were unavoidable,which directly lead to deficient performance of personalizing.As a significant part of a user’s personal information space,a personal computer owns lots of documents relevant to his or her interest.Therefore,desktop data was introduced to construct a user’s preference model.Furthermore,considering the variety of desktop data,relationship between search task and work task was simultaneously exploited to predict a user’s specific information need.Ten volunteers joined experiments to evaluate the potential of desktop data.A series of experiments were conducted and the results proved that desktop data greatly contributed to providing effective personalized reference words.Besides,the results demonstrated that a user’s long-term interest model performed steadier than work task context,but the most valuable words were the top-3 words extracted from the work context. 展开更多
关键词 query RECOMMENDATION DESKTOP data USER model Work TASK SEARCH TASK
在线阅读 下载PDF
A solution of spatial query processing and query optimization for spatial databases
15
作者 YUANJie XIEKun-qing +2 位作者 MAXiu-jun ZHANGMin SUNLe-bin 《重庆邮电学院学报(自然科学版)》 2004年第5期165-172,共8页
Recently, attention has been focused on spatial query language which is used to query spatial databases. A design of spatial query language has been presented in this paper by extending the standard relational databas... Recently, attention has been focused on spatial query language which is used to query spatial databases. A design of spatial query language has been presented in this paper by extending the standard relational database query language SQL. It recognizes the significantly different requirements of spatial data handling and overcomes the inherent problems of the application of conventional database query languages. This design is based on an extended spatial data model, including the spatial data types and the spatial operators on them. The processing and optimization of spatial queries have also been discussed in this design. In the end, an implementation of this design is given in a spatial query subsystem. 展开更多
关键词 空间数据库 询问语言 空间数据模型 空间操作 最优化
在线阅读 下载PDF
基于Access的电子健康记录管理数据库设计
16
作者 欧雪山 《计算机应用文摘》 2026年第3期143-145,共3页
随着信息技术的飞速发展,电子健康记录系统已成为现代医疗体系的重要基础设施,其核心功能是实现患者健康信息的系统化、高效化管理。文章以Microsoft Access数据库为基础,设计并实现了一套电子健康记录管理数据库系统。借助Access平台... 随着信息技术的飞速发展,电子健康记录系统已成为现代医疗体系的重要基础设施,其核心功能是实现患者健康信息的系统化、高效化管理。文章以Microsoft Access数据库为基础,设计并实现了一套电子健康记录管理数据库系统。借助Access平台在数据建模、界面开发与查询统计方面的灵活性优势,构建了覆盖患者基本信息、病历记录、检查结果及用药信息等核心模块的集成化管理平台。系统通过规范化的表结构设计、关系建立及窗体查询功能,支持多维度数据检索与统计分析,有效提升了患者信息管理工作的准确性和效率。实际应用表明,该系统能够显著减少人工录入错误与重复操作,为临床决策及机构管理提供及时、可靠的数据支持。 展开更多
关键词 电子健康记录 Microsoft Access 数据库设计 医疗信息管理 数据查询与统计
在线阅读 下载PDF
Optimization of RDF link traversal based query execution 被引量:2
17
作者 朱艳琴 花岭 《Journal of Southeast University(English Edition)》 EI CAS 2013年第1期27-32,共6页
Aiming at the problem that only some types of SPARQL ( simple protocal and resource description framework query language) queries can be answered by using the current resource description framework link traversal ba... Aiming at the problem that only some types of SPARQL ( simple protocal and resource description framework query language) queries can be answered by using the current resource description framework link traversal based query execution (RDF-LTE) approach, this paper discusses how the execution order of the triple pattern affects the query results and cost based on concrete SPARQL queries, and analyzes two properties of the web of linked data, missing backward links and missing contingency solution. Then three heuristic principles for logic query plan optimization, namely, the filtered basic graph pattern (FBGP) principle, the triple pattern chain principle and the seed URIs principle, are proposed. The three principles contribute to decrease the intermediate solutions and increase the types of queries that can be answered. The effectiveness and feasibility of the proposed approach is evaluated. The experimental results show that more query results can be returned with less cost, thus enabling users to develop the full potential of the web of linked data. 展开更多
关键词 web of linked data resource description framework link traversal based query execution (RDF-LTE) SPARQL query query optimization
在线阅读 下载PDF
面向营商环境评估的异构融合存储区块链查询优化方法
18
作者 李素 王俊陆 +2 位作者 陈泽 王妍 宋宝燕 《计算机科学与探索》 北大核心 2026年第3期892-904,共13页
营商环境的优劣对现代化经济高质量可持续发展具有重要战略意义。为有效提升营商环境评估的可信度与执行效能,针对现有营商区块链系统中存在的存储资源消耗过高、查询接口功能单一、可支持查询模式受限等问题,创新性地提出一种异构融合... 营商环境的优劣对现代化经济高质量可持续发展具有重要战略意义。为有效提升营商环境评估的可信度与执行效能,针对现有营商区块链系统中存在的存储资源消耗过高、查询接口功能单一、可支持查询模式受限等问题,创新性地提出一种异构融合存储区块链查询优化方法。基于营商环境区块链数据多源高维的特性,设计一种链上-链下协同的异构融合存储架构,降低整体存储开销,并为区块数据添加关系语义,实现数据关系语义增强,以支持复杂查询;构建索引机制(包含区块索引、表级位图索引、层次索引),以加速营商环境评估进行数据查询时的访问效率,丰富查询类型;根据不同索引结构的特性适配最佳营商环境评估查询场景,设计三种动态自适应查询优化算法,进一步优化了查询效率。在四类公开数据集上的实验表明,所提方法在保证可用性的前提下,显著降低了存储开销,对三种不同的查询类型具有较短的查询延迟,相较于基线方法,整体性能也有显著提升。 展开更多
关键词 区块链数据库 数据存储 关系语义 索引 查询优化
在线阅读 下载PDF
智能查询优化算法研究综述
19
作者 何家豪 王嘉辰 +3 位作者 王晓 张喜盈 李翠平 陈红 《软件学报》 北大核心 2026年第1期279-300,共22页
查询优化是数据库系统中至关重要的环节,查询优化器通过找出一条查询语句对应的最佳查询计划来减少查询执行的代价.传统优化器依赖固定规则或简单启发式算法加工并筛选候选计划.然而随着实际应用中关系模式和查询逐渐复杂,传统的查询优... 查询优化是数据库系统中至关重要的环节,查询优化器通过找出一条查询语句对应的最佳查询计划来减少查询执行的代价.传统优化器依赖固定规则或简单启发式算法加工并筛选候选计划.然而随着实际应用中关系模式和查询逐渐复杂,传统的查询优化器已经难以满足应用需求.智能查询优化算法将机器学习技术应用到查询优化领域,通过学习查询计划与复杂关系模式的特征来协助传统优化器完成查询优化.此类算法在代价模型、连接优化、计划生成和查询改写等方面都提出了创新有效的解决方案.梳理上述4类智能查询优化算法近年来的研究成果和发展脉络,并对智能查询优化未来的研究方向进行展望,希望研究者可以全面了解智能查询优化算法的研究现状,以助于其后续科研工作的开展. 展开更多
关键词 查询优化 人工智能 强化学习 数据库系统 数据管理
在线阅读 下载PDF
基于Term-Query-URL异构信息网络的查询推荐 被引量:3
20
作者 刘钰峰 李仁发 《湖南大学学报(自然科学版)》 EI CAS CSCD 北大核心 2014年第5期106-112,共7页
查询推荐是一种帮助搜索引擎更好的理解用户检索需求的方法.基于查询的上下文片段训练词汇和查询之间的语义关系,同时结合查询和URL的点击图以及查询中的序列行为构建Term Query URL异构信息网络,采用重启动随机游走(Random Walk withR... 查询推荐是一种帮助搜索引擎更好的理解用户检索需求的方法.基于查询的上下文片段训练词汇和查询之间的语义关系,同时结合查询和URL的点击图以及查询中的序列行为构建Term Query URL异构信息网络,采用重启动随机游走(Random Walk withRestart,RWR)进行查询推荐.综合利用语义信息和日志信息,提高了稀疏查询的推荐效果.基于概率语言模型构造查询的词汇向量,可以为新的查询进行查询推荐.在大规模商业搜索引擎查询日志上的实验表明本文方法相比传统的查询推荐方法性能提升约为3%~10%. 展开更多
关键词 信息检索 查询推荐 点击日志 重启动随机游走
在线阅读 下载PDF
上一页 1 2 110 下一页 到第
使用帮助 返回顶部