期刊文献+
共找到692篇文章
< 1 2 35 >
每页显示 20 50 100
A method for improving graph queries processing using positional inverted index (P.I.I) idea in search engines and parallelization techniques 被引量:2
1
作者 Hamed Dinari Hassan Naderi 《Journal of Central South University》 SCIE EI CAS CSCD 2016年第1期150-159,共10页
The idea of positional inverted index is exploited for indexing of graph database. The main idea is the use of hashing tables in order to prune a considerable portion of graph database that cannot contain the answer s... The idea of positional inverted index is exploited for indexing of graph database. The main idea is the use of hashing tables in order to prune a considerable portion of graph database that cannot contain the answer set. These tables are implemented using column-based techniques and are used to store graphs of database, frequent sub-graphs and the neighborhood of nodes. In order to exact checking of remaining graphs, the vertex invariant is used for isomorphism test which can be parallel implemented. The results of evaluation indicate that proposed method outperforms existing methods. 展开更多
关键词 graph query processing frequent subgraph graph mining data mining positional inverted index
在线阅读 下载PDF
A Processing Approach for Event-Based Location Aware Queries in Hybrid Wireless Sensor Networks
2
作者 HONG Liang,LU Yansheng College of Computer Science and Technology,Huazhong University of Science and Technology,Wuhan 430074,Hubei,China 《Wuhan University Journal of Natural Sciences》 CAS 2009年第4期327-332,共6页
In hybrid wireless sensor networks, sensor mobility causes the query areas to change dynamically. Aiming at the problem of inefficiency in processing the data aggregation queries in dynamic query areas, this paper pro... In hybrid wireless sensor networks, sensor mobility causes the query areas to change dynamically. Aiming at the problem of inefficiency in processing the data aggregation queries in dynamic query areas, this paper proposes a processing approach for event-based location aware queries (ELAQ), which includes query dissemination algorithm, maximum distance projection proxy selection algorithm, in-network query propagation, and aggregation algorithm. ELAQs are triggered by the events and the query results are dependent on mobile sensors' location, which are the characteristics of ELAQ model. The results show that compared with the TinyDB query processing approach, ELAQ processing approach increases the accuracy of the query result and decreases the query response time. 展开更多
关键词 query processing wireless sensor network MOBILITY data aggregation EVENT
原文传递
Supporting Various Top-k Queries over Uncertain Datasets
3
作者 LI Wenfeng FU Zufa +2 位作者 WANG Liwei LI Deyi PENG Zhiyong 《Wuhan University Journal of Natural Sciences》 CAS 2014年第1期84-92,共9页
There have been many researches and semantics in answering top-k queries on uncertain data in various applications. However, most of these semantics must consume much of their time in computing position probability. O... There have been many researches and semantics in answering top-k queries on uncertain data in various applications. However, most of these semantics must consume much of their time in computing position probability. Our approach to support various top-k queries is based on position probability distribution (PPD) sharing. In this paper, a PPD-tree structure and several basic operations on it are proposed to support various top-k queries. In addition, we proposed an approximation method to improve the efficiency of PPD generation. We also verify the effectiveness and efficiency of our approach by both theoretical analysis and experiments. 展开更多
关键词 top-k queries uncertain data position probability distribution
原文传递
Semantic-based query processing for relational data integration 被引量:1
4
作者 苗壮 张亚非 +2 位作者 王进鹏 陆建江 周波 《Journal of Southeast University(English Edition)》 EI CAS 2011年第1期22-25,共4页
To solve the query processing correctness problem for semantic-based relational data integration,the semantics of SAPRQL(simple protocol and RDF query language) queries is defined.In the course of query rewriting,al... To solve the query processing correctness problem for semantic-based relational data integration,the semantics of SAPRQL(simple protocol and RDF query language) queries is defined.In the course of query rewriting,all relative tables are found and decomposed into minimal connectable units.Minimal connectable units are joined according to semantic queries to produce the semantically correct query plans.Algorithms for query rewriting and transforming are presented.Computational complexity of the algorithms is discussed.Under the worst case,the query decomposing algorithm can be finished in O(n2) time and the query rewriting algorithm requires O(nm) time.And the performance of the algorithms is verified by experiments,and experimental results show that when the length of query is less than 8,the query processing algorithms can provide satisfactory performance. 展开更多
关键词 data integration relational database simple protocol and RDF query language(SPARQL) minimal connectable unit query processing
在线阅读 下载PDF
Processing Constrained K Closest Pairs Query in Spatial Databases 被引量:1
5
作者 LIU Xiaofeng LIU Yunsheng XIAO Yingyuan 《Wuhan University Journal of Natural Sciences》 EI CAS 2006年第3期543-546,共4页
In this paper, constrained K closest pairs query is introduced, wbich retrieves the K closest pairs satisfying the given spatial constraint from two datasets. For data sets indexed by R trees in spatial databases, thr... In this paper, constrained K closest pairs query is introduced, wbich retrieves the K closest pairs satisfying the given spatial constraint from two datasets. For data sets indexed by R trees in spatial databases, three algorithms are presented for answering this kind of query. Among of them, two-phase Range+Join and Join+Range algorithms adopt the strategy that changes the execution order of range and closest pairs queries, and constrained heap-based algorithm utilizes extended distance functions to prune search space and minimize the pruning distance. Experimental results show that constrained heap-base algorithm has better applicability and performance than two-phase algorithms. 展开更多
关键词 spatial databases query processing R-TREE closest pairs query constrained closest pairs query
在线阅读 下载PDF
Monitoring Median Queries over Moving Objects
6
作者 许浒 卢炎生 李支成 《Journal of Southwest Jiaotong University(English Edition)》 2010年第4期326-332,共7页
The k-median problem has attracted a number of researchers. However,few of them have considered both the dynamic environment and the issue of accuracy. In this paper,a new type of query is studied,called continuous me... The k-median problem has attracted a number of researchers. However,few of them have considered both the dynamic environment and the issue of accuracy. In this paper,a new type of query is studied,called continuous median monitoring (CMM) query. It considers the k-median problem under dynamic environment with an accuracy guarantee. A continuous group nearest neighbor based (CGB) algorithm and an average distance medoid (ADM) algorithm are proposed to solve the CMM problem. ADM is a hill climbing schemed algorithm and achieves a rapid converging speed by checking only qualified candidates. Experiments show that ADM is more efficient than CGB and outperforms the classical PAM (partitioning around medoids) and CLARANS (clustering large applications based on randomized search) algorithms with various parameter settings. 展开更多
关键词 Spatial databases Query processing Nearest neighbor query k-Median problem
在线阅读 下载PDF
An Optimized Labeling Scheme for Reachability Queries
7
作者 Xian Tang Ziyang Chen +3 位作者 Haiyan Zhang Xiang Liu Yunyu Shi Asad Shahzadi 《Computers, Materials & Continua》 SCIE EI 2018年第5期267-283,共17页
Answering reachability queries is one of the fundamental graph operations.Existing approaches either accelerate index construction by constructing an index that covers only partial reachability relationship,which may ... Answering reachability queries is one of the fundamental graph operations.Existing approaches either accelerate index construction by constructing an index that covers only partial reachability relationship,which may result in performing cost traversing operation when answering a query;or accelerate query answering by constructing an index covering the complete reachability relationship,which may be inefficient due to comparing the complete node labels.We propose a novel labeling scheme,which covers the complete reachability relationship,to accelerate reachability queries processing.The idea is to decompose the given directed acyclic graph(DAG)G into two subgraphs,G1 and G2.For G1,we propose to use topological labels consisting of two integers to answer all reachability queries.For G2,we construct 2-hop labels as existing methods do to answer queries that cannot be answered by topological labels.The benefits of our method lie in two aspects.On one hand,our method does not need to perform the cost traversing operation when answering queries.On the other hand,our method can quickly answer most queries in constant time without comparing the whole node labels.We confirm the efficiency of our approaches by extensive experimental studies using 20 real datasets. 展开更多
关键词 DAG COMPUTING detection reachability queries processing
在线阅读 下载PDF
A Shallow Parsing Approach to Natural Language Queries of a Database
8
作者 Richard Skeggs Stasha Lauria 《Journal of Software Engineering and Applications》 2019年第9期365-382,共18页
The performance and reliability of converting natural language into structured query language can be problematic in handling nuances that are prevalent in natural language. Relational databases are not designed to und... The performance and reliability of converting natural language into structured query language can be problematic in handling nuances that are prevalent in natural language. Relational databases are not designed to understand language nuance, therefore the question why we must handle nuance has to be asked. This paper is looking at an alternative solution for the conversion of a Natural Language Query into a Structured Query Language (SQL) capable of being used to search a relational database. The process uses the natural language concept, Part of Speech to identify words that can be used to identify database tables and table columns. The use of Open NLP based grammar files, as well as additional configuration files, assist in the translation from natural language to query language. Having identified which tables and which columns contain the pertinent data the next step is to create the SQL statement. 展开更多
关键词 NLIDB NATURAL LANGUAGE processing DATABASE QUERY Data MINING
在线阅读 下载PDF
Storage optimization for query processing over data streams
9
作者 唐向红 《Journal of Chongqing University》 CAS 2010年第2期79-92,共14页
A defining characteristic of continuous queries over on-line data streams,possibly bounded by sliding windows,is the potentially infinite and time-evolving nature of their inputs and outputs.For different update patte... A defining characteristic of continuous queries over on-line data streams,possibly bounded by sliding windows,is the potentially infinite and time-evolving nature of their inputs and outputs.For different update patterns of continuous queries,suitable data structures bring great query processing efficiency.In this paper,we proposed a data structure suitable for weak nonmonotonic update pattern in which the lifetime of each tuple is known at generation time,but the length of lifetime is not necessarily the same.The new data structure combined the ladder queue with the feature of weak non-monotonic update pattern.The experiment results show that the new data structure performs much better than the traditional calendar queue in many cases. 展开更多
关键词 calendar queue ladder queue query processing sliding windows
在线阅读 下载PDF
Efficient Pr-Skyline Query Processing and Optimization in Wireless Sensor Networks
10
作者 Jianzhong Li Shuguang Xiong 《Wireless Sensor Network》 2010年第11期838-849,共12页
As one of the commonly used queries in modern databases, skyline query has received extensive attention from database research community. The uncertainty of the data in wireless sensor networks makes the corresponding... As one of the commonly used queries in modern databases, skyline query has received extensive attention from database research community. The uncertainty of the data in wireless sensor networks makes the corresponding skyline uncertain and not unique. This paper investigates the Pr-Skyline problem, i.e., how to compute the skyline with the highest existence probability in a computational and energy-efficient way. We formulate the problem and prove that it is NP-Complete and cannot be approximated in a given expression. However, the proposed algorithm SKY-SEARCH with pruning techniques can guarantee the computational efficiency given relatively large input size, while the filter-based distributed optimization strategy significantly reduces the transmission cost and the required storage space of the sensor nodes. Extensive experiments verify the efficiency and scalability of SKY-SEARCH and the distributed optimizing strategy. 展开更多
关键词 Wireless Sensor Network QUERY processing UNCERTAIN DATA PROBABILISTIC DATA SKYLINE QUERY
在线阅读 下载PDF
Distributed location-based query processing on large volumes of moving items
11
作者 JEON Se gil LEE Chung woo +2 位作者 NAH Yunmook KIM Moon hae HAN Ki joon 《重庆邮电学院学报(自然科学版)》 2004年第5期101-107,共7页
Recently, new techniques to efficiently manage current and past location information of moving objects have received significant interests in the area of moving object databases and location based service systems. In ... Recently, new techniques to efficiently manage current and past location information of moving objects have received significant interests in the area of moving object databases and location based service systems. In this paper, we exploit query processing schemes for location management systems, which consist of multiple data processing nodes to handle massive volume of moving objects such as cellular phone users. To show the usefulness of the proposed schemes, some experimental results showing performance factors regarding distributed query processing are explained. In our experiments, we use two kinds of data set: one is generated by the extended GSTD simulator and another is generated by the real time data generator which generates location sensing reports of various types of users having different movement patterns. 展开更多
关键词 TMO 定位数据服务 GALIS 基础定位处理
在线阅读 下载PDF
时空数据查询技术研究综述 被引量:1
12
作者 孟祥福 翁雪 徐永杰 《计算机科学与探索》 北大核心 2025年第8期2001-2023,共23页
随着现代信息技术的快速发展与应用,时空数据的规模迅速增长。这些数据呈现出海量聚集、高维异构以及动态复杂等特点。近年来,以时空数据为背景的时空查询技术得到广泛的研究和应用,如何有效地存储、管理和查询这些数据成为了研究的重... 随着现代信息技术的快速发展与应用,时空数据的规模迅速增长。这些数据呈现出海量聚集、高维异构以及动态复杂等特点。近年来,以时空数据为背景的时空查询技术得到广泛的研究和应用,如何有效地存储、管理和查询这些数据成为了研究的重点。对时空数据的相关查询技术进行综述,从时空数据相关基本概念入手,系统阐述了当前主流的时空查询处理模式,涵盖了范围查询、K近邻查询、反K近邻查询等多种类型;介绍了不同的时空索引技术,包括基于轨迹的索引结构、基于抽样的索引以及其他创新的索引方法;分析了结合其他技术的查询方法,主要包括时空-文本查询、语义近似轨迹查询、并行和分布式查询等,这些技术不仅提升了时空查询的多样性和准确性,还能有效地处理大规模时空数据。展望了时空查询技术的未来发展方向,包括查询结果的可视化展示、隐私保护以及结合机器学习的新型索引结构,为时空数据的高效利用提供了新的思路和挑战。 展开更多
关键词 时空数据 查询处理 索引技术 时空-文本 语义近似 分布式
在线阅读 下载PDF
机器学习赋能的多维数据查询处理研究综述 被引量:4
13
作者 马超红 郝新丽 +1 位作者 孟小峰 张旭康 《计算机学报》 北大核心 2025年第1期100-123,共24页
多维数据的查询和处理在数据库中普遍存在。高效的多维数据查询处理,一方面依赖于精细的索引结构,例如R-tree、KD-tree等被广泛应用;另一方面,也有诸多工作探索利用硬件优势设计高效的数据布局,即研究面向扫描的数据处理策略以及构建数... 多维数据的查询和处理在数据库中普遍存在。高效的多维数据查询处理,一方面依赖于精细的索引结构,例如R-tree、KD-tree等被广泛应用;另一方面,也有诸多工作探索利用硬件优势设计高效的数据布局,即研究面向扫描的数据处理策略以及构建数据概要,避免高代价地访问原始数据。然而,随着数字化社会的发展,网络Web服务更加普及,传感器网络无处不在,诸如网约车、电子地图等基于位置的服务愈发盛行,使得多维数据正在以前所未有的速度产生,对查询处理提出新的要求,包括更快的查询响应、更低的存储占用。近年来,机器学习包括深度学习算法不断优化,且计算机等硬件环境持续发展,为多维数据查询处理带来更多的优化契机,不仅降低查询执行时间,同时能够节省存储资源,取得显著性优势。因此,机器学习被广泛应用于构建更好的数据管理和数据分析任务解决方案。该文提出机器学习赋能的多维数据查询处理研究框架,一方面介绍机器学习模型对多维索引结构的优化和改进;另一方面,介绍机器学习对不依赖索引结构的查询处理任务的赋能研究,包括数据布局策略和数据概要研究。在总结已有研究现状的基础上,指出该领域面临的挑战和未来研究方向。 展开更多
关键词 查询处理 多维学习化索引 数据布局 数据概要 机器学习
在线阅读 下载PDF
不确定性Top-K查询处理 被引量:55
14
作者 李文凤 彭智勇 李德毅 《软件学报》 EI CSCD 北大核心 2012年第6期1542-1560,共19页
高效Top-K查询处理在涉及大量数据交互的应用中是一项重要技术,随着应用中不确定性数据的大量涌现,不确定性数据的管理逐渐引起人们的重视.不确定性数据上Top-K查询从语义和处理上都呈现出与传统Top-K查询不同的特点.在主流不确定性数... 高效Top-K查询处理在涉及大量数据交互的应用中是一项重要技术,随着应用中不确定性数据的大量涌现,不确定性数据的管理逐渐引起人们的重视.不确定性数据上Top-K查询从语义和处理上都呈现出与传统Top-K查询不同的特点.在主流不确定性数据模型和可能世界语义模型下,学者们已经提出了多种不确定性Top-K查询的语义和处理方法.介绍了当前不确定性Top-K查询的研究工作,并对其进行分类,讨论包括语义、排序标准、算法以及应用等方面的技术.最后提出不确定性Top-K查询面临的挑战和下一步的发展方向. 展开更多
关键词 Top—K查询语义 top-k查询处理 排序标准 不确定性数据 可能世界
在线阅读 下载PDF
不确定数据Top-k查询算法 被引量:6
15
作者 周帆 李树全 +1 位作者 肖春静 吴跃 《电子测量与仪器学报》 CSCD 2010年第7期650-657,共8页
不确定数据普遍存在于大量应用之中,如移动计算、RFID技术和传感器网络等。针对不确定数据的各种查询算法是数据库领域近年来的热点研究课题。其中,基于不确定数据的Top-k查询和排序查询提出了很多有价值的查询语义和查询算法。详细分... 不确定数据普遍存在于大量应用之中,如移动计算、RFID技术和传感器网络等。针对不确定数据的各种查询算法是数据库领域近年来的热点研究课题。其中,基于不确定数据的Top-k查询和排序查询提出了很多有价值的查询语义和查询算法。详细分析这一最新领域提出的各种查询算法、数据模型、算法复杂度和不同算法所适应的应用场景,并用实验从多个方面比较不同查询算法执行效率、查询语义以及结果集的关联程度。 展开更多
关键词 概率数据库 不确定top-k查询 排序查询 查询算法 数据处理
在线阅读 下载PDF
基于不确定数据的分布式Top-k查询算法 被引量:2
16
作者 王爽 王国仁 《东北大学学报(自然科学版)》 EI CAS CSCD 北大核心 2010年第2期177-180,共4页
目前基于不确定数据的Top-k查询算法仅考虑了集中式的环境,为了解决分布式系统中节省系统带宽的问题,在此基础上,提出了在分布式环境中基于不确定数据的Top-k查询算法UDTopk.该算法定义了一个候选集(candidate set),仅使用候选集中的数... 目前基于不确定数据的Top-k查询算法仅考虑了集中式的环境,为了解决分布式系统中节省系统带宽的问题,在此基础上,提出了在分布式环境中基于不确定数据的Top-k查询算法UDTopk.该算法定义了一个候选集(candidate set),仅使用候选集中的数据,而不用访问数据集中所有数据,就可以得到正确的Top-k查询答案.算法通过动态维护候选集、仅传输少量数据,达到减少网络中数据传输的目的.实验结果表明,该算法可以有效地节省网络带宽. 展开更多
关键词 top-k查询 不确定数据 分布式处理 通信代价 查询处理
在线阅读 下载PDF
基于历史信息的高效近似查询系统
17
作者 韩雨钢 马廷淮 荣欢 《计算机工程与设计》 北大核心 2025年第2期578-586,共9页
近似查询处理技术是提高数据库聚合查询效率的重要方法,针对海量二维数据提出一种基于历史查询负载的近似查询系统,引入历史查询信息,通过在历史查询空间中进行命中性检测,提高查询区域偏斜等情况时的效率。针对全局查询,通过空间数据... 近似查询处理技术是提高数据库聚合查询效率的重要方法,针对海量二维数据提出一种基于历史查询负载的近似查询系统,引入历史查询信息,通过在历史查询空间中进行命中性检测,提高查询区域偏斜等情况时的效率。针对全局查询,通过空间数据划分方法将完整数据集划分为子区域,组织为树状分片索引结构,实现采样和数据摘要方法的结合,提高查询准确性。实验结果表明,当历史查询记录量达到10~4量级时,查询响应时间仅为传统方法的40%。与传统方法相比,该系统平均相对误差降低了63%。随分片数的增加效果有更大提升,当分片数达64时,其平均相对误差仅为传统方法的10%。 展开更多
关键词 数据库系统 近似查询处理 空间索引 历史查询 分片索引树 学习型索引 空间填充曲线
在线阅读 下载PDF
GPU加速的分段Top-k查询算法 被引量:1
18
作者 黄玉龙 邹循进 +1 位作者 刘奎 苏本跃 《计算机应用》 CSCD 北大核心 2014年第11期3112-3116,共5页
现有Top-k查询优化算法无法充分利用图形处理器(GPU)强大的并行吞吐量及时获取查询结果,为此提出了一种基于统一计算设备架构(CUDA)模型的大规模分段查询算法。通过划分查询过程以及采用分段并行处理策略,该算法可最大限度地提升查询过... 现有Top-k查询优化算法无法充分利用图形处理器(GPU)强大的并行吞吐量及时获取查询结果,为此提出了一种基于统一计算设备架构(CUDA)模型的大规模分段查询算法。通过划分查询过程以及采用分段并行处理策略,该算法可最大限度地提升查询过程中的计算和比较效率。实验结果表明,与4线程多核优化算法相比,所提算法具有明显的性能优势,当有序列表数量为6,遍历步长为120时,性能达到最优,此时比多核算法快40倍。 展开更多
关键词 top-k查询 通用计算图形处理器 分段处理 并行优化 禁止随机访问
在线阅读 下载PDF
RDF图的Top-k最短路径查询 被引量:1
19
作者 章登义 吴文李 欧阳黜霏 《电子学报》 EI CAS CSCD 北大核心 2015年第8期1531-1537,共7页
最短路径查询是图数据管理与复杂关系挖掘的基本操作之一.本文针对资源描述框架图上的top-k最短路径查询,构造基于组件的索引,并在该索引的基础上实现查询的响应.查询优化阶段,针对查询效率问题,提出频繁路径以及结构剪枝策略,... 最短路径查询是图数据管理与复杂关系挖掘的基本操作之一.本文针对资源描述框架图上的top-k最短路径查询,构造基于组件的索引,并在该索引的基础上实现查询的响应.查询优化阶段,针对查询效率问题,提出频繁路径以及结构剪枝策略,并给出有效性证明.实验表明,本文方法准确返回top-k最短路径并提高92%的查询速率.索引构造时间相比已有方法,提高约56%.同时,索引所占空间仅为原始数据大小的1~1.2倍. 展开更多
关键词 资源描述框架 最短路径查询 图数据库 top-k 查询处理
在线阅读 下载PDF
基于StarRocks的实时物联网数据处理系统 被引量:2
20
作者 董一舟 潘伟华 +1 位作者 张楠 孟壮 《计算机与现代化》 2025年第1期15-19,共5页
随着物联网技术的普及和应用,大量的实时数据需要被处理和分析,因为物联数据的海量性、实时性特点,传统数据库无法满足其数据存储规模和数据处理效率的要求。本文提出一种基于StarRocks的分布式实时物联网数据处理系统。该系统利用StarR... 随着物联网技术的普及和应用,大量的实时数据需要被处理和分析,因为物联数据的海量性、实时性特点,传统数据库无法满足其数据存储规模和数据处理效率的要求。本文提出一种基于StarRocks的分布式实时物联网数据处理系统。该系统利用StarRocks的分布式架构构建底层数据存储,通过引入消息队列和数据合并批量提交技术,保证数据的快速写入;同时通过存储策略优化、索引优化、物化视图技术,实现对大规模实时数据的快速处理和查询;系统强大的数据压缩能力也有效节省了数据存储空间。该框架在数据存储规模上支持横向扩展,提高了可用性和健壮性。通过实验分析,该系统在数据写入、数据查询、数据压缩方面较传统分布式数据库具有明显优势。 展开更多
关键词 StarRocks 实时数据处理 分布式系统 数据压缩 查询优化
在线阅读 下载PDF
上一页 1 2 35 下一页 到第
使用帮助 返回顶部