期刊文献+
共找到9,720篇文章
< 1 2 250 >
每页显示 20 50 100
Top-K Query Framework in Wireless Sensor Networks for Smart Grid 被引量:2
1
作者 WANG Hui GUAN Zhitao +1 位作者 YANG Tingting XU Yue 《China Communications》 SCIE CSCD 2014年第6期89-98,共10页
The smart grid has caught great attentions in recent years, which is poised to transform a centralized, producer-controlled network to a decentralized, consumer- interactive network that's supported by fine-grained m... The smart grid has caught great attentions in recent years, which is poised to transform a centralized, producer-controlled network to a decentralized, consumer- interactive network that's supported by fine-grained monitoring. Large-scale WSNs (Wireless Sensor Networks) have been considered one of the very promising technologies to support the implementation of smart grid. WSNs are applied in almost every aspect of smart grid, including power generation, power transmission, power distribution, power utilization and power dispatch, and the data query processing of 'WSNs in power grid' become an hotspot issue due to the amount of data of power grid is very large and the requirement of response time is very high. To meet the demands, top-k query processing is a good choice, which performs the cooperative query by aggregating the database objects' degree of match for each different query predicate and returning the best k matching objects. In this paper, a framework that can effectively apply top-k query to wireless sensor network in smart grid is proposed, which is based on the cluster-topology sensor network. In the new method, local indices are used to optimize the necessary query routing and process intermediate results inside the cluster to cut down the data traffic, and the hierarchical join query is executed based on the local results.Besides, top-k query results are verified by the clean-up process, and two schemes are taken to deal with the problem of node's dynamicity, which further reduce communication cost. Case studies and experimental results show that our algorithm has outperformed the current existing one with higher quality results and better efficiently. 展开更多
关键词 smart grid wireless sensornetwork top-k query framework.
在线阅读 下载PDF
Probabilistic Top-k Query:Model and Application on Web Traffic Analysis 被引量:1
2
作者 Xiaolin Gui Jun Liu +2 位作者 Qiujian Lv Chao Dong Zhenming Lei 《China Communications》 SCIE CSCD 2016年第6期123-137,共15页
Top-k ranking of websites according to traffic volume is important for Internet Service Providers(ISPs) to understand network status and optimize network resources. However, the ranking result always has a big deviati... Top-k ranking of websites according to traffic volume is important for Internet Service Providers(ISPs) to understand network status and optimize network resources. However, the ranking result always has a big deviation with actual rank for the existence of unknown web traffic, which cannot be identified accurately under current techniques. In this paper, we introduce a novel method to approximate the actual rank. This method associates unknown web traffic with websites according to statistical probabilities. Then, we construct a probabilistic top-k query model to rank websites. We conduct several experiments by using real HTTP traffic traces collected from a commercial ISP covering an entire city in northern China. Experimental results show that the proposed techniques can reduce the deviation existing between the ground truth and the ranking results vastly. In addition, we find that the websites providing video service have higher ratio of unknown IP as well as higher ratio of unknown traffic than the websites providing text web page service. Specifically, we find that the top-3 video websites have more than 90% of unknown web traffic. All these findings are helpful for ISPs understanding network status and deploying Content Distributed Network(CDN). 展开更多
关键词 top-k query traffic model temporal bipartite graph uncertain data unknown traffic
在线阅读 下载PDF
Approximate Continuous Top-k Query over Sliding Window 被引量:2
3
作者 Rui Zhu Bin Wang +2 位作者 Shi-Ying Luo Xiao-Chun Yang Guo-Ren Wang 《Journal of Computer Science & Technology》 SCIE EI CSCD 2017年第1期93-109,共17页
Continuous top-k query over sliding window is a fundamental problem in database, which retrieves k objects with the highest scores when the window slides. Existing studies mainly adopt exact algorithms to tackle this ... Continuous top-k query over sliding window is a fundamental problem in database, which retrieves k objects with the highest scores when the window slides. Existing studies mainly adopt exact algorithms to tackle this type of queries, whose key idea is to maintain a subset of objects in the window, and try to retrieve answers from it. However, all the existing algorithms are sensitive to query parameters and data distribution. In addition, they suffer from expensive overhead for incremental maintenance, and thus cannot satisfy real-time requirement. In this paper, we define a novel query named (ε, δ)-approximate continuous top-κ query, which returns approximate answers for top-κ query. In order to efficiently support this query, we propose an efficient framework, named PABF (Probabilistic Approximate Based Framework), to support approximate top-κ query over sliding window. We firstly maintain a self-adaptive pruning value, which could filter out newly arrived objects who have a probability less than 1 - 5 of being a query result. For those objects that are not filtered, we combine them together, if the score difference among them is less than a threshold. To efficiently maintain these combined results, the framework PABF also proposes a multi-phase merging algorithm. Theoretical analysis indicates that even in the worst case, we require only logarithmic complexity for maintaining each candidate. 展开更多
关键词 continuous top-k query APPROXIMATE sliding window
原文传递
Supporting Various Top-k Queries over Uncertain Datasets
4
作者 LI Wenfeng FU Zufa +2 位作者 WANG Liwei LI Deyi PENG Zhiyong 《Wuhan University Journal of Natural Sciences》 CAS 2014年第1期84-92,共9页
There have been many researches and semantics in answering top-k queries on uncertain data in various applications. However, most of these semantics must consume much of their time in computing position probability. O... There have been many researches and semantics in answering top-k queries on uncertain data in various applications. However, most of these semantics must consume much of their time in computing position probability. Our approach to support various top-k queries is based on position probability distribution (PPD) sharing. In this paper, a PPD-tree structure and several basic operations on it are proposed to support various top-k queries. In addition, we proposed an approximation method to improve the efficiency of PPD generation. We also verify the effectiveness and efficiency of our approach by both theoretical analysis and experiments. 展开更多
关键词 top-k queries uncertain data position probability distribution
原文传递
基于离散度分析的Top-k组合Skyline查询算法
5
作者 董雷刚 刘国华 +1 位作者 王鑫 崔晓微 《计算机应用与软件》 北大核心 2025年第2期72-80,共9页
现有的组合Skyline查询算法不能区分组合中数据的离散度,且输出结果集很大。针对这种情况,提出基于数据离散度分析的Top-k组合Skyline查询算法。提出基于权重的组合离散系数概念及其计算方法;设置分类器将组合划分至不同的组合队列;采... 现有的组合Skyline查询算法不能区分组合中数据的离散度,且输出结果集很大。针对这种情况,提出基于数据离散度分析的Top-k组合Skyline查询算法。提出基于权重的组合离散系数概念及其计算方法;设置分类器将组合划分至不同的组合队列;采用并行处理方式对各组合队列进行计算。实验结果表明,该算法可以根据用户自定义条件准确有效地返回结果,能满足实际应用的需要。 展开更多
关键词 组合Skyline 离散度分析 top-k 离散系数 分类器 并行处理
在线阅读 下载PDF
不确定时间序列Top-k窗口聚合查询方法
6
作者 张航 熊浩然 何震瀛 《计算机工程》 北大核心 2025年第7期161-170,共10页
近年来,如何分析挖掘不确定时间序列数据逐渐受到业界关注。Top-k查询作为数据库领域研究的热点问题,旨在从大规模数据中检索出最符合用户查询条件的前k项结果。然而,尽管Top-k查询在其他领域已被广泛应用,针对不确定时间序列的Top-k查... 近年来,如何分析挖掘不确定时间序列数据逐渐受到业界关注。Top-k查询作为数据库领域研究的热点问题,旨在从大规模数据中检索出最符合用户查询条件的前k项结果。然而,尽管Top-k查询在其他领域已被广泛应用,针对不确定时间序列的Top-k查询研究仍然较少。这种查询可以有效帮助用户从不确定时间序列提取重要信息。提出一种新的Top-k查询问题——不确定时间序列Top-k窗口聚合查询,并针对该问题给出高效的查询方法。这个查询可以作为一个基础工具,辅助用户探索和分析不确定时间序列数据。现有能够支持这个查询的方法均存在查询效率较低或所需存储空间过高的问题。针对该问题,提出一种基于子窗口拼接策略的两级Top-k查询方法,并提出高效计算阈值上界方法解决基于子窗口拼接策略引入的阈值计算复杂难题。该方法能够以较少的预计算存储空间,高效支持不确定时间序列Top-k窗口聚合查询。为了验证所提方法的有效性,在真实和人造数据集上进行实验。实验结果表明,所提方法与基于TA的Top-k查询方法相比,明显降低了预计算列表的存储空间;与基于遍历的FSEC-S方法相比,所提方法以及使用计算阈值上界优化方法的平均查询效率分别提升了7.27倍和20.04倍。 展开更多
关键词 不确定时间序列 top-k查询 窗口 聚合查询 有序列表 阈值
在线阅读 下载PDF
KDS-CM:A Cache Mechanism Based on Top-K Data Source for Deep Web Query
7
作者 KOU Yue SHEN Derong +2 位作者 YU Ge LI Dong NIE Tiezheng 《Wuhan University Journal of Natural Sciences》 CAS 2007年第5期830-834,共5页
Caching is an important technique to enhance the efficiency of query processing. Unfortunately, traditional caching mechanisms are not efficient for deep Web because of storage space and dynamic maintenance limitation... Caching is an important technique to enhance the efficiency of query processing. Unfortunately, traditional caching mechanisms are not efficient for deep Web because of storage space and dynamic maintenance limitations. In this paper, we present on providing a cache mechanism based on Top-K data source (KDS-CM) instead of result records for deep Web query. By integrating techniques from IR and Top-K, a data reorganization strategy is presented to model KDS-CM. Also some measures about cache management and optimization are proposed to improve the performances of cache effectively. Experimental results show the benefits of KDS-CM in execution cost and dynamic maintenance when compared with various alternate strategies. 展开更多
关键词 CACHE top-k Deep Web data reorganization cache management and optimization
在线阅读 下载PDF
A Database Querying Language for Formulating Relational Queries on Small Devices
8
作者 Ahmad Rohiza Abdul-Kareem Sameem 《Computer Technology and Application》 2011年第3期172-181,共10页
For small devices like the PDAs and mobile phones, formulation of relational database queries is not as simple as using conventional devices such as the personal computers and laptops. Due to the restricted size and r... For small devices like the PDAs and mobile phones, formulation of relational database queries is not as simple as using conventional devices such as the personal computers and laptops. Due to the restricted size and resources of these smaller devices, current works mostly limit the queries that can be posed by users by having them predetermined by the developers. This limits the capability of these devices in supporting robust queries. Hence, this paper proposes a universal relation based database querying language which is targeted for small devices. The language allows formulation of relational database queries that uses minimal query terms. The formulation of the language and its structure will be described and usability test results will be presented to support the effectiveness of the language. 展开更多
关键词 DATABASE query language relational queries small devices.
在线阅读 下载PDF
A Parallel Processing Method for Moving Top-K Spatial Keyword Query
9
作者 Kunlun Chen Yanru Liu Qingxu Deng 《Journal of Software Engineering and Applications》 2019年第4期72-84,共13页
We propose an influential set based moving k keyword query processing model, which avoids the shortcoming of safe region-based approaches that the update cost and update frequency cannot be optimized simultaneously. B... We propose an influential set based moving k keyword query processing model, which avoids the shortcoming of safe region-based approaches that the update cost and update frequency cannot be optimized simultaneously. Based on the model, we design a parallel query processing method and a parallel validation method for multicore processing platforms. The time complexity of the algorithms is O((log|D|+p.k)/p.k)?and O(log p.k), respectively, which are all O(1/k) times the time complexity of the state-of-the-art method. The experiment result confirms the superiority of our algorithms over the state-of-the-art method. 展开更多
关键词 SPATIAL KEYWORDS K Nearest NEIGHBORS Influential Set SPATIAL MOVING query Safe Region
在线阅读 下载PDF
Exploring features for automatic identification of news queries through query logs
10
作者 Xiaojuan ZHANG Jian LI 《Chinese Journal of Library and Information Science》 2014年第4期31-45,共15页
Purpose:Existing researches of predicting queries with news intents have tried to extract the classification features from external knowledge bases,this paper tries to present how to apply features extracted from quer... Purpose:Existing researches of predicting queries with news intents have tried to extract the classification features from external knowledge bases,this paper tries to present how to apply features extracted from query logs for automatic identification of news queries without using any external resources.Design/methodology/approach:First,we manually labeled 1,220 news queries from Sogou.com.Based on the analysis of these queries,we then identified three features of news queries in terms of query content,time of query occurrence and user click behavior.Afterwards,we used 12 effective features proposed in literature as baseline and conducted experiments based on the support vector machine(SVM)classifier.Finally,we compared the impacts of the features used in this paper on the identification of news queries.Findings:Compared with baseline features,the F-score has been improved from 0.6414 to0.8368 after the use of three newly-identified features,among which the burst point(bst)was the most effective while predicting news queries.In addition,query expression(qes)was more useful than query terms,and among the click behavior-based features,news URL was the most effective one.Research limitations:Analyses based on features extracted from query logs might lead to produce limited results.Instead of short queries,the segmentation tool used in this study has been more widely applied for long texts.Practical implications:The research will be helpful for general-purpose search engines to address search intents for news events.Originality/value:Our approach provides a new and different perspective in recognizing queries with news intent without such large news corpora as blogs or Twitter. 展开更多
关键词 query intent News query News intent query classification Automaticidentification
原文传递
top-k频繁挖掘下电力敏感数据差分隐私保护 被引量:1
11
作者 奚增辉 王卫斌 +2 位作者 屈志坚 姚嵘 陆嘉铭 《电子设计工程》 2025年第10期112-115,120,共5页
由于电力系统中的数据量庞大且具有动态变化的特性,敏感性和非敏感性的电力数据都存储在数据库中。如果用户在查询数据对象时发生错误,就会造成敏感数据的隐私泄露问题。为避免上述情况的发生,提出top-k频繁挖掘下电力敏感数据差分隐私... 由于电力系统中的数据量庞大且具有动态变化的特性,敏感性和非敏感性的电力数据都存储在数据库中。如果用户在查询数据对象时发生错误,就会造成敏感数据的隐私泄露问题。为避免上述情况的发生,提出top-k频繁挖掘下电力敏感数据差分隐私保护方法。通过设置top-k项目,对电力敏感数据频繁挖掘处理。引入差分隐私,创建电力敏感数据私有账本,分析其隐私性,完善差分隐私保护方案,实现对电力敏感数据差分隐私保护。实验结果表明,在top-k频繁挖掘算法作用下,主机元件不会出现错误查询到敏感性电力数据的情况,能够较好地保护敏感数据的差分隐私。 展开更多
关键词 top-k频繁挖掘 电力敏感数据 差分隐私 私有账本
在线阅读 下载PDF
ACCF:时间预测机制驱动的top-k流测量
12
作者 胡永庆 杨含 +2 位作者 刘子源 秦广军 戴庆龙 《计算机科学》 北大核心 2025年第10期98-105,共8页
针对当前top-k流测量过滤算法依赖固定计数器阈值的问题,提出了基于活跃度预测机制的ACCF(Activity Counting Cuckoo Filter)测量结构。ACCF通过引入活跃度预测机制,利用时间序列分析和指数加权移动平均(Exponentially Weighted Moving ... 针对当前top-k流测量过滤算法依赖固定计数器阈值的问题,提出了基于活跃度预测机制的ACCF(Activity Counting Cuckoo Filter)测量结构。ACCF通过引入活跃度预测机制,利用时间序列分析和指数加权移动平均(Exponentially Weighted Moving Average,EWMA)机制,动态计算网络流的活跃度,实现对潜在的top-k流的实时识别与提前过滤。针对哈希冲突可能导致的精度损失,ACCF引入了自刷新存储表(Self-Refreshing Storage Table,SRST),用于存储踢出路径上的网络流信息。当踢出操作达到设定的MaxNumKicks值时,SRST会在局部范围内优先踢出活跃度最小的网络流项,避免重要流量信息丢失。实验结果证明,ACCF与SRST在合适的参数组合条件下,可以提前过滤65%左右的大流并减少41%左右的插入操作,并显著提升了在top-k流量测量中的精度,尤其是在与传统的Space Saving(SS),CM Sketch,LUSketch和Cuckoo Counter算法对比时,展现了明显的优势。 展开更多
关键词 top-k 活跃度 时间序列 EWMA SRST SKETCH
在线阅读 下载PDF
基于Top-k查询算法的国际贸易数据高速检索研究
13
作者 汤陈燕 《湖南邮电职业技术学院学报》 2025年第3期62-67,共6页
传统数据高速检索方法的数据检索准确率易受数据相似度高的影响,基于此,引进Top-k查询算法,以国际贸易数据为例,设计了一种针对该数据的高速检索方法。运用小波分解技术对自整合的国际贸易数据进行除杂去噪处理,基于Top-k查询算法融合... 传统数据高速检索方法的数据检索准确率易受数据相似度高的影响,基于此,引进Top-k查询算法,以国际贸易数据为例,设计了一种针对该数据的高速检索方法。运用小波分解技术对自整合的国际贸易数据进行除杂去噪处理,基于Top-k查询算法融合相似国际贸易数据,并引进Solr数据检索引擎,从多个方面对高速检索行为进行概述,由此完成国际贸易数据高速检索方法设计。对比实验验证:所提出的高速检索方法在实际应用中的检索时间和检索正确率均优于传统方法。 展开更多
关键词 top-k查询算法 国际贸易 数据检索 小波分解
在线阅读 下载PDF
Optimizing Multimodal Data Queries in Data Lakes
14
作者 Runqun Xiong Shiyuan Zhao +1 位作者 Ciyuan Chen Zhuqing Xu 《Tsinghua Science and Technology》 2025年第6期2625-2637,共13页
This paper addresses the challenge of efficiently querying multimodal related data in data lakes,a large-scale storage and management system that supports heterogeneous data formats,including structured,semi-structure... This paper addresses the challenge of efficiently querying multimodal related data in data lakes,a large-scale storage and management system that supports heterogeneous data formats,including structured,semi-structured,and unstructured data.Multimodal data queries are crucial because they enable seamless retrieval of related data across modalities,such as tables,images,and text,which has applications in fields like e-commerce,healthcare,and education.However,existing methods primarily focus on single-modality queries,such as joinable or unionable table discovery,and struggle to handle the heterogeneity and lack of metadata in data lakes while balancing accuracy and efficiency.To tackle these challenges,we propose a Multimodal data Query mechanism for Data Lakes(MQDL),which employs a modality-adaptive indexing mechanism raleted and contrastive learning based embeddings to unify representations across modalities.Additionally,we introduce product quantization to optimize candidate verification during queries,reducing computational overhead while maintaining precision.We evaluate MQDL using a table-image dataset across multiple business scenarios,measuring metrics such as precision,recall,and F1-score.Results show that MQDL achieves an accuracy rate of approximately 90%,while demonstrating strong scalability and reduced query response time compared to traditional methods.These findings highlight MQDL's potential to enhance multimodal data retrieval in complex data lake environments. 展开更多
关键词 multimodal data query data lake contrastive learning related data query
原文传递
PAV-A-kNN: A Novel Approachable kNN Query Method in Road Network Environments
15
作者 Kailai Zhou Weikang Xia Jiatai Wang 《Computers, Materials & Continua》 2025年第8期3217-3240,共24页
Ride-hailing(e.g.,DiDi andUber)has become an important tool formodern urban mobility.To improve the utilization efficiency of ride-hailing vehicles,a novel query method,called Approachable k-nearest neighbor(A-kNN),ha... Ride-hailing(e.g.,DiDi andUber)has become an important tool formodern urban mobility.To improve the utilization efficiency of ride-hailing vehicles,a novel query method,called Approachable k-nearest neighbor(A-kNN),has recently been proposed in the industry.Unlike traditional kNN queries,A-kNN considers not only the road network distance but also the availability status of vehicles.In this context,even vehicles with passengers can still be considered potential candidates for dispatch if their destinations are near the requester’s location.The V-Treebased query method,due to its structural characteristics,is capable of efficiently finding k-nearest moving objects within a road network.It is a currently popular query solution in ride-hailing services.However,when vertices to be queried are close in the graph but distant in the index,the V-Tree-based method necessitates the traversal of numerous irrelevant subgraphs,which makes its processing of A-kNN queries less efficient.To address this issue,we optimize the V-Tree-based method and propose a novel index structure,the Path-Accelerated V-Tree(PAV-Tree),to improve query performance by introducing shortcuts.Leveraging this index,we introduce a novel query optimization algorithm,PAVA-kNN,specifically designed to processA-kNNqueries efficiently.Experimental results showthat PAV-A-kNNachieves query times up to 2.2–15 times faster than baseline methods,with microsecond-level latency. 展开更多
关键词 k-nearest neighbor query ride-hailing services V-Tree shortest path
在线阅读 下载PDF
基于效用表的Top-k高效用挖掘算法TKUL
16
作者 高敏节 张美春 《电脑编程技巧与维护》 2025年第10期38-40,共3页
针对现有高效用项集挖掘算法存在的阈值提升缓慢、剪枝效用差等问题,提出了一种能够更加高效地挖掘效用值最大的前k个项集的算法。TKUL(minging Top-K high Utility itemsets based List)算法综合采用RIUQ、CUDQ和EPB阈值提升策略,加快... 针对现有高效用项集挖掘算法存在的阈值提升缓慢、剪枝效用差等问题,提出了一种能够更加高效地挖掘效用值最大的前k个项集的算法。TKUL(minging Top-K high Utility itemsets based List)算法综合采用RIUQ、CUDQ和EPB阈值提升策略,加快最小阈值获取的速度,大大减少了生成的非高效用项集的数量,并通过RUI和EUCPM策略进行剪枝,有效缩小了搜索空间的规模,从而提高了高效用项集的挖掘效率。 展开更多
关键词 关联规则 高效用项集 top-k项集
在线阅读 下载PDF
Power Query与数据透视在工程计量自动汇总统计中的应用
17
作者 季国军 陈华 +1 位作者 张佳龙 薛雯雯 《安装》 2025年第10期95-98,共4页
工程量计量与汇总统计工作贯穿于工程投标阶段、施工阶段和竣工阶段,工程计量后的汇总统计工作枯燥而繁琐,费时费力且容易出错,特别是当源数据发生变动时,汇总统计工作需重头再来,重复性机械性的人工汇总统计工作效率低下。本文结合实... 工程量计量与汇总统计工作贯穿于工程投标阶段、施工阶段和竣工阶段,工程计量后的汇总统计工作枯燥而繁琐,费时费力且容易出错,特别是当源数据发生变动时,汇总统计工作需重头再来,重复性机械性的人工汇总统计工作效率低下。本文结合实际项目案例,介绍了一种快速、准确、高效的工程量汇总统计方法,实现工程量计量汇总自动化更新,提高了工作效率与汇总统计的准确性。 展开更多
关键词 CAD POWER query 数据透视 统计汇总 数据分析
在线阅读 下载PDF
Optimum noise mechanism for diferentially private queries in discrete fnite sets
18
作者 Sachin Kadam Anna Scaglione +3 位作者 Nikhil Ravi Sean Peisert Brent Lunghino Aram Shumavon 《Cybersecurity》 2025年第4期1-19,共19页
The diferential privacy (DP) literature often centers on meeting privacy constraints by introducing noise to the query, typically using a pre-specifed parametric distribution model with one or two degrees of freedom. ... The diferential privacy (DP) literature often centers on meeting privacy constraints by introducing noise to the query, typically using a pre-specifed parametric distribution model with one or two degrees of freedom. However, this emphasis tends to neglect the crucial considerations of response accuracy and utility, especially in the context of categorical or discrete numerical database queries, where the parameters defning the noise distribution are fnite and could be chosen optimally. This paper addresses this gap by introducing a novel framework for designing an optimal noise probability mass function (PMF) tailored to discrete and fnite query sets. Our approach considers the modulo summation of random noise as the DP mechanism, aiming to present a tractable solution that not only satisfes privacy constraints but also minimizes query distortion. Unlike existing approaches focused solely on meet-ingprivacy constraints, our framework seeks to optimize the noise distribution under an arbitrary (ǫ, δ) constraint, thereby enhancing the accuracy and utility of the response. We demonstrate that the optimal PMF can be obtained through solving a mixed-integer linear program. Additionally, closed-form solutions for the optimal PMF are provided, minimizing the probability of error for two specifc cases. Numerical experiments highlight the superior performance of our proposed optimal mechanisms compared to state-of-the-art methods. This paper contributes to the DP literature by presenting a clear and systematic approach to designing noise mechanisms that not only satisfy pri-vacyrequirements but also optimize query distortion. The framework introduced here opens avenues for improved privacy-preserving database queries, ofering signifcant enhancements in response accuracy and utility. 展开更多
关键词 Diferential privacy Optimum noise mechanism Discrete queries MILP Error rate
原文传递
LRP:learned robust data partitioning for efficient processing of large dynamic queries
19
作者 Pengju LIU Pan CAI +2 位作者 Kai ZHONG Cuiping LI Hong CHEN 《Frontiers of Computer Science》 2025年第9期43-60,共18页
The interconnection between query processing and data partitioning is pivotal for the acceleration of massive data processing during query execution,primarily by minimizing the number of scanned block files.Existing p... The interconnection between query processing and data partitioning is pivotal for the acceleration of massive data processing during query execution,primarily by minimizing the number of scanned block files.Existing partitioning techniques predominantly focus on query accesses on numeric columns for constructing partitions,often overlooking non-numeric columns and thus limiting optimization potential.Additionally,these techniques,despite creating fine-grained partitions from representative queries to enhance system performance,experience from notable performance declines due to unpredictable fluctuations in future queries.To tackle these issues,we introduce LRP,a learned robust partitioning system for dynamic query processing.LRP first proposes a method for data and query encoding that captures comprehensive column access patterns from historical queries.It then employs Multi-Layer Perceptron and Long Short-Term Memory networks to predict shifts in the distribution of historical queries.To create high-quality,robust partitions based on these predictions,LRP adopts a greedy beam search algorithm for optimal partition division and implements a data redundancy mechanism to share frequently accessed data across partitions.Experimental evaluations reveal that LRP yields partitions with more stable performance under incoming queries and significantly surpasses state-of-the-art partitioning methods. 展开更多
关键词 data partitioning data encoding query prediction beam search data redundancy
原文传递
Cryptanalysis and improvement to the quantum private query protocol for enhancing database privacy
20
作者 Zhengda Shen Wenzhu Shao +4 位作者 Zhigang Li Xiaoyu Peng Nankun Mu Mahabubur Rahman Miraj Bin Liu 《Communications in Theoretical Physics》 2025年第6期122-128,共7页
In order to protect the privacy of the query user and database,some QKD-based quantum private query(QPQ)protocols were proposed.One example is the protocol proposed by Zhou et al,in which the user makes initial quantu... In order to protect the privacy of the query user and database,some QKD-based quantum private query(QPQ)protocols were proposed.One example is the protocol proposed by Zhou et al,in which the user makes initial quantum states and derives the key bit by comparing the initial quantum state and the outcome state returned from the database by ctrl or shift mode,instead of announcing two non-orthogonal qubits as others which may leak part secret information.To some extent,the security of the database and the privacy of the user are strengthened.Unfortunately,we find that in this protocol,the dishonest user could be obtained,utilizing unambiguous state discrimination,much more database information than that is analyzed in Zhou et al's original research.To strengthen the database security,we improved the mentioned protocol by modifying the information returned by the database in various ways.The analysis indicates that the security of the improved protocols is greatly enhanced. 展开更多
关键词 quantum private query quantum key distribution quantum cryptography
原文传递
上一页 1 2 250 下一页 到第
使用帮助 返回顶部