期刊文献+
共找到2,179篇文章
< 1 2 109 >
每页显示 20 50 100
Semantic-based query processing for relational data integration 被引量:1
1
作者 苗壮 张亚非 +2 位作者 王进鹏 陆建江 周波 《Journal of Southeast University(English Edition)》 EI CAS 2011年第1期22-25,共4页
To solve the query processing correctness problem for semantic-based relational data integration,the semantics of SAPRQL(simple protocol and RDF query language) queries is defined.In the course of query rewriting,al... To solve the query processing correctness problem for semantic-based relational data integration,the semantics of SAPRQL(simple protocol and RDF query language) queries is defined.In the course of query rewriting,all relative tables are found and decomposed into minimal connectable units.Minimal connectable units are joined according to semantic queries to produce the semantically correct query plans.Algorithms for query rewriting and transforming are presented.Computational complexity of the algorithms is discussed.Under the worst case,the query decomposing algorithm can be finished in O(n2) time and the query rewriting algorithm requires O(nm) time.And the performance of the algorithms is verified by experiments,and experimental results show that when the length of query is less than 8,the query processing algorithms can provide satisfactory performance. 展开更多
关键词 data integration relational database simple protocol and RDF query language(SPARQL) minimal connectable unit query processing
在线阅读 下载PDF
Design and development of real-time query platform for big data based on hadoop 被引量:1
2
作者 刘小利 Xu Pandeng +1 位作者 Liu Mingliang Zhu Guobin 《High Technology Letters》 EI CAS 2015年第2期231-238,共8页
This paper designs and develops a framework on a distributed computing platform for massive multi-source spatial data using a column-oriented database(HBase).This platform consists of four layers including ETL(extract... This paper designs and develops a framework on a distributed computing platform for massive multi-source spatial data using a column-oriented database(HBase).This platform consists of four layers including ETL(extraction transformation loading) tier,data processing tier,data storage tier and data display tier,achieving long-term store,real-time analysis and inquiry for massive data.Finally,a real dataset cluster is simulated,which are made up of 39 nodes including 2 master nodes and 37 data nodes,and performing function tests of data importing module and real-time query module,and performance tests of HDFS's I/O,the MapReduce cluster,batch-loading and real-time query of massive data.The test results indicate that this platform achieves high performance in terms of response time and linear scalability. 展开更多
关键词 big data massive data storage real-time query HADOOP distributed computing
在线阅读 下载PDF
Multidimensional Data Querying on Tree-Structured Overlay
3
作者 XU Lizhen WANG Shiyuan 《Wuhan University Journal of Natural Sciences》 CAS 2006年第5期1367-1372,共6页
Multidimensional data query has been gaining much interest in database research communities in recent years, yet many of the existing studies focus mainly on ten tralized systems. A solution to querying in Peer-to-Pee... Multidimensional data query has been gaining much interest in database research communities in recent years, yet many of the existing studies focus mainly on ten tralized systems. A solution to querying in Peer-to-Peer(P2P) environment was proposed to achieve both low processing cost in terms of the number of peers accessed and search messages and balanced query loads among peers. The system is based on a balanced tree structured P2P network. By partitioning the query space intelligently, the amount of query forwarding is effectively controlled, and the number of peers involved and search messages are also limited. Dynamic load balancing can be achieved during space partitioning and query resolving. Extensive experiments confirm the effectiveness and scalability of our algorithms on P2P networks. 展开更多
关键词 range query skyline query P2P indexing multi-dimensional data partition
在线阅读 下载PDF
Linked-Tree: An Aggregate Query Algorithm Based on Sliding Window over Data Stream
4
作者 YU Yaxin WANG Guoren +1 位作者 SU Dong ZHU Xinhua 《Wuhan University Journal of Natural Sciences》 CAS 2006年第5期1114-1119,共6页
How to process aggregate queries over data streams efficiently and effectively have been becoming hot re search topics in both academic community and industrial community. Aiming at the issues, a novel Linked-tree alg... How to process aggregate queries over data streams efficiently and effectively have been becoming hot re search topics in both academic community and industrial community. Aiming at the issues, a novel Linked-tree algorithm based on sliding window is proposed in this paper. Due to the proposal of concept area, the Linked-tree algorithm reuses many primary results in last window and then avoids lots of unnecessary repeated comparison operations between two successive windows. As a result, execution efficiency of MAX query is improved dramatically. In addition, since the size of memory is relevant to the number of areas but irrelevant to the size of sliding window, memory is economized greatly. The extensive experimental results show that the performance of Linked-tree algorithm has significant improvement gains over the traditional SC (Simple Compared) algorithm and Ranked-tree algorithm. 展开更多
关键词 data streams sliding window aggregate query area HOP
在线阅读 下载PDF
Supporting Various Top-k Queries over Uncertain Datasets
5
作者 LI Wenfeng FU Zufa +2 位作者 WANG Liwei LI Deyi PENG Zhiyong 《Wuhan University Journal of Natural Sciences》 CAS 2014年第1期84-92,共9页
There have been many researches and semantics in answering top-k queries on uncertain data in various applications. However, most of these semantics must consume much of their time in computing position probability. O... There have been many researches and semantics in answering top-k queries on uncertain data in various applications. However, most of these semantics must consume much of their time in computing position probability. Our approach to support various top-k queries is based on position probability distribution (PPD) sharing. In this paper, a PPD-tree structure and several basic operations on it are proposed to support various top-k queries. In addition, we proposed an approximation method to improve the efficiency of PPD generation. We also verify the effectiveness and efficiency of our approach by both theoretical analysis and experiments. 展开更多
关键词 top-k queries uncertain data position probability distribution
原文传递
A Shallow Parsing Approach to Natural Language Queries of a Database
6
作者 Richard Skeggs Stasha Lauria 《Journal of Software Engineering and Applications》 2019年第9期365-382,共18页
The performance and reliability of converting natural language into structured query language can be problematic in handling nuances that are prevalent in natural language. Relational databases are not designed to und... The performance and reliability of converting natural language into structured query language can be problematic in handling nuances that are prevalent in natural language. Relational databases are not designed to understand language nuance, therefore the question why we must handle nuance has to be asked. This paper is looking at an alternative solution for the conversion of a Natural Language Query into a Structured Query Language (SQL) capable of being used to search a relational database. The process uses the natural language concept, Part of Speech to identify words that can be used to identify database tables and table columns. The use of Open NLP based grammar files, as well as additional configuration files, assist in the translation from natural language to query language. Having identified which tables and which columns contain the pertinent data the next step is to create the SQL statement. 展开更多
关键词 NLIDB NATURAL LANGUAGE Processing dataBASE query data MINING
在线阅读 下载PDF
Management and instant query of distributed oil and gas production dynamic data
7
作者 WANG Hongliang MU Longxin +2 位作者 SHI Fugeng LIU Kaiming QIAN Yurong 《Petroleum Exploration and Development》 2019年第5期1014-1021,共8页
The multidimensional analysis engine data management platform is constructed using big data distributed storage and parallel computing,data warehouse modeling technology,realizing the optimal management and instant qu... The multidimensional analysis engine data management platform is constructed using big data distributed storage and parallel computing,data warehouse modeling technology,realizing the optimal management and instant query of distributed oil and gas production dynamic big data.The centralized management and quick response of the production data of more than 36×10^4 oil,gas and water wells is realized.Multidimensional analysis subject model of oil,gas and water well production is built to pretreat the relevant data.At the level of China National Petroleum Corporation(CNPC),the rapid analysis and applications such as oil and gas production tracking,early production warning of key oilfields,analysis of low production wells and long shutdown wells,classification of reservoir development laws have been realized,and the processing time has been shortened from 1 d to 5 s.The basic unit of oil and gas production analysis is refined from oilfield to single well,making the production management more detailed.The process can be traced step by step according to CNPC,oil field company,field,block and single well,and the oil and gas production performance of each unit can be mastered in real time. 展开更多
关键词 PRODUCTION performance big data parallel computation MULTIDIMENSIONAL analysis optimal MANAGEMENT INSTANT query early PRODUCTION WARNING
在线阅读 下载PDF
Visual Composition of Complex Queries on an Integrative Genomic and Proteomic Data Warehouse
8
作者 Francesco Pessina Marco Masseroli Arif Canakoglu 《Engineering(科研)》 2013年第10期94-98,共5页
Biomedical questions are usually complex and regard several different life science aspects. Numerous valuable and he- terogeneous data are increasingly available to answer such questions. Yet, they are dispersedly sto... Biomedical questions are usually complex and regard several different life science aspects. Numerous valuable and he- terogeneous data are increasingly available to answer such questions. Yet, they are dispersedly stored and difficult to be queried comprehensively. We created a Genomic and Proteomic Data Warehouse (GPDW) that integrates data provided by some of the main bioinformatics databases. It adopts a modular integrated data schema and several metadata to describe the integrated data, their sources and their location in the GPDW. Here, we present the Web application that we developed to enable any user to easily compose queries, although complex, on all data integrated in the GPDW. It is publicly available at http://www.bioinformatics.dei.polimi.it/GPKB/. Through a visual interface, the user is only required to select the types of data to be included in the query and the conditions on their values to be retrieved. Then, the Web application leverages the metadata and modular schema of the GPDW to automatically compose an efficient SQL query, run it on the GPDW and show the extracted requested data, enriched with links to external data sources. Performed tests demonstrated efficiency and usability of the developed Web application, and showed its and GPDW relevance in supporting answering biomedical questions, also difficult. 展开更多
关键词 SQL query COMPOSITION VISUAL Interface Integrated data Extraction data Warehousing Bioinformatics database
暂未订购
A Systematic Review of Automated Classification for Simple and Complex Query SQL on NoSQL Database
9
作者 Nurhadi Rabiah Abdul Kadir +1 位作者 Ely Salwana Mat Surin Mahidur R.Sarker 《Computer Systems Science & Engineering》 2024年第6期1405-1435,共31页
A data lake(DL),abbreviated as DL,denotes a vast reservoir or repository of data.It accumulates substantial volumes of data and employs advanced analytics to correlate data from diverse origins containing various form... A data lake(DL),abbreviated as DL,denotes a vast reservoir or repository of data.It accumulates substantial volumes of data and employs advanced analytics to correlate data from diverse origins containing various forms of semi-structured,structured,and unstructured information.These systems use a flat architecture and run different types of data analytics.NoSQL databases are nontabular and store data in a different manner than the relational table.NoSQL databases come in various forms,including key-value pairs,documents,wide columns,and graphs,each based on its data model.They offer simpler scalability and generally outperform traditional relational databases.While NoSQL databases can store diverse data types,they lack full support for atomicity,consistency,isolation,and durability features found in relational databases.Consequently,employing machine learning approaches becomes necessary to categorize complex structured query language(SQL)queries.Results indicate that the most frequently used automatic classification technique in processing SQL queries on NoSQL databases is machine learning-based classification.Overall,this study provides an overview of the automatic classification techniques used in processing SQL queries on NoSQL databases.Understanding these techniques can aid in the development of effective and efficient NoSQL database applications. 展开更多
关键词 NoSQL database data lake machine learning ACID complex query smart city
在线阅读 下载PDF
Semantic Analysis of Natural Language Queries for an Object Oriented Database
10
作者 Bentamar Hemerelain Hafida Belbachir 《Journal of Software Engineering and Applications》 2010年第11期1047-1053,共7页
This paper presents the semantic analysis of queries written in natural language (French) and dedicated to the object oriented data bases. The studied queries include one or two nominal groups (NG) articulating around... This paper presents the semantic analysis of queries written in natural language (French) and dedicated to the object oriented data bases. The studied queries include one or two nominal groups (NG) articulating around a verb. A NG consists of one or several keywords (application dependent noun or value). Simple semantic filters are defined for identifying these keywords which can be of semantic value: class, simple attribute, composed attribute, key value or not key value. Coherence rules and coherence constraints are introduced, to check the validity of the co-occurrence of two consecutive nouns in complex NG. If a query is constituted of a single NG, no further analysis is required. Otherwise, if a query covers two valid NG, it is a subject of studying the semantic coherence of the verb and both NG which are attached to it. 展开更多
关键词 query NOMINAL Group Natural Language OBJECT Oriented data Base SEMANTIC Validation
暂未订购
Desktop Data Driven Approach to Personalize Query Recommendation
11
作者 Xiao-yun Li Ying Yu 《国际计算机前沿大会会议论文集》 2017年第1期25-27,共3页
Query recommendation is an effective method to help users describe their search intentions.In a personalized system,cold-start and the data sparsity were unavoidable,which directly lead to deficient performance of per... Query recommendation is an effective method to help users describe their search intentions.In a personalized system,cold-start and the data sparsity were unavoidable,which directly lead to deficient performance of personalizing.As a significant part of a user’s personal information space,a personal computer owns lots of documents relevant to his or her interest.Therefore,desktop data was introduced to construct a user’s preference model.Furthermore,considering the variety of desktop data,relationship between search task and work task was simultaneously exploited to predict a user’s specific information need.Ten volunteers joined experiments to evaluate the potential of desktop data.A series of experiments were conducted and the results proved that desktop data greatly contributed to providing effective personalized reference words.Besides,the results demonstrated that a user’s long-term interest model performed steadier than work task context,but the most valuable words were the top-3 words extracted from the work context. 展开更多
关键词 query RECOMMENDATION DESKTOP data USER model Work TASK SEARCH TASK
在线阅读 下载PDF
A solution of spatial query processing and query optimization for spatial databases
12
作者 YUANJie XIEKun-qing +2 位作者 MAXiu-jun ZHANGMin SUNLe-bin 《重庆邮电学院学报(自然科学版)》 2004年第5期165-172,共8页
Recently, attention has been focused on spatial query language which is used to query spatial databases. A design of spatial query language has been presented in this paper by extending the standard relational databas... Recently, attention has been focused on spatial query language which is used to query spatial databases. A design of spatial query language has been presented in this paper by extending the standard relational database query language SQL. It recognizes the significantly different requirements of spatial data handling and overcomes the inherent problems of the application of conventional database query languages. This design is based on an extended spatial data model, including the spatial data types and the spatial operators on them. The processing and optimization of spatial queries have also been discussed in this design. In the end, an implementation of this design is given in a spatial query subsystem. 展开更多
关键词 空间数据库 询问语言 空间数据模型 空间操作 最优化
在线阅读 下载PDF
Optimization of RDF link traversal based query execution 被引量:2
13
作者 朱艳琴 花岭 《Journal of Southeast University(English Edition)》 EI CAS 2013年第1期27-32,共6页
Aiming at the problem that only some types of SPARQL ( simple protocal and resource description framework query language) queries can be answered by using the current resource description framework link traversal ba... Aiming at the problem that only some types of SPARQL ( simple protocal and resource description framework query language) queries can be answered by using the current resource description framework link traversal based query execution (RDF-LTE) approach, this paper discusses how the execution order of the triple pattern affects the query results and cost based on concrete SPARQL queries, and analyzes two properties of the web of linked data, missing backward links and missing contingency solution. Then three heuristic principles for logic query plan optimization, namely, the filtered basic graph pattern (FBGP) principle, the triple pattern chain principle and the seed URIs principle, are proposed. The three principles contribute to decrease the intermediate solutions and increase the types of queries that can be answered. The effectiveness and feasibility of the proposed approach is evaluated. The experimental results show that more query results can be returned with less cost, thus enabling users to develop the full potential of the web of linked data. 展开更多
关键词 web of linked data resource description framework link traversal based query execution (RDF-LTE) SPARQL query query optimization
在线阅读 下载PDF
基于Term-Query-URL异构信息网络的查询推荐 被引量:3
14
作者 刘钰峰 李仁发 《湖南大学学报(自然科学版)》 EI CAS CSCD 北大核心 2014年第5期106-112,共7页
查询推荐是一种帮助搜索引擎更好的理解用户检索需求的方法.基于查询的上下文片段训练词汇和查询之间的语义关系,同时结合查询和URL的点击图以及查询中的序列行为构建Term Query URL异构信息网络,采用重启动随机游走(Random Walk withR... 查询推荐是一种帮助搜索引擎更好的理解用户检索需求的方法.基于查询的上下文片段训练词汇和查询之间的语义关系,同时结合查询和URL的点击图以及查询中的序列行为构建Term Query URL异构信息网络,采用重启动随机游走(Random Walk withRestart,RWR)进行查询推荐.综合利用语义信息和日志信息,提高了稀疏查询的推荐效果.基于概率语言模型构造查询的词汇向量,可以为新的查询进行查询推荐.在大规模商业搜索引擎查询日志上的实验表明本文方法相比传统的查询推荐方法性能提升约为3%~10%. 展开更多
关键词 信息检索 查询推荐 点击日志 重启动随机游走
在线阅读 下载PDF
基于XQuery符号表示法的模糊时空数据查询 被引量:1
15
作者 柏禄一 严丽 马宗民 《东北大学学报(自然科学版)》 EI CAS CSCD 北大核心 2013年第4期505-508,共4页
讨论了模糊时空数据查询的概念,研究了模糊时空数据时态查询、空间查询、属性查询和时空查询的各类查询形式及查询特性,基于XQuery提出了模糊时空数据查询的统一符号表示法,并对典型查询进行了举例说明.此外,还讨论了XQuery模糊时空扩... 讨论了模糊时空数据查询的概念,研究了模糊时空数据时态查询、空间查询、属性查询和时空查询的各类查询形式及查询特性,基于XQuery提出了模糊时空数据查询的统一符号表示法,并对典型查询进行了举例说明.此外,还讨论了XQuery模糊时空扩展问题,包括XQuery表达上的扩展、处理上的扩展及体系结构上的扩展.提出的表示法可以表示时空查询语言STQL和移动目标查询语言FTL中规定的各类查询,并可以消除由不同查询表现形式带来的语义混淆,以对各类模糊时空数据查询进行统一表示. 展开更多
关键词 模糊时空数据 查询语言 Xquery FLWOR语法 符号表示法
在线阅读 下载PDF
Data partitioning based on sampling for power load streams
16
作者 王永利 徐宏炳 +2 位作者 董逸生 钱江波 刘学军 《Journal of Southeast University(English Edition)》 EI CAS 2005年第3期293-298,共6页
A novel data streams partitioning method is proposed to resolve problems of range-aggregation continuous queries over parallel streams for power industry.The first step of this method is to parallel sample the data,wh... A novel data streams partitioning method is proposed to resolve problems of range-aggregation continuous queries over parallel streams for power industry.The first step of this method is to parallel sample the data,which is implemented as an extended reservoir-sampling algorithm.A skip factor based on the change ratio of data-values is introduced to describe the distribution characteristics of data-values adaptively.The second step of this method is to partition the fluxes of data streams averagely,which is implemented with two alternative equal-depth histogram generating algorithms that fit the different cases:one for incremental maintenance based on heuristics and the other for periodical updates to generate an approximate partition vector.The experimental results on actual data prove that the method is efficient,practical and suitable for time-varying data streams processing. 展开更多
关键词 data streams continuous queries parallel processing sampling data partitioning
在线阅读 下载PDF
结合重写与数据并行的XQuery查询优化
17
作者 陈荣鑫 《陕西科技大学学报(自然科学版)》 2011年第6期75-79,93,共6页
XQuery查询优化是提升查询引擎性能的关键途径.根据XQuery语言特点和多数据源的查询需求,通过在XQuery语言层的重写优化获取高效的查询计划;为适应多核计算环境,通过中间语言层的并行原语实现数据并行处理,进一步提升系统性能.开发查询... XQuery查询优化是提升查询引擎性能的关键途径.根据XQuery语言特点和多数据源的查询需求,通过在XQuery语言层的重写优化获取高效的查询计划;为适应多核计算环境,通过中间语言层的并行原语实现数据并行处理,进一步提升系统性能.开发查询引擎原型系统,实例测试表明,该优化方法能有效提升XQuery查询性能. 展开更多
关键词 Xquery语言 查询优化 查询重写 数据并行
在线阅读 下载PDF
基于XQuery的异构数据源查询处理 被引量:3
18
作者 严小泉 刘渊 《计算机工程》 CAS CSCD 北大核心 2009年第14期87-89,107,共4页
异构数据源的集成问题是当前数据处理领域内研究的热点,它能更有效地利用信息资源,更好地实现数据共享。介绍一种基于Mediator-Wrapper中间层的异构数据源集成系统框架,对XQuery查询处理过程及其关键问题,如查询分解和优化技术进行深入... 异构数据源的集成问题是当前数据处理领域内研究的热点,它能更有效地利用信息资源,更好地实现数据共享。介绍一种基于Mediator-Wrapper中间层的异构数据源集成系统框架,对XQuery查询处理过程及其关键问题,如查询分解和优化技术进行深入研究,并结合实例进一步说明异构数据源中查询分解和优化的具体实现。 展开更多
关键词 异构数据源 查询分解 查询优化
在线阅读 下载PDF
基于XQuery的数据集成系统中的查询分解算法 被引量:1
19
作者 廖伟 廖湖声 任宇 《通讯和计算机(中英文版)》 2005年第6期24-30,共7页
XQucry查询语言使用XML作为抽象数据模型。可以对基于XML的数据源作查询,无论这些数据源是真正的XML文件或者是中间件提供的XML视图。本文研究了以XQuery作为查询语言的数据集成系统中的查询分解算法。在XQucry语言的层次,利用它的语... XQucry查询语言使用XML作为抽象数据模型。可以对基于XML的数据源作查询,无论这些数据源是真正的XML文件或者是中间件提供的XML视图。本文研究了以XQuery作为查询语言的数据集成系统中的查询分解算法。在XQucry语言的层次,利用它的语言特点实现了多数据源的查询分解算法。 展开更多
关键词 查询分解 Xquery 数据集成
在线阅读 下载PDF
Choosing meaningful structure data for improving web search
20
作者 郭茜 杨晓春 +1 位作者 于戈 李广翱 《Journal of Southeast University(English Edition)》 EI CAS 2008年第3期343-346,共4页
In order to improve the quality of web search,a new query expansion method by choosing meaningful structure data from a domain database is proposed.It categories attributes into three different classes,named as concep... In order to improve the quality of web search,a new query expansion method by choosing meaningful structure data from a domain database is proposed.It categories attributes into three different classes,named as concept attribute,context attribute and meaningless attribute,according to their semantic features which are document frequency features and distinguishing capability features.It also defines the semantic relevance between two attributes when they have correlations in the database.Then it proposes trie-bitmap structure and pair pointer tables to implement efficient algorithms for discovering attribute semantic feature and detecting their semantic relevances.By using semantic attributes and their semantic relevances,expansion words can be generated and embedded into a vector space model with interpolation parameters.The experiments use an IMDB movie database and real texts collections to evaluate the proposed method by comparing its performance with a classical vector space model.The results show that the proposed method can improve text search efficiently and also improve both semantic features and semantic relevances with good separation capabilities. 展开更多
关键词 WEB SEMANTIC attributes relationship structure data query expansion
在线阅读 下载PDF
上一页 1 2 109 下一页 到第
使用帮助 返回顶部