期刊文献+
共找到67篇文章
< 1 2 4 >
每页显示 20 50 100
Load-balancing data distribution in publish/subscribe mode
1
作者 李凯 汪芸 +1 位作者 殷奕 袁飞飞 《Journal of Southeast University(English Edition)》 EI CAS 2014年第4期428-433,共6页
To improve data distribution efficiency a load-balancing data distribution LBDD method is proposed in publish/subscribe mode.In the LBDD method subscribers are involved in distribution tasks and data transfers while r... To improve data distribution efficiency a load-balancing data distribution LBDD method is proposed in publish/subscribe mode.In the LBDD method subscribers are involved in distribution tasks and data transfers while receiving data themselves.A dissemination tree is constructed among the subscribers based on MD5 where the publisher acts as the root. The proposed method provides bucket construction target selection and path updates furthermore the property of one-way dissemination is proven.That the average out-going degree of a node is 2 is guaranteed with the proposed LBDD.The experiments on data distribution delay data distribution rate and load distribution are conducted. Experimental results show that the LBDD method aids in shaping the task load between the publisher and subscribers and outperforms the point-to-point approach. 展开更多
关键词 data distribution publish/subscribe mode load balance dissemination tree
在线阅读 下载PDF
Data distribution inference attack in federated learning via reinforcement learning support
2
作者 Dongxiao Yu Hengming Zhang +1 位作者 Yan Huang Zhenzhen Xie 《High-Confidence Computing》 2025年第1期47-55,共9页
Federated Learning(FL)is currently a widely used collaborative learning framework,and the distinguished feature of FL is that the clients involved in training do not need to share raw data,but only transfer the model ... Federated Learning(FL)is currently a widely used collaborative learning framework,and the distinguished feature of FL is that the clients involved in training do not need to share raw data,but only transfer the model parameters to share knowledge,and finally get a global model with improved performance.However,recent studies have found that sharing model parameters may still lead to privacy leakage.From the shared model parameters,local training data can be reconstructed and thus lead to a threat to individual privacy and security.We observed that most of the current attacks are aimed at client-specific data reconstruction,while limited attention is paid to the information leakage of the global model.In our work,we propose a novel FL attack based on shared model parameters that can deduce the data distribution of the global model.Different from other FL attacks that aim to infer individual clients’raw data,the data distribution inference attack proposed in this work shows that the attackers can have the capability to deduce the data distribution information behind the global model.We argue that such information is valuable since the training data behind a welltrained global model indicates the common knowledge of a specific task,such as social networks and e-commerce applications.To implement such an attack,our key idea is to adopt a deep reinforcement learning approach to guide the attack process,where the RL agent adjusts the pseudo-data distribution automatically until it is similar to the ground truth data distribution.By a carefully designed Markov decision proces(MDP)process,our implementation ensures our attack can have stable performance and experimental results verify the effectiveness of our proposed inference attack. 展开更多
关键词 Sharing model parameters data distribution attacks Federated learning Reinforcement learning
在线阅读 下载PDF
A Survey and Experimental Review on Data Distribution Strategies for Parallel Spatial Clustering Algorithms 被引量:1
3
作者 Jagat Sesh Challa Navneet Goyal +3 位作者 Amogh Sharma Nikhil Sreekumar Sundar Balasubramaniam Poonam Goyal 《Journal of Computer Science & Technology》 SCIE EI CSCD 2024年第3期610-636,共27页
The advent of Big Data has led to the rapid growth in the usage of parallel clustering algorithms that work over distributed computing frameworks such as MPI,MapReduce,and Spark.An important step for any parallel clus... The advent of Big Data has led to the rapid growth in the usage of parallel clustering algorithms that work over distributed computing frameworks such as MPI,MapReduce,and Spark.An important step for any parallel clustering algorithm is the distribution of data amongst the cluster nodes.This step governs the methodology and performance of the entire algorithm.Researchers typically use random,or a spatial/geometric distribution strategy like kd-tree based partitioning and grid-based partitioning,as per the requirements of the algorithm.However,these strategies are generic and are not tailor-made for any specific parallel clustering algorithm.In this paper,we give a very comprehensive literature survey of MPI-based parallel clustering algorithms with special reference to the specific data distribution strategies they employ.We also propose three new data distribution strategies namely Parameterized Dimensional Split for parallel density-based clustering algorithms like DBSCAN and OPTICS,Cell-Based Dimensional Split for dGridSLINK,which is a grid-based hierarchical clustering algorithm that exhibits efficiency for disjoint spatial distribution,and Projection-Based Split,which is a generic distribution strategy.All of these preserve spatial locality,achieve disjoint partitioning,and ensure good data load balancing.The experimental analysis shows the benefits of using the proposed data distribution strategies for algorithms they are designed for,based on which we give appropriate recommendations for their usage. 展开更多
关键词 parallel data mining data distribution parallel clustering spatial locality preservation
原文传递
A parallel matching algorithm based on order relation for HLA data distribution management 被引量:1
4
作者 Yanbing Liu Hongbo Sun +1 位作者 Wenhui Fan Tianyuan Xiao 《International Journal of Modeling, Simulation, and Scientific Computing》 EI 2015年第2期1-15,共15页
In distribution simulation based on High-level architecture(HLA),data distribution management(DDM)is one of HLA services for the purpose of filtering the unnecessary data transferring over the network.DDM admits the s... In distribution simulation based on High-level architecture(HLA),data distribution management(DDM)is one of HLA services for the purpose of filtering the unnecessary data transferring over the network.DDM admits the sending federates and the receiving federates to express their interest using update regions and subscription regions in a multidimensional routing space.There are several matching algorithms to obtain overlap information between the update regions and subscription regions.When the number of regions increase sharply,the matching process is time consuming.However,the existing algorithms is hard to be parallelized to take advantage of the computing capabilities of multi-core processors.To reduce the computational overhead of region matching,we propose a parallel algorithm based on order relation to accelerate the matching process.The new matching algorithm adopts divide-and-conquer approach to divide the regions into multiple region bound sublists,each of which comprises parts of region bounds.To calculate the intersection inside and amongst the region bound sublists,two matching rules are presented.This approach has good performance since it performs region matching on the sublists parallel and does not require unnecessary comparisons within regions in different sublists.Theoretical analysis has been carried out for the proposed algorithm and experimental result shows that the proposed algorithm has better performance than major existing DDM matching algorithms. 展开更多
关键词 High-level architecture data distribution management matching algorithm
原文传递
Supporting Flexible Data Distributionin Software DSMs
5
作者 洪锦伟 陈国良 张兆庆 《Journal of Computer Science & Technology》 SCIE EI CSCD 2000年第5期445-452,共8页
Page-based software DSM systems suffer from false sharing caused by the large sharing granularity, and only support one-dimension Block or Cyclicblock data distribution schemes. Thus applications running on them will... Page-based software DSM systems suffer from false sharing caused by the large sharing granularity, and only support one-dimension Block or Cyclicblock data distribution schemes. Thus applications running on them will suffer from poor data locality and will be able to exploit parallelism only when using a large number of processors. In this paper, a way towards supporting flexible data distribution (FDD) on software DSM system is presented. Small granularity-tunable blocks, the size of which can be set by compiler or programmer, are used to overlap the working data sets distributed among processors. The FDD was implemented on a software DSM system called JIAJIA. Compared with Block/Cyclic-block distribution schemes used by most DSM systems now, experiments show that the proposed way of flexible data distribution is more effective. The performance of the applications used in the experiments is significantly improved. 展开更多
关键词 DSM JIAJIA data distribution address computation Dawning
原文传递
Spectral clustering eigenvector selection of hyperspectral image based on the coincidence degree of data distribution
6
作者 Zhongliang Ren Qiuping Zhai Lin Sun 《International Journal of Digital Earth》 SCIE EI 2023年第1期3489-3512,共24页
Spectral clustering is a well-regarded subspace clustering algorithm that exhibits outstanding performance in hyperspectral image classification through eigenvalue decomposition of the Laplacian matrix.However,its cla... Spectral clustering is a well-regarded subspace clustering algorithm that exhibits outstanding performance in hyperspectral image classification through eigenvalue decomposition of the Laplacian matrix.However,its classification accuracy is severely limited by the selected eigenvectors,and the commonly used eigenvectors not only fail to guarantee the inclusion of detailed discriminative information,but also have high computational complexity.To address these challenges,we proposed an intuitive eigenvector selection method based on the coincidence degree of data distribution(CDES).First,the clustering result of improved k-means,which can well reflect the spatial distribution of various types was used as the reference map.Then,the adjusted Rand index and adjusted mutual information were calculated to assess the data distribution consistency between each eigenvector and the reference map.Finally,the eigenvectors with high coincidence degrees were selected for clustering.A case study on hyperspectral mineral mapping demonstrated that the mapping accuracies of CDES are approximately 56.3%,15.5%,and 10.5%higher than those of the commonly used top,high entropy,and high relevance eigenvectors,and CDES can save more than 99%of the eigenvector selection time.Especially,due to the unsupervised nature of k-means,CDES provides a novel solution for autonomous feature selection of hyperspectral images. 展开更多
关键词 Eigenvector selection spectral clustering coincidence degree of data distribution hyperspectral mineral mapping
原文传递
Similarities and differences of city-size distributions in three main urban agglomerations of China from 1992 to 2015: A comparative study based on nighttime light data 被引量:17
7
作者 高宾 黄庆旭 +1 位作者 何春阳 窦银银 《Journal of Geographical Sciences》 SCIE CSCD 2017年第5期533-545,共13页
Comparing the city-size distribution at the urban agglomeration(UA) scale is important for understanding the processes of urban development. However, comparative studies of city-size distribution among China's thre... Comparing the city-size distribution at the urban agglomeration(UA) scale is important for understanding the processes of urban development. However, comparative studies of city-size distribution among China's three largest UAs, the Beijing-Tianjin-Hebei agglomeration(BTHA), the Yangtze River Delta agglomeration(YRDA), and the Pearl River Delta agglomeration(PRDA), remain inadequate due to the limitation of data availability. Therefore, using urban data derived from time-series nighttime light data, the common characteristics and distinctive features of city-size distribution among the three UAs from 1992 to 2015 were compared by the Pareto regression and the rank clock method. We identified two common features. First, the city-size distribution became more even. The Pareto exponents increased by 0.17, 0.12, and 0.01 in the YRDA, BTHA, and PRDA, respectively. Second, the average ranks of small cities ascended, being 0.55, 0.08 and 0.04 in the three UAs, respectively. However, the average ranks of large and medium cities in the three UAs experienced different trajectories, which are closely related to the similarities and differences in the driving forces for the development of UAs. Place-based measures are encouraged to promote a coordinated development among cities of differing sizes in the three UAs. 展开更多
关键词 city-size distribution comparative study nighttime light data rank clock urban agglomeration
原文传递
Co-seismic fault geometry and slip distribution of the 26 December 2004, giant Sumatra–Andaman earthquake constrained by GPS, coral reef, and remote sensing data 被引量:1
8
作者 Yongge Wan Zheng-kang Shen +5 位作者 Min Wang Yuehua Zeng Jichao Huang Xiang Li Huawei Cui Xiwei Gao 《Earthquake Science》 CSCD 2015年第3期187-195,共9页
We analyze co-seismic displacement field of the 26 December 2004, giant Sumatra–Andaman earthquake derived from Global Position System observations,geological vertical measurement of coral head, and pivot line observ... We analyze co-seismic displacement field of the 26 December 2004, giant Sumatra–Andaman earthquake derived from Global Position System observations,geological vertical measurement of coral head, and pivot line observed through remote sensing. Using the co-seismic displacement field and AK135 spherical layered Earth model, we invert co-seismic slip distribution along the seismic fault. We also search the best fault geometry model to fit the observed data. Assuming that the dip angle linearly increases in downward direction, the postfit residual variation of the inversed geometry model with dip angles linearly changing along fault strike are plotted. The geometry model with local minimum misfits is the one with dip angle linearly increasing along strike from 4.3oin top southernmost patch to 4.5oin top northernmost path and dip angle linearly increased. By using the fault shape and geodetic co-seismic data, we estimate the slip distribution on the curved fault. Our result shows that the earthquake ruptured *200-km width down to a depth of about 60 km.0.5–12.5 m of thrust slip is resolved with the largest slip centered around the central section of the rupture zone78N–108N in latitude. The estimated seismic moment is8.2 9 1022 N m, which is larger than estimation from the centroid moment magnitude(4.0 9 1022 N m), and smaller than estimation from normal-mode oscillation data modeling(1.0 9 1023 N m). 展开更多
关键词 Sumatra–Andaman earthquake Fault geometry Co-seismic slip distribution Geodetic data
在线阅读 下载PDF
A Robust Framework for Multimodal Sentiment Analysis with Noisy Labels Generated from Distributed Data Annotation 被引量:1
9
作者 Kai Jiang Bin Cao Jing Fan 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第6期2965-2984,共20页
Multimodal sentiment analysis utilizes multimodal data such as text,facial expressions and voice to detect people’s attitudes.With the advent of distributed data collection and annotation,we can easily obtain and sha... Multimodal sentiment analysis utilizes multimodal data such as text,facial expressions and voice to detect people’s attitudes.With the advent of distributed data collection and annotation,we can easily obtain and share such multimodal data.However,due to professional discrepancies among annotators and lax quality control,noisy labels might be introduced.Recent research suggests that deep neural networks(DNNs)will overfit noisy labels,leading to the poor performance of the DNNs.To address this challenging problem,we present a Multimodal Robust Meta Learning framework(MRML)for multimodal sentiment analysis to resist noisy labels and correlate distinct modalities simultaneously.Specifically,we propose a two-layer fusion net to deeply fuse different modalities and improve the quality of the multimodal data features for label correction and network training.Besides,a multiple meta-learner(label corrector)strategy is proposed to enhance the label correction approach and prevent models from overfitting to noisy labels.We conducted experiments on three popular multimodal datasets to verify the superiority of ourmethod by comparing it with four baselines. 展开更多
关键词 Distributed data collection multimodal sentiment analysis meta learning learn with noisy labels
在线阅读 下载PDF
Comparative Study and Spatial-Temporal Distribution of Regolith and Rock Geochemical Data from Xingmeng-North China
10
作者 TANG Kun WANG Xueqiu +3 位作者 CHI Qinghua ZHOU Jian LIU Dongsheng LIU Hanliang 《Acta Geologica Sinica(English Edition)》 SCIE CAS CSCD 2017年第S1期229-230,共2页
1 Introduction Geochemical mapping at national and continental scales continues to present challenges worldwide due to variations in geologic and geotectonic units.Use of the proper sampling media can provide rich inf... 1 Introduction Geochemical mapping at national and continental scales continues to present challenges worldwide due to variations in geologic and geotectonic units.Use of the proper sampling media can provide rich information on 展开更多
关键词 In Comparative Study and Spatial-Temporal distribution of Regolith and Rock Geochemical data from Xingmeng-North China ROCK REE
在线阅读 下载PDF
经多端柔性变电站互联的交直流混合配电网数据驱动分布鲁棒运行模型
11
作者 郑焕坤 孙耀斌 韦凯国 《华北电力大学学报(自然科学版)》 北大核心 2025年第3期12-20,31,共10页
随着分布式可再生能源渗透率的提高,交直流混合配电网呈现出明显的分区特性和较强的运行不确定性。多端口电力电子变压器(power electronic transformer,PET)能够用来构建一个功率灵活的通道,以完成交直流混合配电网中几个区域之间的电... 随着分布式可再生能源渗透率的提高,交直流混合配电网呈现出明显的分区特性和较强的运行不确定性。多端口电力电子变压器(power electronic transformer,PET)能够用来构建一个功率灵活的通道,以完成交直流混合配电网中几个区域之间的电能交换。基于此文章提出经多端柔性变电站互联的交直流混合配电网两阶段分布鲁棒运行模型,以储能装置运行维护成本、购电成本、弃风处罚成本、网损和微型燃气轮机发电成本之和最小为优化目标,采用1-范数和∞-范数描述风电输出选择典型场景的概率分布,解决风电的不确定性问题,建立数据驱动分布鲁棒优化模型。然后,采用列约束生成算法(column-and-constraint generation,CCG)求解模型。最后,通过IEEE 33节点算例对模型进行验证,该模型可以有效降低交直流混合配电网运行成本。 展开更多
关键词 交直流混合配电网 多端口电力电子变压器 不确定性 数据驱动分布鲁棒优化
在线阅读 下载PDF
Data-driven intelligent monitoring system for key variables in wastewater treatment process 被引量:6
12
作者 Honggui Han Shuguang Zhu +1 位作者 Junfei Qiao Min Guo 《Chinese Journal of Chemical Engineering》 SCIE EI CAS CSCD 2018年第10期2093-2101,共9页
In wastewater treatment process(WWTP), the accurate and real-time monitoring values of key variables are crucial for the operational strategies. However, most of the existing methods have difficulty in obtaining the r... In wastewater treatment process(WWTP), the accurate and real-time monitoring values of key variables are crucial for the operational strategies. However, most of the existing methods have difficulty in obtaining the real-time values of some key variables in the process. In order to handle this issue, a data-driven intelligent monitoring system, using the soft sensor technique and data distribution service, is developed to monitor the concentrations of effluent total phosphorous(TP) and ammonia nitrogen(NH_4-N). In this intelligent monitoring system, a fuzzy neural network(FNN) is applied for designing the soft sensor model, and a principal component analysis(PCA) method is used to select the input variables of the soft sensor model. Moreover, data transfer software is exploited to insert the soft sensor technique to the supervisory control and data acquisition(SCADA) system. Finally, this proposed intelligent monitoring system is tested in several real plants to demonstrate the reliability and effectiveness of the monitoring performance. 展开更多
关键词 data-DRIVEN Soft sensor Intelligent monitoring system data distribution service Wastewater treatment process
在线阅读 下载PDF
Distributed Parallelization of a Global Atmospheric Data Objective Analysis System 被引量:2
13
作者 赵军 宋君强 李振军 《Advances in Atmospheric Sciences》 SCIE CAS CSCD 2003年第1期159-163,共5页
It is difficult to parallelize a subsistent sequential algorithm. Through analyzing the sequential algorithm of a Global Atmospheric Data Objective Analysis System, this article puts forward a distributed parallel alg... It is difficult to parallelize a subsistent sequential algorithm. Through analyzing the sequential algorithm of a Global Atmospheric Data Objective Analysis System, this article puts forward a distributed parallel algorithm that statically distributes data on a massively parallel processing (MPP) computer. The algorithm realizes distributed parailelization by extracting the analysis boxes and model grid point Iatitude rows with leaped steps, and by distributing the data to different processors. The parallel algorithm achieves good load balancing, high parallel efficiency, and low parallel cost. Performance experiments on a MPP computer arc also presented. 展开更多
关键词 distributed parailelization analysis box data distribution objective analysis
在线阅读 下载PDF
稀缺价格与需求数据驱动的风险规避报童决策
14
作者 赵梦蝶 王长军 周赛玉 《中国管理科学》 北大核心 2025年第2期232-241,共10页
本文研究价格和需求双随机变动下的数据驱动报童决策。其中,考虑决策者可利用的价格和需求历史数据是稀缺的,以及决策者可能具有的风险态度。为此,基于条件风险价值(CVaR)度量,构建两种稀缺数据驱动的鲁棒报童模型:分布式鲁棒CVaR模型... 本文研究价格和需求双随机变动下的数据驱动报童决策。其中,考虑决策者可利用的价格和需求历史数据是稀缺的,以及决策者可能具有的风险态度。为此,基于条件风险价值(CVaR)度量,构建两种稀缺数据驱动的鲁棒报童模型:分布式鲁棒CVaR模型和鲁棒Copula-CVaR模型,并分别给出了等价的半定规划和线性规划形式。最后,将两者与现有的Copula-CVaR模型进行仿真比较。结果表明:随着风险容忍程度的降低,决策者趋向规避风险,此时三种模型的最优订货量均下降;且在同一风险程度下,分布式鲁棒CVaR模型给出的订货量最为保守。此外,样本外测试集结果显示:在市场表现上,两种鲁棒模型均优于已有模型。其中,当决策者风险容忍程度较高时,分布式鲁棒CVaR更优;反之,鲁棒Copula-CVaR更优。在市场结果预估上,分布式鲁棒CVaR总体最好。 展开更多
关键词 稀缺数据驱动 报童模型 条件风险价值 分布式鲁棒 鲁棒Copula
原文传递
含风电电力系统源荷储协同分布鲁棒优化调度
15
作者 张子见 何宇 +3 位作者 张靖 郭元萍 王志杨 胡祥谢 《电子科技》 2025年第8期1-10,共10页
针对随着电力市场化改革的不断推进,大用户直购过程对电力系统灵活性的影响问题,文中提出了考虑风电不确定性与大用户直购电的电力系统分布鲁棒优化调度模型。为降低风电不确定性对电力系统影响,考虑风电预测误差的时序性与区间性,提出... 针对随着电力市场化改革的不断推进,大用户直购过程对电力系统灵活性的影响问题,文中提出了考虑风电不确定性与大用户直购电的电力系统分布鲁棒优化调度模型。为降低风电不确定性对电力系统影响,考虑风电预测误差的时序性与区间性,提出一种具有区间特性的一阶马尔可夫链模型,构建数据驱动的日前两阶段分布鲁棒优化模型。模型的第一阶段以风电场场站总收益最大为目标函数,制定日前第一阶段鲁棒调度方案。日前第二阶段通过调整区域内的可控机组出力来灵活应对风电出力的不确定性。研究结果验证了分布鲁棒优化算法的有效性,并证明了大用户直购电参与调度能够有效提高系统的经济性和调峰能力。 展开更多
关键词 风电 大用户直购电 分布鲁棒优化 马尔可夫链 数据驱动 优化调度 调峰
在线阅读 下载PDF
GF-3 data real-time processing method based on multi-satellite distributed data processing system 被引量:7
16
作者 YANG Jun CAO Yan-dong +2 位作者 SUN Guang-cai XING Meng-dao GUO Liang 《Journal of Central South University》 SCIE EI CAS CSCD 2020年第3期842-852,共11页
Due to the limited scenes that synthetic aperture radar(SAR)satellites can detect,the full-track utilization rate is not high.Because of the computing and storage limitation of one satellite,it is difficult to process... Due to the limited scenes that synthetic aperture radar(SAR)satellites can detect,the full-track utilization rate is not high.Because of the computing and storage limitation of one satellite,it is difficult to process large amounts of data of spaceborne synthetic aperture radars.It is proposed to use a new method of networked satellite data processing for improving the efficiency of data processing.A multi-satellite distributed SAR real-time processing method based on Chirp Scaling(CS)imaging algorithm is studied in this paper,and a distributed data processing system is built with field programmable gate array(FPGA)chips as the kernel.Different from the traditional CS algorithm processing,the system divides data processing into three stages.The computing tasks are reasonably allocated to different data processing units(i.e.,satellites)in each stage.The method effectively saves computing and storage resources of satellites,improves the utilization rate of a single satellite,and shortens the data processing time.Gaofen-3(GF-3)satellite SAR raw data is processed by the system,with the performance of the method verified. 展开更多
关键词 synthetic aperture radar full-track utilization rate distributed data processing CS imaging algorithm field programmable gate array Gaofen-3
在线阅读 下载PDF
Distributed anonymous data perturbation method for privacy-preserving data mining 被引量:4
17
作者 Feng LI Jin MA Jian-hua LI 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2009年第7期952-963,共12页
Privacy is a critical requirement in distributed data mining. Cryptography-based secure multiparty computation is a main approach for privacy preserving. However, it shows poor performance in large scale distributed s... Privacy is a critical requirement in distributed data mining. Cryptography-based secure multiparty computation is a main approach for privacy preserving. However, it shows poor performance in large scale distributed systems. Meanwhile, data perturbation techniques are comparatively efficient but are mainly used in centralized privacy-preserving data mining (PPDM). In this paper, we propose a light-weight anonymous data perturbation method for efficient privacy preserving in distributed data mining. We first define the privacy constraints for data perturbation based PPDM in a semi-honest distributed environment. Two protocols are proposed to address these constraints and protect data statistics and the randomization process against collusion attacks: the adaptive privacy-preserving summary protocol and the anonymous exchange protocol. Finally, a distributed data perturbation framework based on these protocols is proposed to realize distributed PPDM. Experiment results show that our approach achieves a high security level and is very efficient in a large scale distributed environment. 展开更多
关键词 Privacy-preserving data mining (PPDM) Distributed data mining data perturbation
原文传递
A Distributed Data Mining System Based on Multi-agent Technology 被引量:1
18
作者 郭黎明 张艳珍 《Journal of Donghua University(English Edition)》 EI CAS 2006年第6期80-83,共4页
Distributed Data Mining is expected to discover preciously unknown, implicit and valuable information from massive data set inherently distributed over a network. In recent years several approaches to distributed data... Distributed Data Mining is expected to discover preciously unknown, implicit and valuable information from massive data set inherently distributed over a network. In recent years several approaches to distributed data mining have been developed, but only a few of them make use of intelligent agents. This paper provides the reason for applying Multi-Agent Technology in Distributed Data Mining and presents a Distributed Data Mining System based on Multi-Agent Technology that deals with heterogeneity in such environment. Based on the advantages of both the CS model and agent-based model, the system is being able to address the specific concern of increasing scalability and enhancing performance. 展开更多
关键词 Distributed data Mining MULTI-AGENT CORBA Client/Server.
在线阅读 下载PDF
Remote Control for the HT-7 Distributed Data Acquisition System
19
作者 岳冬利 罗家融 +1 位作者 王枫 朱琳 《Plasma Science and Technology》 SCIE EI CAS CSCD 2003年第4期1881-1886,共6页
HT-7 is the first superconducting tokamak device for fusion research in China. Many experiments have been done in the machine since 1994, and lots of satisfactory results have been achieved in the fusion research fiel... HT-7 is the first superconducting tokamak device for fusion research in China. Many experiments have been done in the machine since 1994, and lots of satisfactory results have been achieved in the fusion research field on HT-7 tokamak [1]. With the development of fusion research, remote control of experiment becomes more and more important to improve experimental efficiency and expand research results. This paper will describe a RCS (Remote Control System), the combined model of Browser/Server and Client/Server, based on Internet of HT-7 distributed data acquisition system (HT7DAS). By means of RCS, authorized users all over the world can control and configure HT7DAS remotely. The RCS is designed to improve the flexibility, opening, reliability and efficiency of HT7DAS. In the paper, the whole process of design along with implementation of the system and some key items are discussed in detail. The System has been successfully operated during HT-7 experiment in 2002 campaign period. 展开更多
关键词 TOKAMAK HT-7 distributed data acquisition
在线阅读 下载PDF
An Adaptive Privacy Preserving Framework for Distributed Association Rule Mining in Healthcare Databases
20
作者 Hasanien K.Kuba Mustafa A.Azzawi +2 位作者 Saad M.Darwish Oday A.Hassen Ansam A.Abdulhussein 《Computers, Materials & Continua》 SCIE EI 2023年第2期4119-4133,共15页
It is crucial,while using healthcare data,to assess the advantages of data privacy against the possible drawbacks.Data from several sources must be combined for use in many data mining applications.The medical practit... It is crucial,while using healthcare data,to assess the advantages of data privacy against the possible drawbacks.Data from several sources must be combined for use in many data mining applications.The medical practitioner may use the results of association rule mining performed on this aggregated data to better personalize patient care and implement preventive measures.Historically,numerous heuristics(e.g.,greedy search)and metaheuristics-based techniques(e.g.,evolutionary algorithm)have been created for the positive association rule in privacy preserving data mining(PPDM).When it comes to connecting seemingly unrelated diseases and drugs,negative association rules may be more informative than their positive counterparts.It is well-known that during negative association rules mining,a large number of uninteresting rules are formed,making this a difficult problem to tackle.In this research,we offer an adaptive method for negative association rule mining in vertically partitioned healthcare datasets that respects users’privacy.The applied approach dynamically determines the transactions to be interrupted for information hiding,as opposed to predefining them.This study introduces a novel method for addressing the problem of negative association rules in healthcare data mining,one that is based on the Tabu-genetic optimization paradigm.Tabu search is advantageous since it removes a huge number of unnecessary rules and item sets.Experiments using benchmark healthcare datasets prove that the discussed scheme outperforms state-of-the-art solutions in terms of decreasing side effects and data distortions,as measured by the indicator of hiding failure. 展开更多
关键词 Distributed data mining evolutionary computation sanitization process healthcare informatics
在线阅读 下载PDF
上一页 1 2 4 下一页 到第
使用帮助 返回顶部