The Internet now is a large-scale platform with big data. Finding truth from a huge dataset has attracted extensive attention, which can maintain the quality of data collected by users and provide users with accurate ...The Internet now is a large-scale platform with big data. Finding truth from a huge dataset has attracted extensive attention, which can maintain the quality of data collected by users and provide users with accurate and efficient data. However, current truth finder algorithms are unsatisfying, because of their low accuracy and complication. This paper proposes a truth finder algorithm based on entity attributes (TFAEA). Based on the iterative computation of source reliability and fact accuracy, TFAEA considers the interactive degree among facts and the degree of dependence among sources, to simplify the typical truth finder algorithms. In order to improve the accuracy of them, TFAEA combines the one-way text similarity and the factual conflict to calculate the mutual support degree among facts. Furthermore, TFAEA utilizes the symmetric saturation of data sources to calculate the degree of dependence among sources. The experimental results show that TFAEA is not only more stable, but also more accurate than the typical truth finder algorithms.展开更多
With the extensive application of software collaborative development technology,the processing of code data generated in programming scenes has become a research hotspot.In the collaborative programming process,differ...With the extensive application of software collaborative development technology,the processing of code data generated in programming scenes has become a research hotspot.In the collaborative programming process,different users can submit code in a distributed way.The consistency of code grammar can be achieved by syntax constraints.However,when different users work on the same code in semantic development programming practices,the development factors of different users will inevitably lead to the problem of data semantic conflict.In this paper,the characteristics of code segment data in a programming scene are considered.The code sequence can be obtained by disassembling the code segment using lexical analysis technology.Combined with a traditional solution of a data conflict problem,the code sequence can be taken as the declared value object in the data conflict resolution problem.Through the similarity analysis of code sequence objects,the concept of the deviation degree between the declared value object and the truth value object is proposed.A multi-truth discovery algorithm,called the multiple truth discovery algorithm based on deviation(MTDD),is proposed.The basic methods,such as Conflict Resolution on Heterogeneous Data,Voting-K,and MTRuths_Greedy,are compared to verify the performance and precision of the proposed MTDD algorithm.展开更多
With the rocketing progress of the Internet, it is easier for people to get information about the objects that they are interested in. However, this information usually has conflicts. In order to resolve conflicts and...With the rocketing progress of the Internet, it is easier for people to get information about the objects that they are interested in. However, this information usually has conflicts. In order to resolve conflicts and get the true information, truth discovery has been proposed and received widespread attention. Many algorithms have been proposed to adapt to different scenarios. This paper aims to investigate these algorithms and summarize them from the perspective of algorithm models and specific concepts. Some classic datasets and evaluation metrics are given in this paper. Some future directions for readers are also provided to better understand the field of truth discovery.展开更多
With the continuous expansion of the data center network scale, changing network requirements, and increasing pressure on network bandwidth, the traditional network architecture can no longer meet people’s needs. The...With the continuous expansion of the data center network scale, changing network requirements, and increasing pressure on network bandwidth, the traditional network architecture can no longer meet people’s needs. The development of software defined networks has brought new opportunities and challenges to future networks. The data and control separation characteristics of SDN improve the performance of the entire network. Researchers have integrated SDN architecture into data centers to improve network resource utilization and performance. This paper first introduces the basic concepts of SDN and data center networks. Then it discusses SDN-based load balancing mechanisms for data centers from different perspectives. Finally, it summarizes and looks forward to the study on SDN-based load balancing mechanisms and its development trend.展开更多
准确判别燃爆状态是测量燃爆延滞期并计算爆发点参数的关键。针对单一传感器判别效果不佳、多个传感器判别结果相互冲突的问题,利用D-S(Dempster-Shafer)证据理论对冲突证据进行联合判别。首先根据含能材料燃爆特性和爆发点测试原理,设...准确判别燃爆状态是测量燃爆延滞期并计算爆发点参数的关键。针对单一传感器判别效果不佳、多个传感器判别结果相互冲突的问题,利用D-S(Dempster-Shafer)证据理论对冲突证据进行联合判别。首先根据含能材料燃爆特性和爆发点测试原理,设计了基于温度和声音的联合判别装置;从实验数据出发,采用模型拟合提取温度特征值,以及声音信号最大值为声音特征值。其次,根据Sigmoid模型求解出BPA(Basic Probability Assignment)函数,并通过信度熵对可能存在冲突的BPA函数值进行预处理;最终,利用D-S证据理论进行燃爆状态联合判别。实验结果表明,所提方法有效提高了实验装置的鲁棒性和状态判别的置信概率,燃爆判别准确率达到了96.5%,优于温度、声音等单一传感器的判别效果。展开更多
近年来,随机排列集(Random permutation set (RPS))理论的提出吸引了许多学者对其进行探索.该理论从集合的角度对信息进行处理和融合,从而实现对有序信息不确定性的研究.然而,随机排列集理论中的组合规则在处理冲突信息时存在局限性,利...近年来,随机排列集(Random permutation set (RPS))理论的提出吸引了许多学者对其进行探索.该理论从集合的角度对信息进行处理和融合,从而实现对有序信息不确定性的研究.然而,随机排列集理论中的组合规则在处理冲突信息时存在局限性,利用理论中现有的算法做决策时,会出现反直觉的结果.为了解决这些问题,提出了一个基于Symmetric Renyi-Permutation (SRP)散度和随机排列集熵的融合算法.该算法首先通过SRP散度计算RPS的差异并进行加权得到子集,以充分考虑每个有效RPS对融合结果的影响.然后为了降低有序性信息的不确定性,对子集的熵进行计算确定融合所用的最终加权子集,然后做出决策.实验结果表明,该算法解决了有序信息冲突的问题,并且具有较高的准确率.展开更多
基金supported by the National Natural Science Foundation of China(61472192)the Scientific and Technological Support Project(Society)of Jiangsu Province(BE2016776)
文摘The Internet now is a large-scale platform with big data. Finding truth from a huge dataset has attracted extensive attention, which can maintain the quality of data collected by users and provide users with accurate and efficient data. However, current truth finder algorithms are unsatisfying, because of their low accuracy and complication. This paper proposes a truth finder algorithm based on entity attributes (TFAEA). Based on the iterative computation of source reliability and fact accuracy, TFAEA considers the interactive degree among facts and the degree of dependence among sources, to simplify the typical truth finder algorithms. In order to improve the accuracy of them, TFAEA combines the one-way text similarity and the factual conflict to calculate the mutual support degree among facts. Furthermore, TFAEA utilizes the symmetric saturation of data sources to calculate the degree of dependence among sources. The experimental results show that TFAEA is not only more stable, but also more accurate than the typical truth finder algorithms.
基金supported by the National Key R&D Program of China(No.2018YFB1003905)the National Natural Science Foundation of China under Grant(No.61971032)Fundamental Research Funds for the Central Universities(No.FRF-TP-18-008A3).
文摘With the extensive application of software collaborative development technology,the processing of code data generated in programming scenes has become a research hotspot.In the collaborative programming process,different users can submit code in a distributed way.The consistency of code grammar can be achieved by syntax constraints.However,when different users work on the same code in semantic development programming practices,the development factors of different users will inevitably lead to the problem of data semantic conflict.In this paper,the characteristics of code segment data in a programming scene are considered.The code sequence can be obtained by disassembling the code segment using lexical analysis technology.Combined with a traditional solution of a data conflict problem,the code sequence can be taken as the declared value object in the data conflict resolution problem.Through the similarity analysis of code sequence objects,the concept of the deviation degree between the declared value object and the truth value object is proposed.A multi-truth discovery algorithm,called the multiple truth discovery algorithm based on deviation(MTDD),is proposed.The basic methods,such as Conflict Resolution on Heterogeneous Data,Voting-K,and MTRuths_Greedy,are compared to verify the performance and precision of the proposed MTDD algorithm.
基金Fundamental Research Funds for the Central Universities,China (No. 22D111207)。
文摘With the rocketing progress of the Internet, it is easier for people to get information about the objects that they are interested in. However, this information usually has conflicts. In order to resolve conflicts and get the true information, truth discovery has been proposed and received widespread attention. Many algorithms have been proposed to adapt to different scenarios. This paper aims to investigate these algorithms and summarize them from the perspective of algorithm models and specific concepts. Some classic datasets and evaluation metrics are given in this paper. Some future directions for readers are also provided to better understand the field of truth discovery.
文摘With the continuous expansion of the data center network scale, changing network requirements, and increasing pressure on network bandwidth, the traditional network architecture can no longer meet people’s needs. The development of software defined networks has brought new opportunities and challenges to future networks. The data and control separation characteristics of SDN improve the performance of the entire network. Researchers have integrated SDN architecture into data centers to improve network resource utilization and performance. This paper first introduces the basic concepts of SDN and data center networks. Then it discusses SDN-based load balancing mechanisms for data centers from different perspectives. Finally, it summarizes and looks forward to the study on SDN-based load balancing mechanisms and its development trend.
文摘准确判别燃爆状态是测量燃爆延滞期并计算爆发点参数的关键。针对单一传感器判别效果不佳、多个传感器判别结果相互冲突的问题,利用D-S(Dempster-Shafer)证据理论对冲突证据进行联合判别。首先根据含能材料燃爆特性和爆发点测试原理,设计了基于温度和声音的联合判别装置;从实验数据出发,采用模型拟合提取温度特征值,以及声音信号最大值为声音特征值。其次,根据Sigmoid模型求解出BPA(Basic Probability Assignment)函数,并通过信度熵对可能存在冲突的BPA函数值进行预处理;最终,利用D-S证据理论进行燃爆状态联合判别。实验结果表明,所提方法有效提高了实验装置的鲁棒性和状态判别的置信概率,燃爆判别准确率达到了96.5%,优于温度、声音等单一传感器的判别效果。