As a new type of production factor in healthcare,healthcare data elements have been rapidly integrated into various health production processes,such as clinical assistance,health management,biological testing,and oper...As a new type of production factor in healthcare,healthcare data elements have been rapidly integrated into various health production processes,such as clinical assistance,health management,biological testing,and operation and supervision[1,2].Healthcare data elements include biolog.ical and clinical data that are related to disease,environ-mental health data that are associated with life,and operational and healthcare management data that are related to healthcare activities(Figure 1).Activities such as the construction of a data value assessment system,the devel-opment of a data circulation and sharing platform,and the authorization of data compliance and operation products support the strong growth momentum of the market for health care data elements in China[3].展开更多
The data production elements are driving profound transformations in the real economy across production objects,methods,and tools,generating significant economic effects such as industrial structure upgrading.This pap...The data production elements are driving profound transformations in the real economy across production objects,methods,and tools,generating significant economic effects such as industrial structure upgrading.This paper aims to reveal the impact mechanism of the data elements on the“three transformations”(high-end,intelligent,and green)in the manufacturing sector,theoretically elucidating the intrinsic mechanisms by which the data elements influence these transformations.The study finds that the data elements significantly enhance the high-end,intelligent,and green levels of China's manufacturing industry.In terms of the pathways of impact,the data elements primarily influence the development of high-tech industries and overall green technological innovation,thereby affecting the high-end,intelligent,and green transformation of the industry.展开更多
With increasing demand for data circulation,ensuring data security and privacy is paramount,specifically protecting privacy while maximizing utility.Blockchain,while decentralized and transparent,faces challenges in p...With increasing demand for data circulation,ensuring data security and privacy is paramount,specifically protecting privacy while maximizing utility.Blockchain,while decentralized and transparent,faces challenges in privacy protection and data verification,especially for sensitive data.Existing schemes often suffer from inefficiency and high overhead.We propose a privacy protection scheme using BGV homomorphic encryption and Pedersen Secret Sharing.This scheme enables secure computation on encrypted data,with Pedersen sharding and verifying the private key,ensuring data consistency and immutability.The blockchain framework manages key shards,verifies secrets,and aids security auditing.This approach allows for trusted computation without revealing the underlying data.Preliminary results demonstrate the scheme's feasibility in ensuring data privacy and security,making data available but not visible.This study provides an effective solution for data sharing and privacy protection in blockchain applications.展开更多
This article explores the characteristics of data resources from the perspective of production factors,analyzes the demand for trustworthy circulation technology,designs a fusion architecture and related solutions,inc...This article explores the characteristics of data resources from the perspective of production factors,analyzes the demand for trustworthy circulation technology,designs a fusion architecture and related solutions,including multi-party data intersection calculation,distributed machine learning,etc.It also compares performance differences,conducts formal verification,points out the value and limitations of architecture innovation,and looks forward to future opportunities.展开更多
This report analyzes the existing problems in terminology referring to clinical symptoms of traditional Chinese medicine(TCM)from the viewpoint of data sharing and elaborates the necessity of establishing a standard d...This report analyzes the existing problems in terminology referring to clinical symptoms of traditional Chinese medicine(TCM)from the viewpoint of data sharing and elaborates the necessity of establishing a standard directory of clinical data elements of TCM.We evaluated the principles and methods of data element extraction according to the status quo of the clinical information system and characteristics of symptoms for TCM and consequently proposed a three-layer model for optimal extraction.展开更多
For digitalization of traditional Chinese medicine(TCM),research is being conducted on objectivization of diagnosis and treatment,mathematical models of TCM theories,and application of modern information technology to...For digitalization of traditional Chinese medicine(TCM),research is being conducted on objectivization of diagnosis and treatment,mathematical models of TCM theories,and application of modern information technology to digitize the vast amounts of existing information.However,the author believes that TCM practitioners should first conduct a systematic and comprehensive refined analysis on the knowledge of TCM and unify data elements used in computer intelligence to avoid ambiguity.Thus,we must overcome the epistemological constraints and carefully analyze the relationship among data elements to achieve systematic results and administer TCM appropriately.展开更多
As the era of large-scale highway maintenance arrives,the maintenance strategies have transitioned to a holistic approach that prioritizes safety,economic feasibility,and environmental sustainability.This research int...As the era of large-scale highway maintenance arrives,the maintenance strategies have transitioned to a holistic approach that prioritizes safety,economic feasibility,and environmental sustainability.This research introduces a multi-objective optimization model for highway maintenance that incorporates the interplay of decision-maker preferences across three key objectives:Highway safety performance,maintenance engineering cost,and carbon emissions.This study employs a large-sample data analysis on a subset of the Lianhuo Highway network,which includes 2,842 pavement sections.This approach mitigates the impact of outliers,ensuring a substantial data buffer that fortifies the model’s capacity for generalization and bolsters its robustness.The findings reveal a Pareto-optimal relationship among the three scrutinized variables.A particularly noteworthy observation is the M-shaped trajectory of carbon emissions,which initially rise,then decline,and ultimately rebound,contingent upon the selected maintenance strategy.Furthermore,an examination of the relationship between maintenance costs and safety performance discloses a trend of diminishing marginal returns,illustrating that the incremental gains in safety performance attenuate as maintenance investment escalates.展开更多
This paper explores the development of interpretable data elements from raw data using Kolmogorov-Arnold Networks(KAN).With the exponential growth of data in contemporary society,there is an urgent need for effective ...This paper explores the development of interpretable data elements from raw data using Kolmogorov-Arnold Networks(KAN).With the exponential growth of data in contemporary society,there is an urgent need for effective data processing methods to unlock the full potential of this resource.The study focuses on the application of KAN in the transportation sector to transform raw traffic data into meaningful data elements.The core of the research is the KANT-GCN model,which synergizes Kolmogorov-Arnold Networks with Temporal Graph Convolutional Networks(T-GCN).This innovative model demonstrates superior performance in predicting traffic speeds,outperforming existing methods in terms of accuracy,reliability,and interpretability.The model was evaluated using real-world datasets from Shenzhen,Los Angeles,and the San Francisco Bay Area,showing significant improvements in different metrics.The paper highlights the potential of KAN-T-GCN to revolutionize data-driven decision-making in traffic management and other sectors,underscoring its ability to handle dynamic updates and maintain data integrity.展开更多
The increasing dependence on data highlights the need for a detailed understanding of its behavior,encompassing the challenges involved in processing and evaluating it.However,current research lacks a comprehensive st...The increasing dependence on data highlights the need for a detailed understanding of its behavior,encompassing the challenges involved in processing and evaluating it.However,current research lacks a comprehensive structure for measuring the worth of data elements,hindering effective navigation of the changing digital environment.This paper aims to fill this research gap by introducing the innovative concept of“data components.”It proposes a graphtheoretic representation model that presents a clear mathematical definition and demonstrates the superiority of data components over traditional processing methods.Additionally,the paper introduces an information measurement model that provides a way to calculate the information entropy of data components and establish their increased informational value.The paper also assesses the value of information,suggesting a pricing mechanism based on its significance.In conclusion,this paper establishes a robust framework for understanding and quantifying the value of implicit information in data,laying the groundwork for future research and practical applications.展开更多
Flexible roll forming is a promising manufacturing method for the production of variable cross section products. Considering the large plastic strain in this forming process which is much larger than that of uniform d...Flexible roll forming is a promising manufacturing method for the production of variable cross section products. Considering the large plastic strain in this forming process which is much larger than that of uniform deformation phase of uniaxial tensile test, the widely adopted method of simulating the forming processes with non-supplemented material data from uniaxial tensile test will certainly lead to large error. To reduce this error, the material data is supplemented based on three constitutive models. Then a finite element model of a six passes flexible roll forming process is established based on the supplemented material data and the original material data from the uniaxial tensile test. The flexible roll forming experiment of a B pillar reinforcing plate is carried out to verify the proposed method. Final cross section shapes of the experimental and the simulated results are compared. It is shown that the simulation calculated with supplemented material data based on Swift model agrees well with the experimental results, while the simulation based on original material data could not predict the actual deformation accurately. The results indicate that this material supplement method is reliable and indispensible, and the simulation model can well reflect the real metal forming process. Detailed analysis of the distribution and history of plastic strain at different positions are performed. A new material data supplement method is proposed to tackle the problem which is ignored in other roll forming simulations, and thus the forming process simulation accuracy can be greatly improved.展开更多
For the regression model about longitudinal data, we combine the robust estimation equation with the elemental empirical likelihood method, and propose an efficient robust estimator, where the robust estimation equati...For the regression model about longitudinal data, we combine the robust estimation equation with the elemental empirical likelihood method, and propose an efficient robust estimator, where the robust estimation equation is based on bounded scoring function and the covariate depended weight function. This method reduces the influence of outliers in response variables and covariates on parameter estimation, takes into account the correlation between data, and improves the efficiency of estimation. The simulation results show that the proposed method is robust and efficient.展开更多
A multivariate statistical analysis was performed on multi-element soil geochemical data from the Koda Hill-Bulenga gold prospects in the Wa-Lawra gold belt, northwest Ghana. The objectives of the study were to define...A multivariate statistical analysis was performed on multi-element soil geochemical data from the Koda Hill-Bulenga gold prospects in the Wa-Lawra gold belt, northwest Ghana. The objectives of the study were to define gold relationships with other trace elements to determine possible pathfinder elements for gold from the soil geochemical data. The study focused on seven elements, namely, Au, Fe, Pb, Mn, Ag, As and Cu. Factor analysis and hierarchical cluster analysis were performed on the analyzed samples. Factor analysis explained 79.093% of the total variance of the data through three factors. This had the gold factor being factor 3, having associations of copper, iron, lead and manganese and accounting for 20.903% of the total variance. From hierarchical clustering, gold was also observed to be clustering with lead, copper, arsenic and silver. There was further indication that, gold concentrations were lower than that of its associations. It can be inferred from the results that, the occurrence of gold and its associated elements can be linked to both primary dispersion from underlying rocks and secondary processes such as lateritization. This data shows that Fe and Mn strongly associated with gold, and alongside Pb, Ag, As and Cu, these elements can be used as pathfinders for gold in the area, with ferruginous zones as targets.展开更多
An experience is presented using the finite element method (FEM) and data mining (DM) techniques to develop models that can be used to optimieze the skin-pass rolling process based on its operating conditions. A F...An experience is presented using the finite element method (FEM) and data mining (DM) techniques to develop models that can be used to optimieze the skin-pass rolling process based on its operating conditions. A FE model based on a real skin-pass process is built and validated. Based on this model, a group of FE models is simulated with different adjustment parameters and with different materials for the sheet; both variables are chosen from pre-set ranges, From all FE model simulations, a database is generated; this database is made up of the above mentioned adjustment parameters, sheet properties and the variables of the process arising from the simulation of the model. Various types of data mining algorithms are used to develop predictive models for each of the variables of the process.The best predictive models can be used to predict experimentally hard-to-measure variables (internal stresses, internal straine, etc.) which are useful in the optimal design of the process or to be applied in real time control systems of a skin-pass process in -plant.展开更多
Exploiting mobile elements (MEs) to accomplish data collection in wireless sensor networks (WSNs) can improve the energy efficiency of sensor nodes, and prolong network lifetime. However, it will lead to large dat...Exploiting mobile elements (MEs) to accomplish data collection in wireless sensor networks (WSNs) can improve the energy efficiency of sensor nodes, and prolong network lifetime. However, it will lead to large data collection latency for the network, which is unacceptable for data-critical applications. In this paper, we address this problem by minimizing the traveling length of MEs. Our methods mainly consist of two steps: we first construct a virtual grid network and select the minimal stop point set (SPS) from it; then, we make optimal scheduling for the MEs based on the SPS in order to minimize their traveling length. Different implementations of genetic algorithm (GA) are used to solve the problem. Our methods are evaluated by extensive simulations. The results show that these methods can greatly reduce the traveling length of MEs, and decrease the data collection latency.展开更多
基金supported by National Natural Science Foundation of China(Grants 72474022,71974011,72174022,71972012,71874009)"BIT think tank"Promotion Plan of Science and Technology Innovation Program of Beijing Institute of Technology(Grants 2024CX14017,2023CX13029).
文摘As a new type of production factor in healthcare,healthcare data elements have been rapidly integrated into various health production processes,such as clinical assistance,health management,biological testing,and operation and supervision[1,2].Healthcare data elements include biolog.ical and clinical data that are related to disease,environ-mental health data that are associated with life,and operational and healthcare management data that are related to healthcare activities(Figure 1).Activities such as the construction of a data value assessment system,the devel-opment of a data circulation and sharing platform,and the authorization of data compliance and operation products support the strong growth momentum of the market for health care data elements in China[3].
文摘The data production elements are driving profound transformations in the real economy across production objects,methods,and tools,generating significant economic effects such as industrial structure upgrading.This paper aims to reveal the impact mechanism of the data elements on the“three transformations”(high-end,intelligent,and green)in the manufacturing sector,theoretically elucidating the intrinsic mechanisms by which the data elements influence these transformations.The study finds that the data elements significantly enhance the high-end,intelligent,and green levels of China's manufacturing industry.In terms of the pathways of impact,the data elements primarily influence the development of high-tech industries and overall green technological innovation,thereby affecting the high-end,intelligent,and green transformation of the industry.
基金supported by the National Key Research and Development Plan in China(Grant No.2020YFB1005500)。
文摘With increasing demand for data circulation,ensuring data security and privacy is paramount,specifically protecting privacy while maximizing utility.Blockchain,while decentralized and transparent,faces challenges in privacy protection and data verification,especially for sensitive data.Existing schemes often suffer from inefficiency and high overhead.We propose a privacy protection scheme using BGV homomorphic encryption and Pedersen Secret Sharing.This scheme enables secure computation on encrypted data,with Pedersen sharding and verifying the private key,ensuring data consistency and immutability.The blockchain framework manages key shards,verifies secrets,and aids security auditing.This approach allows for trusted computation without revealing the underlying data.Preliminary results demonstrate the scheme's feasibility in ensuring data privacy and security,making data available but not visible.This study provides an effective solution for data sharing and privacy protection in blockchain applications.
文摘This article explores the characteristics of data resources from the perspective of production factors,analyzes the demand for trustworthy circulation technology,designs a fusion architecture and related solutions,including multi-party data intersection calculation,distributed machine learning,etc.It also compares performance differences,conducts formal verification,points out the value and limitations of architecture innovation,and looks forward to future opportunities.
基金funding support from the Innovation Platform Open Fund Project of Hunan Provincial Universities (No. 13K076)National Key Discipline Open Fund Project of TCM diagnostics in Hunan University of Chinese Medicine (2015zyzd18)
文摘This report analyzes the existing problems in terminology referring to clinical symptoms of traditional Chinese medicine(TCM)from the viewpoint of data sharing and elaborates the necessity of establishing a standard directory of clinical data elements of TCM.We evaluated the principles and methods of data element extraction according to the status quo of the clinical information system and characteristics of symptoms for TCM and consequently proposed a three-layer model for optimal extraction.
基金the funding support from the National Natural Science Foundation of China(No.81373702)
文摘For digitalization of traditional Chinese medicine(TCM),research is being conducted on objectivization of diagnosis and treatment,mathematical models of TCM theories,and application of modern information technology to digitize the vast amounts of existing information.However,the author believes that TCM practitioners should first conduct a systematic and comprehensive refined analysis on the knowledge of TCM and unify data elements used in computer intelligence to avoid ambiguity.Thus,we must overcome the epistemological constraints and carefully analyze the relationship among data elements to achieve systematic results and administer TCM appropriately.
基金Supported by the National Natural Science Foundation of China(72471223,72231010)。
文摘As the era of large-scale highway maintenance arrives,the maintenance strategies have transitioned to a holistic approach that prioritizes safety,economic feasibility,and environmental sustainability.This research introduces a multi-objective optimization model for highway maintenance that incorporates the interplay of decision-maker preferences across three key objectives:Highway safety performance,maintenance engineering cost,and carbon emissions.This study employs a large-sample data analysis on a subset of the Lianhuo Highway network,which includes 2,842 pavement sections.This approach mitigates the impact of outliers,ensuring a substantial data buffer that fortifies the model’s capacity for generalization and bolsters its robustness.The findings reveal a Pareto-optimal relationship among the three scrutinized variables.A particularly noteworthy observation is the M-shaped trajectory of carbon emissions,which initially rise,then decline,and ultimately rebound,contingent upon the selected maintenance strategy.Furthermore,an examination of the relationship between maintenance costs and safety performance discloses a trend of diminishing marginal returns,illustrating that the incremental gains in safety performance attenuate as maintenance investment escalates.
基金supported by the EU H2020 Research and Innovation Program under the Marie Sklodowska-Curie Grant Agreement(Project-DEEP,Grant No.101109045)the National Natural Science Foundation of China(No.NSFC 61925105 and 62171257)the Tsinghua University-China Mobile Communications Group Co.,Ltd.Joint Institute,and the Fundamental Research Funds for the Central Universities,China(No.FRF-NP-20-03).
文摘This paper explores the development of interpretable data elements from raw data using Kolmogorov-Arnold Networks(KAN).With the exponential growth of data in contemporary society,there is an urgent need for effective data processing methods to unlock the full potential of this resource.The study focuses on the application of KAN in the transportation sector to transform raw traffic data into meaningful data elements.The core of the research is the KANT-GCN model,which synergizes Kolmogorov-Arnold Networks with Temporal Graph Convolutional Networks(T-GCN).This innovative model demonstrates superior performance in predicting traffic speeds,outperforming existing methods in terms of accuracy,reliability,and interpretability.The model was evaluated using real-world datasets from Shenzhen,Los Angeles,and the San Francisco Bay Area,showing significant improvements in different metrics.The paper highlights the potential of KAN-T-GCN to revolutionize data-driven decision-making in traffic management and other sectors,underscoring its ability to handle dynamic updates and maintain data integrity.
基金supported by the EU H2020 Research and Innovation Program under the Marie Sklodowska-Curie Grant Agreement(Project-DEEP,Grant number:101109045)National Key R&D Program of China with Grant number 2018YFB1800804+2 种基金the National Natural Science Foundation of China(Nos.NSFC 61925105,and 62171257)Tsinghua University-China Mobile Communications Group Co.,Ltd,Joint Institutethe Fundamental Research Funds for the Central Universities,China(No.FRF-NP-20-03)。
文摘The increasing dependence on data highlights the need for a detailed understanding of its behavior,encompassing the challenges involved in processing and evaluating it.However,current research lacks a comprehensive structure for measuring the worth of data elements,hindering effective navigation of the changing digital environment.This paper aims to fill this research gap by introducing the innovative concept of“data components.”It proposes a graphtheoretic representation model that presents a clear mathematical definition and demonstrates the superiority of data components over traditional processing methods.Additionally,the paper introduces an information measurement model that provides a way to calculate the information entropy of data components and establish their increased informational value.The paper also assesses the value of information,suggesting a pricing mechanism based on its significance.In conclusion,this paper establishes a robust framework for understanding and quantifying the value of implicit information in data,laying the groundwork for future research and practical applications.
基金Supported by National Natural Science Foundation of China(Grant Nos.51205004,51475003)Beijing Municipal Natural Science Foundation of China(Grant No.3152010)Beijing Municipal Education Committee Science and Technology Program,China(Grant No.KM201510009004)
文摘Flexible roll forming is a promising manufacturing method for the production of variable cross section products. Considering the large plastic strain in this forming process which is much larger than that of uniform deformation phase of uniaxial tensile test, the widely adopted method of simulating the forming processes with non-supplemented material data from uniaxial tensile test will certainly lead to large error. To reduce this error, the material data is supplemented based on three constitutive models. Then a finite element model of a six passes flexible roll forming process is established based on the supplemented material data and the original material data from the uniaxial tensile test. The flexible roll forming experiment of a B pillar reinforcing plate is carried out to verify the proposed method. Final cross section shapes of the experimental and the simulated results are compared. It is shown that the simulation calculated with supplemented material data based on Swift model agrees well with the experimental results, while the simulation based on original material data could not predict the actual deformation accurately. The results indicate that this material supplement method is reliable and indispensible, and the simulation model can well reflect the real metal forming process. Detailed analysis of the distribution and history of plastic strain at different positions are performed. A new material data supplement method is proposed to tackle the problem which is ignored in other roll forming simulations, and thus the forming process simulation accuracy can be greatly improved.
文摘For the regression model about longitudinal data, we combine the robust estimation equation with the elemental empirical likelihood method, and propose an efficient robust estimator, where the robust estimation equation is based on bounded scoring function and the covariate depended weight function. This method reduces the influence of outliers in response variables and covariates on parameter estimation, takes into account the correlation between data, and improves the efficiency of estimation. The simulation results show that the proposed method is robust and efficient.
文摘A multivariate statistical analysis was performed on multi-element soil geochemical data from the Koda Hill-Bulenga gold prospects in the Wa-Lawra gold belt, northwest Ghana. The objectives of the study were to define gold relationships with other trace elements to determine possible pathfinder elements for gold from the soil geochemical data. The study focused on seven elements, namely, Au, Fe, Pb, Mn, Ag, As and Cu. Factor analysis and hierarchical cluster analysis were performed on the analyzed samples. Factor analysis explained 79.093% of the total variance of the data through three factors. This had the gold factor being factor 3, having associations of copper, iron, lead and manganese and accounting for 20.903% of the total variance. From hierarchical clustering, gold was also observed to be clustering with lead, copper, arsenic and silver. There was further indication that, gold concentrations were lower than that of its associations. It can be inferred from the results that, the occurrence of gold and its associated elements can be linked to both primary dispersion from underlying rocks and secondary processes such as lateritization. This data shows that Fe and Mn strongly associated with gold, and alongside Pb, Ag, As and Cu, these elements can be used as pathfinders for gold in the area, with ferruginous zones as targets.
基金Item Sponsored by Spanish Ministry of Education and Science(DPI2007-61090)European Commission Research Programme of the Research Fund for Coal and Steel(RFS-PR-06035)
文摘An experience is presented using the finite element method (FEM) and data mining (DM) techniques to develop models that can be used to optimieze the skin-pass rolling process based on its operating conditions. A FE model based on a real skin-pass process is built and validated. Based on this model, a group of FE models is simulated with different adjustment parameters and with different materials for the sheet; both variables are chosen from pre-set ranges, From all FE model simulations, a database is generated; this database is made up of the above mentioned adjustment parameters, sheet properties and the variables of the process arising from the simulation of the model. Various types of data mining algorithms are used to develop predictive models for each of the variables of the process.The best predictive models can be used to predict experimentally hard-to-measure variables (internal stresses, internal straine, etc.) which are useful in the optimal design of the process or to be applied in real time control systems of a skin-pass process in -plant.
基金supported by Tianjin Municipal Information Industry Office (No. 082044012)
文摘Exploiting mobile elements (MEs) to accomplish data collection in wireless sensor networks (WSNs) can improve the energy efficiency of sensor nodes, and prolong network lifetime. However, it will lead to large data collection latency for the network, which is unacceptable for data-critical applications. In this paper, we address this problem by minimizing the traveling length of MEs. Our methods mainly consist of two steps: we first construct a virtual grid network and select the minimal stop point set (SPS) from it; then, we make optimal scheduling for the MEs based on the SPS in order to minimize their traveling length. Different implementations of genetic algorithm (GA) are used to solve the problem. Our methods are evaluated by extensive simulations. The results show that these methods can greatly reduce the traveling length of MEs, and decrease the data collection latency.