Laser-induced breakdown spectroscopy(LIBS)has become a widely used atomic spectroscopic technique for rapid coal analysis.However,the vast amount of spectral information in LIBS contains signal uncertainty,which can a...Laser-induced breakdown spectroscopy(LIBS)has become a widely used atomic spectroscopic technique for rapid coal analysis.However,the vast amount of spectral information in LIBS contains signal uncertainty,which can affect its quantification performance.In this work,we propose a hybrid variable selection method to improve the performance of LIBS quantification.Important variables are first identified using Pearson's correlation coefficient,mutual information,least absolute shrinkage and selection operator(LASSO)and random forest,and then filtered and combined with empirical variables related to fingerprint elements of coal ash content.Subsequently,these variables are fed into a partial least squares regression(PLSR).Additionally,in some models,certain variables unrelated to ash content are removed manually to study the impact of variable deselection on model performance.The proposed hybrid strategy was tested on three LIBS datasets for quantitative analysis of coal ash content and compared with the corresponding data-driven baseline method.It is significantly better than the variable selection only method based on empirical knowledge and in most cases outperforms the baseline method.The results showed that on all three datasets the hybrid strategy for variable selection combining empirical knowledge and data-driven algorithms achieved the lowest root mean square error of prediction(RMSEP)values of 1.605,3.478 and 1.647,respectively,which were significantly lower than those obtained from multiple linear regression using only 12 empirical variables,which are 1.959,3.718 and 2.181,respectively.The LASSO-PLSR model with empirical support and 20 selected variables exhibited a significantly improved performance after variable deselection,with RMSEP values dropping from 1.635,3.962 and 1.647 to 1.483,3.086 and 1.567,respectively.Such results demonstrate that using empirical knowledge as a support for datadriven variable selection can be a viable approach to improve the accuracy and reliability of LIBS quantification.展开更多
Iced transmission line galloping poses a significant threat to the safety and reliability of power systems,leading directly to line tripping,disconnections,and power outages.Existing early warning methods of iced tran...Iced transmission line galloping poses a significant threat to the safety and reliability of power systems,leading directly to line tripping,disconnections,and power outages.Existing early warning methods of iced transmission line galloping suffer from issues such as reliance on a single data source,neglect of irregular time series,and lack of attention-based closed-loop feedback,resulting in high rates of missed and false alarms.To address these challenges,we propose an Internet of Things(IoT)empowered early warning method of transmission line galloping that integrates time series data from optical fiber sensing and weather forecast.Initially,the method applies a primary adaptive weighted fusion to the IoT empowered optical fiber real-time sensing data and weather forecast data,followed by a secondary fusion based on a Back Propagation(BP)neural network,and uses the K-medoids algorithm for clustering the fused data.Furthermore,an adaptive irregular time series perception adjustment module is introduced into the traditional Gated Recurrent Unit(GRU)network,and closed-loop feedback based on attentionmechanism is employed to update network parameters through gradient feedback of the loss function,enabling closed-loop training and time series data prediction of the GRU network model.Subsequently,considering various types of prediction data and the duration of icing,an iced transmission line galloping risk coefficient is established,and warnings are categorized based on this coefficient.Finally,using an IoT-driven realistic dataset of iced transmission line galloping,the effectiveness of the proposed method is validated through multi-dimensional simulation scenarios.展开更多
Viral infectious diseases,characterized by their intricate nature and wide-ranging diversity,pose substantial challenges in the domain of data management.The vast volume of data generated by these diseases,spanning fr...Viral infectious diseases,characterized by their intricate nature and wide-ranging diversity,pose substantial challenges in the domain of data management.The vast volume of data generated by these diseases,spanning from the molecular mechanisms within cells to large-scale epidemiological patterns,has surpassed the capabilities of traditional analytical methods.In the era of artificial intelligence(AI)and big data,there is an urgent necessity for the optimization of these analytical methods to more effectively handle and utilize the information.Despite the rapid accumulation of data associated with viral infections,the lack of a comprehensive framework for integrating,selecting,and analyzing these datasets has left numerous researchers uncertain about which data to select,how to access it,and how to utilize it most effectively in their research.This review endeavors to fill these gaps by exploring the multifaceted nature of viral infectious diseases and summarizing relevant data across multiple levels,from the molecular details of pathogens to broad epidemiological trends.The scope extends from the micro-scale to the macro-scale,encompassing pathogens,hosts,and vectors.In addition to data summarization,this review thoroughly investigates various dataset sources.It also traces the historical evolution of data collection in the field of viral infectious diseases,highlighting the progress achieved over time.Simultaneously,it evaluates the current limitations that impede data utilization.Furthermore,we propose strategies to surmount these challenges,focusing on the development and application of advanced computational techniques,AI-driven models,and enhanced data integration practices.By providing a comprehensive synthesis of existing knowledge,this review is designed to guide future research and contribute to more informed approaches in the surveillance,prevention,and control of viral infectious diseases,particularly within the context of the expanding big-data landscape.展开更多
The Intelligent Internet of Things(IIoT)involves real-world things that communicate or interact with each other through networking technologies by collecting data from these“things”and using intelligent approaches,s...The Intelligent Internet of Things(IIoT)involves real-world things that communicate or interact with each other through networking technologies by collecting data from these“things”and using intelligent approaches,such as Artificial Intelligence(AI)and machine learning,to make accurate decisions.Data science is the science of dealing with data and its relationships through intelligent approaches.Most state-of-the-art research focuses independently on either data science or IIoT,rather than exploring their integration.Therefore,to address the gap,this article provides a comprehensive survey on the advances and integration of data science with the Intelligent IoT(IIoT)system by classifying the existing IoT-based data science techniques and presenting a summary of various characteristics.The paper analyzes the data science or big data security and privacy features,including network architecture,data protection,and continuous monitoring of data,which face challenges in various IoT-based systems.Extensive insights into IoT data security,privacy,and challenges are visualized in the context of data science for IoT.In addition,this study reveals the current opportunities to enhance data science and IoT market development.The current gap and challenges faced in the integration of data science and IoT are comprehensively presented,followed by the future outlook and possible solutions.展开更多
Air pollution in China covers a large area with complex sources and formation mechanisms,making it a unique place to conduct air pollution and atmospheric chemistry research.The National Natural Science Foundation of ...Air pollution in China covers a large area with complex sources and formation mechanisms,making it a unique place to conduct air pollution and atmospheric chemistry research.The National Natural Science Foundation of China’s Major Research Plan entitled“Fundamental Researches on the Formation and Response Mechanism of the Air Pollution Complex in China”(or the Plan)has funded 76 research projects to explore the causes of air pollution in China,and the key processes of air pollution in atmospheric physics and atmospheric chemistry.In order to summarize the abundant data from the Plan and exhibit the long-term impacts domestically and internationally,an integration project is responsible for collecting the various types of data generated by the 76 projects of the Plan.This project has classified and integrated these data,forming eight categories containing 258 datasets and 15 technical reports in total.The integration project has led to the successful establishment of the China Air Pollution Data Center(CAPDC)platform,providing storage,retrieval,and download services for the eight categories.This platform has distinct features including data visualization,related project information querying,and bilingual services in both English and Chinese,which allows for rapid searching and downloading of data and provides a solid foundation of data and support for future related research.Air pollution control in China,especially in the past decade,is undeniably a global exemplar,and this data center is the first in China to focus on research into the country’s air pollution complex.展开更多
Water content, whether as free or lattice-bound water, is a crucial factor in determining the Earth's internal thermal state and plays a key role in volcanic eruptions, melting phenomena, and mantle convection rat...Water content, whether as free or lattice-bound water, is a crucial factor in determining the Earth's internal thermal state and plays a key role in volcanic eruptions, melting phenomena, and mantle convection rates. As electrical conductivity in the Earth's interior is highly sensitive to water content, it is an important geophysical parameter for understanding the deep Earth water content. Since its launch on May 21, 2023, the MSS-1(Macao Science Satellite-1) mission has operated for nearly one year, with its magnetometer achieving a precision of higher than 0.5 nT after orbital testing and calibration. Orbiting at 450 kilometers with a unique 41-degree inclination, the satellite enables high-density observations across multiple local times, allowing detailed monitoring of low-latitude regions and enhancing data for global conductivity imaging. To better understand the global distribution of water within the Earth's interior, it is crucial to study internal conductivity structure and water content distribution. To this aim, we introduce a method for using MSS-1 data to estamate induced magnetic fields related to magnetospheric currents. We then develop a trans-dimensional Bayesian approach to reveal Earth's internal conductivity, providing probable conductivity structure with an uncertainty analysis. Finally, by integrating known mineral composition, pressure, and temperature distribution within the mantle, we estimate the water content range in the mantle transition zone, concluding that this region may contain the equivalent of up to 3.0 oceans of water, providing compelling evidence that supports the hypothesis of a deep water cycle within the Earth's interior.展开更多
As a new type of production factor in healthcare,healthcare data elements have been rapidly integrated into various health production processes,such as clinical assistance,health management,biological testing,and oper...As a new type of production factor in healthcare,healthcare data elements have been rapidly integrated into various health production processes,such as clinical assistance,health management,biological testing,and operation and supervision[1,2].Healthcare data elements include biolog.ical and clinical data that are related to disease,environ-mental health data that are associated with life,and operational and healthcare management data that are related to healthcare activities(Figure 1).Activities such as the construction of a data value assessment system,the devel-opment of a data circulation and sharing platform,and the authorization of data compliance and operation products support the strong growth momentum of the market for health care data elements in China[3].展开更多
As smart grid technology rapidly advances,the vast amount of user data collected by smart meter presents significant challenges in data security and privacy protection.Current research emphasizes data security and use...As smart grid technology rapidly advances,the vast amount of user data collected by smart meter presents significant challenges in data security and privacy protection.Current research emphasizes data security and user privacy concerns within smart grids.However,existing methods struggle with efficiency and security when processing large-scale data.Balancing efficient data processing with stringent privacy protection during data aggregation in smart grids remains an urgent challenge.This paper proposes an AI-based multi-type data aggregation method designed to enhance aggregation efficiency and security by standardizing and normalizing various data modalities.The approach optimizes data preprocessing,integrates Long Short-Term Memory(LSTM)networks for handling time-series data,and employs homomorphic encryption to safeguard user privacy.It also explores the application of Boneh Lynn Shacham(BLS)signatures for user authentication.The proposed scheme’s efficiency,security,and privacy protection capabilities are validated through rigorous security proofs and experimental analysis.展开更多
Rice is a poor source of folate,an essential micronutrient for the body.Biofortification offers an effective way to enhance the folate content of rice and alleviate folate deficiencies in humans.In this study,we confi...Rice is a poor source of folate,an essential micronutrient for the body.Biofortification offers an effective way to enhance the folate content of rice and alleviate folate deficiencies in humans.In this study,we confirmed that OsADCS and OsGTPCHI,encoding the initial enzymes necessary for folate synthesis,positively regulate folate accumulation in knockout mutants of both japonica and indica rice backgrounds.The folate content in the low-folate japonica variety was slightly increased by the expression of the indica alleles driven by the endosperm-specific promoter.We further obtained co-expression lines by stacking OsADCS and OsGTPCHI genes;the folate accumulation in brown rice and polished rice reached 5.65μg/g and 2.95μg/g,respectively,representing 37.9-fold and 26.5-fold increases compared with the wild type.Transcriptomic analysis of rice grains from six transgenic lines showed that folate changes affected biological pathways involved in the synthesis and metabolism of rice seed storage substances,while the expression of other folate synthesis genes was weakly regulated.In addition,we identified Aus rice as a high-folate germplasm carrying superior haplotypes of OsADCS and OsGTPCHI through natural variation.This study provides an alternative and effective complementary strategy for rice biofortification,promoting the rational combination of metabolic engineering and conventional breeding to breed high-folate varieties.展开更多
Earth’s internal core and crustal magnetic fields,as measured by geomagnetic satellites like MSS-1(Macao Science Satellite-1)and Swarm,are vital for understanding core dynamics and tectonic evolution.To model these i...Earth’s internal core and crustal magnetic fields,as measured by geomagnetic satellites like MSS-1(Macao Science Satellite-1)and Swarm,are vital for understanding core dynamics and tectonic evolution.To model these internal magnetic fields accurately,data selection based on specific criteria is often employed to minimize the influence of rapidly changing current systems in the ionosphere and magnetosphere.However,the quantitative impact of various data selection criteria on internal geomagnetic field modeling is not well understood.This study aims to address this issue and provide a reference for constructing and applying geomagnetic field models.First,we collect the latest MSS-1 and Swarm satellite magnetic data and summarize widely used data selection criteria in geomagnetic field modeling.Second,we briefly describe the method to co-estimate the core,crustal,and large-scale magnetospheric fields using satellite magnetic data.Finally,we conduct a series of field modeling experiments with different data selection criteria to quantitatively estimate their influence.Our numerical experiments confirm that without selecting data from dark regions and geomagnetically quiet times,the resulting internal field differences at the Earth’s surface can range from tens to hundreds of nanotesla(nT).Additionally,we find that the uncertainties introduced into field models by different data selection criteria are significantly larger than the measurement accuracy of modern geomagnetic satellites.These uncertainties should be considered when utilizing constructed magnetic field models for scientific research and applications.展开更多
Many fields,such as neuroscience,are experiencing the vast prolife ration of cellular data,underscoring the need fo r organizing and interpreting large datasets.A popular approach partitions data into manageable subse...Many fields,such as neuroscience,are experiencing the vast prolife ration of cellular data,underscoring the need fo r organizing and interpreting large datasets.A popular approach partitions data into manageable subsets via hierarchical clustering,but objective methods to determine the appropriate classification granularity are missing.We recently introduced a technique to systematically identify when to stop subdividing clusters based on the fundamental principle that cells must differ more between than within clusters.Here we present the corresponding protocol to classify cellular datasets by combining datadriven unsupervised hierarchical clustering with statistical testing.These general-purpose functions are applicable to any cellular dataset that can be organized as two-dimensional matrices of numerical values,including molecula r,physiological,and anatomical datasets.We demonstrate the protocol using cellular data from the Janelia MouseLight project to chara cterize morphological aspects of neurons.展开更多
The accelerated advancement of the Internet of Things(IoT)has generated substantial data,including sensitive and private information.Consequently,it is imperative to guarantee the security of data sharing.While facili...The accelerated advancement of the Internet of Things(IoT)has generated substantial data,including sensitive and private information.Consequently,it is imperative to guarantee the security of data sharing.While facilitating fine-grained access control,Ciphertext Policy Attribute-Based Encryption(CP-ABE)can effectively ensure the confidentiality of shared data.Nevertheless,the conventional centralized CP-ABE scheme is plagued by the issues of keymisuse,key escrow,and large computation,which will result in security risks.This paper suggests a lightweight IoT data security sharing scheme that integrates blockchain technology and CP-ABE to address the abovementioned issues.The integrity and traceability of shared data are guaranteed by the use of blockchain technology to store and verify access transactions.The encryption and decryption operations of the CP-ABE algorithm have been implemented using elliptic curve scalarmultiplication to accommodate lightweight IoT devices,as opposed to themore arithmetic bilinear pairing found in the traditional CP-ABE algorithm.Additionally,a portion of the computation is delegated to the edge nodes to alleviate the computational burden on users.A distributed key management method is proposed to address the issues of key escrow andmisuse.Thismethod employs the edge blockchain to facilitate the storage and distribution of attribute private keys.Meanwhile,data security sharing is enhanced by combining off-chain and on-chain ciphertext storage.The security and performance analysis indicates that the proposed scheme is more efficient and secure.展开更多
Carbon can change the phase components of low-density steels and influence the mechanical properties.In this study,a new method to control the carbon content and avoid the formation ofδ-ferrite by decarburization tre...Carbon can change the phase components of low-density steels and influence the mechanical properties.In this study,a new method to control the carbon content and avoid the formation ofδ-ferrite by decarburization treatment was proposed.The microstructural changes and mechanical characteristics with carbon content induced by decarburization were systematically examined.Crussard-Jaoul(C-J)analysis was employed to examine the work hardening characteristics during the tensile test.During decarburization by heat treatments,the carbon content within the austenite phase decreased,while Mn and Al were almost unchanged;this made the steel with full austenite transform into the austenite and ferrite dual phase.Meanwhile,(Ti,V)C carbides existed in both matrix phase and the mole fraction almost the same.In addition,the formation of other carbides restrained.Carbon loss induced a decrease in strength due to the weakening of the carbon solid solution.For the steel with the single austinite,the deformation mode of austenite was the dislocation planar glide,resulting in the formation of microbands.For the dual-phase steel,the deformation occurred by the dislocation planar glide of austenite first,with the increase in strain,the cross slip of ferrite took place,forming dislocation cells in ferrite.At the late stage of deformation,the work hardening of austinite increased rapidly,while that of ferrite increased slightly.展开更多
调查和分析元数据标准在健康科学数据中的应用现状,有助于为我国健康科学数据描述中元数据标准的选择、健康科学数据平台的建设提供参考。通过网络调研法对科学数据仓储注册系统(registry of research data repositories,re3data)中的...调查和分析元数据标准在健康科学数据中的应用现状,有助于为我国健康科学数据描述中元数据标准的选择、健康科学数据平台的建设提供参考。通过网络调研法对科学数据仓储注册系统(registry of research data repositories,re3data)中的健康科学数据管理平台进行调研,梳理所应用的元数据标准,分析典型元数据标准在平台中的应用情况,并归纳其在健康科学数据描述中的适用性。re3data中各健康科学数据平台共使用14种元数据标准,其中DC、DataCite、DDI、仓储自建元数据标准的使用最为广泛,多数平台组合使用多种元数据标准。各类元数据标准可分为通用型、社会科学型、自建型3类,分别适用于描述健康科学数据通用属性、社会科学研究产生的健康科学数据、特色和专业性强及政府开放的健康科学数据。展开更多
In order to achieve the large-scale application of manufactured sand in railway high-strength concrete structure,a series of high-strength manufactured sand concrete(HMC)are prepared by taking the manufactured sand li...In order to achieve the large-scale application of manufactured sand in railway high-strength concrete structure,a series of high-strength manufactured sand concrete(HMC)are prepared by taking the manufactured sand lithology(tuff,limestone,basalt,granite),stone powder content(0,5%,10%,15%)and concrete strength grade(C60,C80,C100)as variables.The evolution of mechanical properties of HMC and the correlation between cubic compressive strength and other mechanical properties are studied.Compared to river sand,manufactured sand enhances the cubic compressive strength,axial compressive strength and elastic modulus of concrete,while its potential microcracks weaken the flexural strength and splitting tensile strength of concrete.Stone powder content displays both positive and negative effects on mechanical properties of HMC,and the stone powder content is suggested to be less than 10%.The empirical formulas between cubic compressive strength and other mechanical properties are proposed.展开更多
Gas content serves as a critical indicator for assessing the resource potential of deep coal mines and forecasting coal mine gas outburst risks.However,existing sampling technologies face challenges in maintaining the...Gas content serves as a critical indicator for assessing the resource potential of deep coal mines and forecasting coal mine gas outburst risks.However,existing sampling technologies face challenges in maintaining the integrity of gas content within samples and are often constrained by estimation errors inherent in empirical formulas,which results in inaccurate gas content measurements.This study introduces a lightweight,in-situ pressure-and gas-preserved corer designed to collect coal samples under the pressure conditions at the sampling point,effectively preventing gas loss during transfer and significantly improving measurement accuracy.Additionally,a gas migration model for deep coal mines was developed to elucidate gas migration characteristics under pressure-preserved coring conditions.The model offers valuable insights for optimizing coring parameters,demonstrating that both minimizing the coring hole diameter and reducing the pressure difference between the coring-point pressure and the original pore pressure can effectively improve the precision of gas content measurements.Coring tests conducted at an experimental base validated the performance of the corer and its effectiveness in sample collection.Furthermore,successful horizontal coring tests conducted in an underground coal mine roadway demonstrated that the measured gas content using pressure-preserved coring was 34%higher than that obtained through open sampling methods.展开更多
This study presents an innovative approach to enhancing the security of visual medical data in the generative AI environment through the integration of blockchain technology.By combining the strengths of blockchain an...This study presents an innovative approach to enhancing the security of visual medical data in the generative AI environment through the integration of blockchain technology.By combining the strengths of blockchain and generative AI,the research team aimed to address the timely challenge of safeguarding visual medical content.The participating researchers conducted a comprehensive analysis,examining the vulnerabilities of medical AI services,personal information protection issues,and overall security weaknesses.This multi faceted exploration led to an indepth evaluation of the model’s performance and security.Notably,the correlation between accuracy,detection rate,and error rate was scrutinized.This analysis revealed insights into the model’s strengths and limitations,while the consideration of standard deviation shed light on the model’s stability and performance variability.The study proposed practical improvements,emphasizing the reduction of false negatives to enhance detection rate and leveraging blockchain technology to ensure visual data integrity in medical applications.Applying blockchain to generative AI-created medical content addresses key personal information protection issues.By utilizing the distributed ledger system of blockchain,the research team aimed to protect the privacy and integrity of medical data especially medical images.This approach not only enhances security but also enables transparent and tamperproof record-keeping.Additionally,the use of generative AI models ensures the creation of novel medical content without compromising personal information,further safeguarding patient privacy.In conclusion,this study showcases the potential of blockchain-based solutions in the medical field,particularly in securing sensitive medical data and protecting patient privacy.The proposed approach,combining blockchain and generative AI,offers a promising direction toward more robust and secure medical content management.Further research and advancements in this area will undoubtedly contribute to the development of robust and privacy-preserving healthcare systems,and visual diagnostic systems.展开更多
Open networks and heterogeneous services in the Internet of Vehicles(IoV)can lead to security and privacy challenges.One key requirement for such systems is the preservation of user privacy,ensuring a seamless experie...Open networks and heterogeneous services in the Internet of Vehicles(IoV)can lead to security and privacy challenges.One key requirement for such systems is the preservation of user privacy,ensuring a seamless experience in driving,navigation,and communication.These privacy needs are influenced by various factors,such as data collected at different intervals,trip durations,and user interactions.To address this,the paper proposes a Support Vector Machine(SVM)model designed to process large amounts of aggregated data and recommend privacy preserving measures.The model analyzes data based on user demands and interactions with service providers or neighboring infrastructure.It aims to minimize privacy risks while ensuring service continuity and sustainability.The SVMmodel helps validate the system’s reliability by creating a hyperplane that distinguishes between maximum and minimum privacy recommendations.The results demonstrate the effectiveness of the proposed SVM model in enhancing both privacy and service performance.展开更多
This article explores the design of a wireless fire alarm system supported by advanced data fusion technology.It includes discussions on the basic design ideas of the wireless fire alarm system,hardware design analysi...This article explores the design of a wireless fire alarm system supported by advanced data fusion technology.It includes discussions on the basic design ideas of the wireless fire alarm system,hardware design analysis,software design analysis,and simulation analysis,all supported by data fusion technology.Hopefully,this analysis can provide some reference for the rational application of data fusion technology to meet the actual design and application requirements of the system.展开更多
We combine gradient data from the Macao Science Satellite-1(MSS-1),CHAllenging Minisatellite Payload(CHAMP),Swarm-A,and Swarm-C satellites to develop a 110-degree lithospheric magnetic field model.We then comprehensiv...We combine gradient data from the Macao Science Satellite-1(MSS-1),CHAllenging Minisatellite Payload(CHAMP),Swarm-A,and Swarm-C satellites to develop a 110-degree lithospheric magnetic field model.We then comprehensively evaluate the performance of the model by power spectral comparisons,correlation analyses,sensitivity matrix assessments,and comparisons with existing lithospheric field models.Results showed that using near east–west gradient data from MSS-1 significantly enhances the model correlation in the spherical harmonic degree(N) range of 45–60 while also mitigating the decline in correlation at higher degrees(N > 60).Furthermore,the unique orbital characteristics of MSS-1 enable its gradient data to provide substantial contributions to modeling in the mid-to low-latitude regions.With continued data acquisition from MSS-1 and further optimization of data processing methods,the performance of the model is expected to improve.展开更多
基金financial supports from National Natural Science Foundation of China(No.62205172)Huaneng Group Science and Technology Research Project(No.HNKJ22-H105)Tsinghua University Initiative Scientific Research Program and the International Joint Mission on Climate Change and Carbon Neutrality。
文摘Laser-induced breakdown spectroscopy(LIBS)has become a widely used atomic spectroscopic technique for rapid coal analysis.However,the vast amount of spectral information in LIBS contains signal uncertainty,which can affect its quantification performance.In this work,we propose a hybrid variable selection method to improve the performance of LIBS quantification.Important variables are first identified using Pearson's correlation coefficient,mutual information,least absolute shrinkage and selection operator(LASSO)and random forest,and then filtered and combined with empirical variables related to fingerprint elements of coal ash content.Subsequently,these variables are fed into a partial least squares regression(PLSR).Additionally,in some models,certain variables unrelated to ash content are removed manually to study the impact of variable deselection on model performance.The proposed hybrid strategy was tested on three LIBS datasets for quantitative analysis of coal ash content and compared with the corresponding data-driven baseline method.It is significantly better than the variable selection only method based on empirical knowledge and in most cases outperforms the baseline method.The results showed that on all three datasets the hybrid strategy for variable selection combining empirical knowledge and data-driven algorithms achieved the lowest root mean square error of prediction(RMSEP)values of 1.605,3.478 and 1.647,respectively,which were significantly lower than those obtained from multiple linear regression using only 12 empirical variables,which are 1.959,3.718 and 2.181,respectively.The LASSO-PLSR model with empirical support and 20 selected variables exhibited a significantly improved performance after variable deselection,with RMSEP values dropping from 1.635,3.962 and 1.647 to 1.483,3.086 and 1.567,respectively.Such results demonstrate that using empirical knowledge as a support for datadriven variable selection can be a viable approach to improve the accuracy and reliability of LIBS quantification.
基金research was funded by Science and Technology Project of State Grid Corporation of China under grant number 5200-202319382A-2-3-XG.
文摘Iced transmission line galloping poses a significant threat to the safety and reliability of power systems,leading directly to line tripping,disconnections,and power outages.Existing early warning methods of iced transmission line galloping suffer from issues such as reliance on a single data source,neglect of irregular time series,and lack of attention-based closed-loop feedback,resulting in high rates of missed and false alarms.To address these challenges,we propose an Internet of Things(IoT)empowered early warning method of transmission line galloping that integrates time series data from optical fiber sensing and weather forecast.Initially,the method applies a primary adaptive weighted fusion to the IoT empowered optical fiber real-time sensing data and weather forecast data,followed by a secondary fusion based on a Back Propagation(BP)neural network,and uses the K-medoids algorithm for clustering the fused data.Furthermore,an adaptive irregular time series perception adjustment module is introduced into the traditional Gated Recurrent Unit(GRU)network,and closed-loop feedback based on attentionmechanism is employed to update network parameters through gradient feedback of the loss function,enabling closed-loop training and time series data prediction of the GRU network model.Subsequently,considering various types of prediction data and the duration of icing,an iced transmission line galloping risk coefficient is established,and warnings are categorized based on this coefficient.Finally,using an IoT-driven realistic dataset of iced transmission line galloping,the effectiveness of the proposed method is validated through multi-dimensional simulation scenarios.
基金supported by the National Natural Science Foundation of China(32370703)the CAMS Innovation Fund for Medical Sciences(CIFMS)(2022-I2M-1-021,2021-I2M-1-061)the Major Project of Guangzhou National Labora-tory(GZNL2024A01015).
文摘Viral infectious diseases,characterized by their intricate nature and wide-ranging diversity,pose substantial challenges in the domain of data management.The vast volume of data generated by these diseases,spanning from the molecular mechanisms within cells to large-scale epidemiological patterns,has surpassed the capabilities of traditional analytical methods.In the era of artificial intelligence(AI)and big data,there is an urgent necessity for the optimization of these analytical methods to more effectively handle and utilize the information.Despite the rapid accumulation of data associated with viral infections,the lack of a comprehensive framework for integrating,selecting,and analyzing these datasets has left numerous researchers uncertain about which data to select,how to access it,and how to utilize it most effectively in their research.This review endeavors to fill these gaps by exploring the multifaceted nature of viral infectious diseases and summarizing relevant data across multiple levels,from the molecular details of pathogens to broad epidemiological trends.The scope extends from the micro-scale to the macro-scale,encompassing pathogens,hosts,and vectors.In addition to data summarization,this review thoroughly investigates various dataset sources.It also traces the historical evolution of data collection in the field of viral infectious diseases,highlighting the progress achieved over time.Simultaneously,it evaluates the current limitations that impede data utilization.Furthermore,we propose strategies to surmount these challenges,focusing on the development and application of advanced computational techniques,AI-driven models,and enhanced data integration practices.By providing a comprehensive synthesis of existing knowledge,this review is designed to guide future research and contribute to more informed approaches in the surveillance,prevention,and control of viral infectious diseases,particularly within the context of the expanding big-data landscape.
基金supported in part by the National Natural Science Foundation of China under Grant 62371181in part by the Changzhou Science and Technology International Cooperation Program under Grant CZ20230029+1 种基金supported by a National Research Foundation of Korea(NRF)grant funded by the Korea government(MSIT)(2021R1A2B5B02087169)supported under the framework of international cooperation program managed by the National Research Foundation of Korea(2022K2A9A1A01098051)。
文摘The Intelligent Internet of Things(IIoT)involves real-world things that communicate or interact with each other through networking technologies by collecting data from these“things”and using intelligent approaches,such as Artificial Intelligence(AI)and machine learning,to make accurate decisions.Data science is the science of dealing with data and its relationships through intelligent approaches.Most state-of-the-art research focuses independently on either data science or IIoT,rather than exploring their integration.Therefore,to address the gap,this article provides a comprehensive survey on the advances and integration of data science with the Intelligent IoT(IIoT)system by classifying the existing IoT-based data science techniques and presenting a summary of various characteristics.The paper analyzes the data science or big data security and privacy features,including network architecture,data protection,and continuous monitoring of data,which face challenges in various IoT-based systems.Extensive insights into IoT data security,privacy,and challenges are visualized in the context of data science for IoT.In addition,this study reveals the current opportunities to enhance data science and IoT market development.The current gap and challenges faced in the integration of data science and IoT are comprehensively presented,followed by the future outlook and possible solutions.
基金supported by the National Natural Science Foundation of China(Grant No.92044303)。
文摘Air pollution in China covers a large area with complex sources and formation mechanisms,making it a unique place to conduct air pollution and atmospheric chemistry research.The National Natural Science Foundation of China’s Major Research Plan entitled“Fundamental Researches on the Formation and Response Mechanism of the Air Pollution Complex in China”(or the Plan)has funded 76 research projects to explore the causes of air pollution in China,and the key processes of air pollution in atmospheric physics and atmospheric chemistry.In order to summarize the abundant data from the Plan and exhibit the long-term impacts domestically and internationally,an integration project is responsible for collecting the various types of data generated by the 76 projects of the Plan.This project has classified and integrated these data,forming eight categories containing 258 datasets and 15 technical reports in total.The integration project has led to the successful establishment of the China Air Pollution Data Center(CAPDC)platform,providing storage,retrieval,and download services for the eight categories.This platform has distinct features including data visualization,related project information querying,and bilingual services in both English and Chinese,which allows for rapid searching and downloading of data and provides a solid foundation of data and support for future related research.Air pollution control in China,especially in the past decade,is undeniably a global exemplar,and this data center is the first in China to focus on research into the country’s air pollution complex.
基金financially supported by the National Natural Science Foundation of China(42250102,42250101)the Macao Foundation.
文摘Water content, whether as free or lattice-bound water, is a crucial factor in determining the Earth's internal thermal state and plays a key role in volcanic eruptions, melting phenomena, and mantle convection rates. As electrical conductivity in the Earth's interior is highly sensitive to water content, it is an important geophysical parameter for understanding the deep Earth water content. Since its launch on May 21, 2023, the MSS-1(Macao Science Satellite-1) mission has operated for nearly one year, with its magnetometer achieving a precision of higher than 0.5 nT after orbital testing and calibration. Orbiting at 450 kilometers with a unique 41-degree inclination, the satellite enables high-density observations across multiple local times, allowing detailed monitoring of low-latitude regions and enhancing data for global conductivity imaging. To better understand the global distribution of water within the Earth's interior, it is crucial to study internal conductivity structure and water content distribution. To this aim, we introduce a method for using MSS-1 data to estamate induced magnetic fields related to magnetospheric currents. We then develop a trans-dimensional Bayesian approach to reveal Earth's internal conductivity, providing probable conductivity structure with an uncertainty analysis. Finally, by integrating known mineral composition, pressure, and temperature distribution within the mantle, we estimate the water content range in the mantle transition zone, concluding that this region may contain the equivalent of up to 3.0 oceans of water, providing compelling evidence that supports the hypothesis of a deep water cycle within the Earth's interior.
基金supported by National Natural Science Foundation of China(Grants 72474022,71974011,72174022,71972012,71874009)"BIT think tank"Promotion Plan of Science and Technology Innovation Program of Beijing Institute of Technology(Grants 2024CX14017,2023CX13029).
文摘As a new type of production factor in healthcare,healthcare data elements have been rapidly integrated into various health production processes,such as clinical assistance,health management,biological testing,and operation and supervision[1,2].Healthcare data elements include biolog.ical and clinical data that are related to disease,environ-mental health data that are associated with life,and operational and healthcare management data that are related to healthcare activities(Figure 1).Activities such as the construction of a data value assessment system,the devel-opment of a data circulation and sharing platform,and the authorization of data compliance and operation products support the strong growth momentum of the market for health care data elements in China[3].
基金supported by the National Key R&D Program of China(No.2023YFB2703700)the National Natural Science Foundation of China(Nos.U21A20465,62302457,62402444,62172292)+4 种基金the Fundamental Research Funds of Zhejiang Sci-Tech University(Nos.23222092-Y,22222266-Y)the Program for Leading Innovative Research Team of Zhejiang Province(No.2023R01001)the Zhejiang Provincial Natural Science Foundation of China(Nos.LQ24F020008,LQ24F020012)the Foundation of State Key Laboratory of Public Big Data(No.[2022]417)the“Pioneer”and“Leading Goose”R&D Program of Zhejiang(No.2023C01119).
文摘As smart grid technology rapidly advances,the vast amount of user data collected by smart meter presents significant challenges in data security and privacy protection.Current research emphasizes data security and user privacy concerns within smart grids.However,existing methods struggle with efficiency and security when processing large-scale data.Balancing efficient data processing with stringent privacy protection during data aggregation in smart grids remains an urgent challenge.This paper proposes an AI-based multi-type data aggregation method designed to enhance aggregation efficiency and security by standardizing and normalizing various data modalities.The approach optimizes data preprocessing,integrates Long Short-Term Memory(LSTM)networks for handling time-series data,and employs homomorphic encryption to safeguard user privacy.It also explores the application of Boneh Lynn Shacham(BLS)signatures for user authentication.The proposed scheme’s efficiency,security,and privacy protection capabilities are validated through rigorous security proofs and experimental analysis.
基金supported by the Central Public-Interest Scientific Institution Basal Research Fund,China(Grant No.CPSIBRF-CNRRI-202403)。
文摘Rice is a poor source of folate,an essential micronutrient for the body.Biofortification offers an effective way to enhance the folate content of rice and alleviate folate deficiencies in humans.In this study,we confirmed that OsADCS and OsGTPCHI,encoding the initial enzymes necessary for folate synthesis,positively regulate folate accumulation in knockout mutants of both japonica and indica rice backgrounds.The folate content in the low-folate japonica variety was slightly increased by the expression of the indica alleles driven by the endosperm-specific promoter.We further obtained co-expression lines by stacking OsADCS and OsGTPCHI genes;the folate accumulation in brown rice and polished rice reached 5.65μg/g and 2.95μg/g,respectively,representing 37.9-fold and 26.5-fold increases compared with the wild type.Transcriptomic analysis of rice grains from six transgenic lines showed that folate changes affected biological pathways involved in the synthesis and metabolism of rice seed storage substances,while the expression of other folate synthesis genes was weakly regulated.In addition,we identified Aus rice as a high-folate germplasm carrying superior haplotypes of OsADCS and OsGTPCHI through natural variation.This study provides an alternative and effective complementary strategy for rice biofortification,promoting the rational combination of metabolic engineering and conventional breeding to breed high-folate varieties.
基金supported by the National Natural Science Foundation of China(42250101)the Macao Foundation。
文摘Earth’s internal core and crustal magnetic fields,as measured by geomagnetic satellites like MSS-1(Macao Science Satellite-1)and Swarm,are vital for understanding core dynamics and tectonic evolution.To model these internal magnetic fields accurately,data selection based on specific criteria is often employed to minimize the influence of rapidly changing current systems in the ionosphere and magnetosphere.However,the quantitative impact of various data selection criteria on internal geomagnetic field modeling is not well understood.This study aims to address this issue and provide a reference for constructing and applying geomagnetic field models.First,we collect the latest MSS-1 and Swarm satellite magnetic data and summarize widely used data selection criteria in geomagnetic field modeling.Second,we briefly describe the method to co-estimate the core,crustal,and large-scale magnetospheric fields using satellite magnetic data.Finally,we conduct a series of field modeling experiments with different data selection criteria to quantitatively estimate their influence.Our numerical experiments confirm that without selecting data from dark regions and geomagnetically quiet times,the resulting internal field differences at the Earth’s surface can range from tens to hundreds of nanotesla(nT).Additionally,we find that the uncertainties introduced into field models by different data selection criteria are significantly larger than the measurement accuracy of modern geomagnetic satellites.These uncertainties should be considered when utilizing constructed magnetic field models for scientific research and applications.
基金supported in part by NIH grants R01NS39600,U01MH114829RF1MH128693(to GAA)。
文摘Many fields,such as neuroscience,are experiencing the vast prolife ration of cellular data,underscoring the need fo r organizing and interpreting large datasets.A popular approach partitions data into manageable subsets via hierarchical clustering,but objective methods to determine the appropriate classification granularity are missing.We recently introduced a technique to systematically identify when to stop subdividing clusters based on the fundamental principle that cells must differ more between than within clusters.Here we present the corresponding protocol to classify cellular datasets by combining datadriven unsupervised hierarchical clustering with statistical testing.These general-purpose functions are applicable to any cellular dataset that can be organized as two-dimensional matrices of numerical values,including molecula r,physiological,and anatomical datasets.We demonstrate the protocol using cellular data from the Janelia MouseLight project to chara cterize morphological aspects of neurons.
文摘The accelerated advancement of the Internet of Things(IoT)has generated substantial data,including sensitive and private information.Consequently,it is imperative to guarantee the security of data sharing.While facilitating fine-grained access control,Ciphertext Policy Attribute-Based Encryption(CP-ABE)can effectively ensure the confidentiality of shared data.Nevertheless,the conventional centralized CP-ABE scheme is plagued by the issues of keymisuse,key escrow,and large computation,which will result in security risks.This paper suggests a lightweight IoT data security sharing scheme that integrates blockchain technology and CP-ABE to address the abovementioned issues.The integrity and traceability of shared data are guaranteed by the use of blockchain technology to store and verify access transactions.The encryption and decryption operations of the CP-ABE algorithm have been implemented using elliptic curve scalarmultiplication to accommodate lightweight IoT devices,as opposed to themore arithmetic bilinear pairing found in the traditional CP-ABE algorithm.Additionally,a portion of the computation is delegated to the edge nodes to alleviate the computational burden on users.A distributed key management method is proposed to address the issues of key escrow andmisuse.Thismethod employs the edge blockchain to facilitate the storage and distribution of attribute private keys.Meanwhile,data security sharing is enhanced by combining off-chain and on-chain ciphertext storage.The security and performance analysis indicates that the proposed scheme is more efficient and secure.
基金financially supported by the National Natural Science Foundation of China(Nos.U2141207,52171111,and 52001083)the Youth Talent Project of China National Nuclear Corporation(No.CNNC2021Y-TEPHEU01)+3 种基金the China Postdoctoral Science Foundation(No.2020M681077)the Natural Science Foundation of Heilongjiang,China(No.LH2019E030)the Heilongjiang Postdoctoral Science Foundation,China(No.LBH-Z19125)he Heilongjiang Touyan Innovation Team Program,China,and the Natural Science Foundation of Heilongjiang(No.LH2020E060)。
文摘Carbon can change the phase components of low-density steels and influence the mechanical properties.In this study,a new method to control the carbon content and avoid the formation ofδ-ferrite by decarburization treatment was proposed.The microstructural changes and mechanical characteristics with carbon content induced by decarburization were systematically examined.Crussard-Jaoul(C-J)analysis was employed to examine the work hardening characteristics during the tensile test.During decarburization by heat treatments,the carbon content within the austenite phase decreased,while Mn and Al were almost unchanged;this made the steel with full austenite transform into the austenite and ferrite dual phase.Meanwhile,(Ti,V)C carbides existed in both matrix phase and the mole fraction almost the same.In addition,the formation of other carbides restrained.Carbon loss induced a decrease in strength due to the weakening of the carbon solid solution.For the steel with the single austinite,the deformation mode of austenite was the dislocation planar glide,resulting in the formation of microbands.For the dual-phase steel,the deformation occurred by the dislocation planar glide of austenite first,with the increase in strain,the cross slip of ferrite took place,forming dislocation cells in ferrite.At the late stage of deformation,the work hardening of austinite increased rapidly,while that of ferrite increased slightly.
文摘调查和分析元数据标准在健康科学数据中的应用现状,有助于为我国健康科学数据描述中元数据标准的选择、健康科学数据平台的建设提供参考。通过网络调研法对科学数据仓储注册系统(registry of research data repositories,re3data)中的健康科学数据管理平台进行调研,梳理所应用的元数据标准,分析典型元数据标准在平台中的应用情况,并归纳其在健康科学数据描述中的适用性。re3data中各健康科学数据平台共使用14种元数据标准,其中DC、DataCite、DDI、仓储自建元数据标准的使用最为广泛,多数平台组合使用多种元数据标准。各类元数据标准可分为通用型、社会科学型、自建型3类,分别适用于描述健康科学数据通用属性、社会科学研究产生的健康科学数据、特色和专业性强及政府开放的健康科学数据。
基金Funded by the National Natural Science Foundation of China(Nos.U1934206,52108260)China Academy of Railway Sciences Fund(No.2021YJ078)+1 种基金Railway Engineering Construction Standard Project(No.2023-BZWW-006)New Cornerstone Science Foundation through the XPLORER PRIZE。
文摘In order to achieve the large-scale application of manufactured sand in railway high-strength concrete structure,a series of high-strength manufactured sand concrete(HMC)are prepared by taking the manufactured sand lithology(tuff,limestone,basalt,granite),stone powder content(0,5%,10%,15%)and concrete strength grade(C60,C80,C100)as variables.The evolution of mechanical properties of HMC and the correlation between cubic compressive strength and other mechanical properties are studied.Compared to river sand,manufactured sand enhances the cubic compressive strength,axial compressive strength and elastic modulus of concrete,while its potential microcracks weaken the flexural strength and splitting tensile strength of concrete.Stone powder content displays both positive and negative effects on mechanical properties of HMC,and the stone powder content is suggested to be less than 10%.The empirical formulas between cubic compressive strength and other mechanical properties are proposed.
基金supported by the National Natural Science Foundation of China(Nos.51827901,42477191,and 52304033)the Fundamental Research Funds for the Central Universities(No.YJ202449)+1 种基金the Open Research Fund of State Key Laboratory of Geomechanics and Geotechnical Engineering,Institute of Rock and Soil Mechanics,Chinese Academy of Sciences(No.SKLGME022009)the China Postdoctoral Science Foundation(No.2023M742446).
文摘Gas content serves as a critical indicator for assessing the resource potential of deep coal mines and forecasting coal mine gas outburst risks.However,existing sampling technologies face challenges in maintaining the integrity of gas content within samples and are often constrained by estimation errors inherent in empirical formulas,which results in inaccurate gas content measurements.This study introduces a lightweight,in-situ pressure-and gas-preserved corer designed to collect coal samples under the pressure conditions at the sampling point,effectively preventing gas loss during transfer and significantly improving measurement accuracy.Additionally,a gas migration model for deep coal mines was developed to elucidate gas migration characteristics under pressure-preserved coring conditions.The model offers valuable insights for optimizing coring parameters,demonstrating that both minimizing the coring hole diameter and reducing the pressure difference between the coring-point pressure and the original pore pressure can effectively improve the precision of gas content measurements.Coring tests conducted at an experimental base validated the performance of the corer and its effectiveness in sample collection.Furthermore,successful horizontal coring tests conducted in an underground coal mine roadway demonstrated that the measured gas content using pressure-preserved coring was 34%higher than that obtained through open sampling methods.
文摘This study presents an innovative approach to enhancing the security of visual medical data in the generative AI environment through the integration of blockchain technology.By combining the strengths of blockchain and generative AI,the research team aimed to address the timely challenge of safeguarding visual medical content.The participating researchers conducted a comprehensive analysis,examining the vulnerabilities of medical AI services,personal information protection issues,and overall security weaknesses.This multi faceted exploration led to an indepth evaluation of the model’s performance and security.Notably,the correlation between accuracy,detection rate,and error rate was scrutinized.This analysis revealed insights into the model’s strengths and limitations,while the consideration of standard deviation shed light on the model’s stability and performance variability.The study proposed practical improvements,emphasizing the reduction of false negatives to enhance detection rate and leveraging blockchain technology to ensure visual data integrity in medical applications.Applying blockchain to generative AI-created medical content addresses key personal information protection issues.By utilizing the distributed ledger system of blockchain,the research team aimed to protect the privacy and integrity of medical data especially medical images.This approach not only enhances security but also enables transparent and tamperproof record-keeping.Additionally,the use of generative AI models ensures the creation of novel medical content without compromising personal information,further safeguarding patient privacy.In conclusion,this study showcases the potential of blockchain-based solutions in the medical field,particularly in securing sensitive medical data and protecting patient privacy.The proposed approach,combining blockchain and generative AI,offers a promising direction toward more robust and secure medical content management.Further research and advancements in this area will undoubtedly contribute to the development of robust and privacy-preserving healthcare systems,and visual diagnostic systems.
基金supported by the Deanship of Graduate Studies and Scientific Research at University of Bisha for funding this research through the promising program under grant number(UB-Promising-33-1445).
文摘Open networks and heterogeneous services in the Internet of Vehicles(IoV)can lead to security and privacy challenges.One key requirement for such systems is the preservation of user privacy,ensuring a seamless experience in driving,navigation,and communication.These privacy needs are influenced by various factors,such as data collected at different intervals,trip durations,and user interactions.To address this,the paper proposes a Support Vector Machine(SVM)model designed to process large amounts of aggregated data and recommend privacy preserving measures.The model analyzes data based on user demands and interactions with service providers or neighboring infrastructure.It aims to minimize privacy risks while ensuring service continuity and sustainability.The SVMmodel helps validate the system’s reliability by creating a hyperplane that distinguishes between maximum and minimum privacy recommendations.The results demonstrate the effectiveness of the proposed SVM model in enhancing both privacy and service performance.
基金Chongqing Engineering University Undergraduate Innovation and Entrepreneurship Training Program Project:Wireless Fire Automatic Alarm System(Project No.:CXCY2024017)Chongqing Municipal Education Commission Science and Technology Research Project:Development and Research of Chongqing Wireless Fire Automatic Alarm System(Project No.:KJQN202401906)。
文摘This article explores the design of a wireless fire alarm system supported by advanced data fusion technology.It includes discussions on the basic design ideas of the wireless fire alarm system,hardware design analysis,software design analysis,and simulation analysis,all supported by data fusion technology.Hopefully,this analysis can provide some reference for the rational application of data fusion technology to meet the actual design and application requirements of the system.
基金the support of the National Natural Science Foundation of China (Nos. 42250103, 41974073, and 41404053)the Macao Foundation and the preresearch project of Civil Aerospace Technologies (Nos. D020308 and D020303)funded by China’s National Space Administration, the Specialized Research Fund for State Key Laboratories。
文摘We combine gradient data from the Macao Science Satellite-1(MSS-1),CHAllenging Minisatellite Payload(CHAMP),Swarm-A,and Swarm-C satellites to develop a 110-degree lithospheric magnetic field model.We then comprehensively evaluate the performance of the model by power spectral comparisons,correlation analyses,sensitivity matrix assessments,and comparisons with existing lithospheric field models.Results showed that using near east–west gradient data from MSS-1 significantly enhances the model correlation in the spherical harmonic degree(N) range of 45–60 while also mitigating the decline in correlation at higher degrees(N > 60).Furthermore,the unique orbital characteristics of MSS-1 enable its gradient data to provide substantial contributions to modeling in the mid-to low-latitude regions.With continued data acquisition from MSS-1 and further optimization of data processing methods,the performance of the model is expected to improve.