Given the swift proliferation of structural health monitoring(SHM)technology within tunnel engineering,there is a demand on proficiently and precisely imputing the missing monitoring data to uphold the precision of di...Given the swift proliferation of structural health monitoring(SHM)technology within tunnel engineering,there is a demand on proficiently and precisely imputing the missing monitoring data to uphold the precision of disaster prediction.In contrast to other SHM datasets,the monitoring data specific to tunnel engineering exhibits pronounced spatiotemporal correlations.Nevertheless,most methodologies fail to adequately combine these types of correlations.Hence,the objective of this study is to develop spatiotemporal recurrent neural network(ST-RNN)model,which exploits spatiotemporal information to effectively impute missing data within tunnel monitoring systems.ST-RNN consists of two moduli:a temporal module employing recurrent neural network(RNN)to capture temporal dependencies,and a spatial module employing multilayer perceptron(MLP)to capture spatial correlations.To confirm the efficacy of the model,several commonly utilized methods are chosen as baselines for conducting comparative analyses.Furthermore,parametric validity experiments are conducted to illustrate the efficacy of the parameter selection process.The experimentation is conducted using original raw datasets wherein various degrees of continuous missing data are deliberately introduced.The experimental findings indicate that the ST-RNN model,incorporating both spatiotemporal modules,exhibits superior interpolation performance compared to other baseline methods across varying degrees of missing data.This affirms the reliability of the proposed model.展开更多
Do you like animals?Animals are cute.Some people like loyal dogs,some like adorable cats,and others prefer fluffy bunnies.But my favorite animals are naughty hamsters because they are full of energy.With just a little...Do you like animals?Animals are cute.Some people like loyal dogs,some like adorable cats,and others prefer fluffy bunnies.But my favorite animals are naughty hamsters because they are full of energy.With just a little food and water,they can thrive.Plus,they are really affordable,unlike cats and dogs that can cost several hundred or even over a thousand yuan.展开更多
Missing values in radionuclide diffusion datasets can undermine the predictive accuracy and robustness of the machine learning(ML)models.In this study,regression-based missing data imputation method using a light grad...Missing values in radionuclide diffusion datasets can undermine the predictive accuracy and robustness of the machine learning(ML)models.In this study,regression-based missing data imputation method using a light gradient boosting machine(LGBM)algorithm was employed to impute more than 60%of the missing data,establishing a radionuclide diffusion dataset containing 16 input features and 813 instances.The effective diffusion coefficient(D_(e))was predicted using ten ML models.The predictive accuracy of the ensemble meta-models,namely LGBM-extreme gradient boosting(XGB)and LGBM-categorical boosting(CatB),surpassed that of the other ML models,with R^(2)values of 0.94.The models were applied to predict the D_(e)values of EuEDTA^(−)and HCrO_(4)^(−)in saturated compacted bentonites at compactions ranging from 1200 to 1800 kg/m^(3),which were measured using a through-diffusion method.The generalization ability of the LGBM-XGB model surpassed that of LGB-CatB in predicting the D_(e)of HCrO_(4)^(−).Shapley additive explanations identified total porosity as the most significant influencing factor.Additionally,the partial dependence plot analysis technique yielded clearer results in the univariate correlation analysis.This study provides a regression imputation technique to refine radionuclide diffusion datasets,offering deeper insights into analyzing the diffusion mechanism of radionuclides and supporting the safety assessment of the geological disposal of high-level radioactive waste.展开更多
Background Chickens and ducks are vital sources of animal protein for humans.Recent pangenome studies suggest that a single genome is insufficient to represent the genetic information of a species,highlighting the nee...Background Chickens and ducks are vital sources of animal protein for humans.Recent pangenome studies suggest that a single genome is insufficient to represent the genetic information of a species,highlighting the need for more comprehensive genomes.The bird genome has more than tens of microchromosomes,but comparative genomics,annotations,and the discovery of variations are hindered by inadequate telomere-to-telomere level assemblies.We aim to complete the chicken and duck genomes,recover missing genes,and reveal common and unique chromosomal features between birds.Results The near telomere-to-telomere genomes of Silkie Gallus gallus and Mallard Anas platyrhynchos were successfully assembled via multiple high-coverage complementary technologies,with quality values of 36.65 and 44.17 for Silkie and Mallard,respectively;and BUSCO scores of 96.55%and 96.97%for Silkie and Mallard,respectively;the mapping rates reached over 99.52%for both assembled genomes,these evaluation results ensured high completeness and accuracy.We successfully annotated 20,253 and 19,621 protein-coding genes for Silkie and Mallard,respectively,and assembled gap-free sex chromosomes in Mallard for the first time.Comparative analysis revealed that microchromosomes differ from macrochromosomes in terms of GC content,repetitive sequence abundance,gene density,and levels of 5mC methylation.Different types of arrangements of centromeric repeat sequence centromeres exist in both Silkie and the Mallard genomes,with Mallard centromeres being invaded by CR1.The highly heterochromatic W chromosome,which serves as a refuge for ERVs,contains disproportionately long ERVs.Both Silkie and the Mallard genomes presented relatively high 5mC methylation levels on sex chromosomes and microchromosomes,and the telomeres and centromeres presented significantly higher 5mC methylation levels than the whole genome.Finally,we recovered 325 missing genes via our new genomes and annotated TNFA in Mallard for the first time,revealing conserved protein structures and tissue-specific expression.Conclusions The near telomere-to-telomere assemblies in Mallard and Silkie,with the first gap-free sex chromosomes in ducks,significantly enhanced our understanding of genetic structures in birds,specifically highlighting the distinctive chromosome features between the chicken and duck genomes.This foundational work also provides a series of newly identified missing genes for further investigation.展开更多
Industrial data mining usually deals with data from different sources.These heterogeneous datasets describe the same object in different views.However,samples from some of the datasets may be lost.Then the remaining s...Industrial data mining usually deals with data from different sources.These heterogeneous datasets describe the same object in different views.However,samples from some of the datasets may be lost.Then the remaining samples do not correspond one-to-one correctly.Mismatched datasets caused by missing samples make the industrial data unavailable for further machine learning.In order to align the mismatched samples,this article presents a cooperative iteration matching method(CIMM)based on the modified dynamic time warping(DTW).The proposed method regards the sequentially accumulated industrial data as the time series.Mismatched samples are aligned by the DTW.In addition,dynamic constraints are applied to the warping distance of the DTW process to make the alignment more efficient.Then a series of models are trained with the cumulated samples iteratively.Several groups of numerical experiments on different missing patterns and missing locations are designed and analyzed to prove the effectiveness and the applicability of the proposed method.展开更多
Background:As the digital age progresses,fear of missing out(FoMO)is becoming increasingly common,and the impact factor of FOMO needs to be further investigated.This study aims to explore the relationship between psyc...Background:As the digital age progresses,fear of missing out(FoMO)is becoming increasingly common,and the impact factor of FOMO needs to be further investigated.This study aims to explore the relationship between psychological security(PS)and FoMO by analyzing the mediating role of social networking addiction(SNA)and the moderating role of social self-efficacy(SSE).Methods:We collected a sample of 1181 college students(with a mean age of 19.671.38 years)from five universities in a province of China's Mainland through cluster sampling.Data±were gathered using the psychological security questionnaire(PSQ),the FoMO scale,the SNA scale,and the perceived social self-efficacy(PSSE)scale.Data analysis employed independent-sample t-tests,one-way analysis of variance(ANOVA),Harman’s single-factor test,confirmatory factor analysis,and moderated mediation analysis.Results:The results of the mediation model and moderated mediation model analyses showed the following key findings:(1)PS is significantly negatively correlated with FoMO;(2)SNA mediates the relationship between PS and FoMO;(3)SSE positively moderates the relationship between PS and FoMO;and(4)SSE also positively moderates the relationship between PS and SNA.Conclusion:University students’PS not only directly impacts FoMO but also indirectly influences it through SNA.Additionally,SSE positively moderates both the direct path and the first half of the mediation path,indicating that enhancing students’PS and SSE can help alleviate their SNA and FoMO,promoting their psychological and behavioral well-being.展开更多
Missing data handling is vital for multi-sensor information fusion fault diagnosis of motors to prevent the accuracy decay or even model failure,and some promising results have been gained in several current studies.T...Missing data handling is vital for multi-sensor information fusion fault diagnosis of motors to prevent the accuracy decay or even model failure,and some promising results have been gained in several current studies.These studies,however,have the following limitations:1)effective supervision is neglected for missing data across different fault types and 2)imbalance in missing rates among fault types results in inadequate learning during model training.To overcome the above limitations,this paper proposes a dynamic relative advantagedriven multi-fault synergistic diagnosis method to accomplish accurate fault diagnosis of motors under imbalanced missing data rates.Firstly,a cross-fault-type generalized synergistic diagnostic strategy is established based on variational information bottleneck theory,which is able to ensure sufficient supervision in handling missing data.Then,a dynamic relative advantage assessment technique is designed to reduce diagnostic accuracy decay caused by imbalanced missing data rates.The proposed method is validated using multi-sensor data from motor fault simulation experiments,and experimental results demonstrate its effectiveness and superiority in improving diagnostic accuracy and generalization under imbalanced missing data rates.展开更多
The physiological structure and growth of trees in extreme environments(freezing temperatures,prolonged drought,wildfires,pest infestations,and diseases)can be inhibited,including radial growth,and stagnant growth or ...The physiological structure and growth of trees in extreme environments(freezing temperatures,prolonged drought,wildfires,pest infestations,and diseases)can be inhibited,including radial growth,and stagnant growth or missing annual rings is highly possible.In this study,we analyzed the radial growth of Siberian larch(Larix sibirica)in the Hongshanzui area of the Altai Mountains,China.The overall missing ring rate at the sampling point was 2.39%,with years with the highest missing rings since meteorological site data were available(1960)identified as 1960,1961,1971,1973,1985,1987,and 1995.Radial growth in high altitudes was mainly affected by temperatures in May and June(average temperature,average minimum temperature,and average maximum temperature).Frequent periods of freezing may lead to missing annual rings.However,while Larix sibirica shows resilience after prolonged freezing temperatures,it still requires time for the trees to return to normal growth levels.展开更多
0 INTRODUCTION Changbaishan volcanism,located on the border of China and North Korea,has been a subject of extensive research due to its unique geological features and active volcanic history(Wan et al.,2024).Two prim...0 INTRODUCTION Changbaishan volcanism,located on the border of China and North Korea,has been a subject of extensive research due to its unique geological features and active volcanic history(Wan et al.,2024).Two primary models have been proposed to explain the origin of Changbaishan volcanism(CV).展开更多
Handling missing data accurately is critical in clinical research, where data quality directly impacts decision-making and patient outcomes. While deep learning (DL) techniques for data imputation have gained attentio...Handling missing data accurately is critical in clinical research, where data quality directly impacts decision-making and patient outcomes. While deep learning (DL) techniques for data imputation have gained attention, challenges remain, especially when dealing with diverse data types. In this study, we introduce a novel data imputation method based on a modified convolutional neural network, specifically, a Deep Residual-Convolutional Neural Network (DRes-CNN) architecture designed to handle missing values across various datasets. Our approach demonstrates substantial improvements over existing imputation techniques by leveraging residual connections and optimized convolutional layers to capture complex data patterns. We evaluated the model on publicly available datasets, including Medical Information Mart for Intensive Care (MIMIC-III and MIMIC-IV), which contain critical care patient data, and the Beijing Multi-Site Air Quality dataset, which measures environmental air quality. The proposed DRes-CNN method achieved a root mean square error (RMSE) of 0.00006, highlighting its high accuracy and robustness. We also compared with Low Light-Convolutional Neural Network (LL-CNN) and U-Net methods, which had RMSE values of 0.00075 and 0.00073, respectively. This represented an improvement of approximately 92% over LL-CNN and 91% over U-Net. The results showed that this DRes-CNN-based imputation method outperforms current state-of-the-art models. These results established DRes-CNN as a reliable solution for addressing missing data.展开更多
The Central Institute of Forensic Science(CIFS)has been providing DNA testing services to Thai people since 2002.Bone accounts for majority of the biological specimens tested,constituting approximately 26%in total evi...The Central Institute of Forensic Science(CIFS)has been providing DNA testing services to Thai people since 2002.Bone accounts for majority of the biological specimens tested,constituting approximately 26%in total evidence.DNA recovery from the bone is challenging owing to degradation and the presence of inhibitors.Therefore,guidelines for bone selection,extraction,and DNA typing are essential for the routine laboratory of CIFS to maximize DNA yield,and minimize time and cost.In this study,we extracted three types of bones:femur,occipital,and petrous,from 12 bodies using a modified organic extraction and silica-based method.The success rate of the Short Tandem Repeat(STR)typing was determined through the number of reportable loci.Furthermore,analysis of mitochondrial DNA(mtDNA)was performed using the massively parallel sequencing technique.Coverage and variant analyses of all samples were evaluated.The results indicate that the femur exhibits the highest success rate in STR typing.The results,in decreasing order,are as follows:femur>petrous>occipital.We determined that silica-based extraction is the most efficient technique for the STR typing;however,modified organic extraction can be used as an alternative method in obtaining mtDNA.The outcome from this study could serve as a guide for identifying human remains and missing persons in the CIFS laboratory,as well as other Thai forensic laboratories.展开更多
Amyotrophic lateral sclerosis (ALS) is a fatal neurodegenerative disorder characterized by the progressive degeneration of upper and lower motor neurons in the brainstem and spinal cord,leading to muscle weakness,para...Amyotrophic lateral sclerosis (ALS) is a fatal neurodegenerative disorder characterized by the progressive degeneration of upper and lower motor neurons in the brainstem and spinal cord,leading to muscle weakness,paralysis,and respiratory failure (Morgan and Orrell,2016).展开更多
Accurate traffic flow prediction(TFP)is vital for efficient and sustainable transportation management and the development of intelligent traffic systems.However,missing data in real-world traffic datasets poses a sign...Accurate traffic flow prediction(TFP)is vital for efficient and sustainable transportation management and the development of intelligent traffic systems.However,missing data in real-world traffic datasets poses a significant challenge to maintaining prediction precision.This study introduces REPTF-TMDI,a novel method that combines a Reduced Error Pruning Tree Forest(REPTree Forest)with a newly proposed Time-based Missing Data Imputation(TMDI)approach.The REP Tree Forest,an ensemble learning approach,is tailored for time-related traffic data to enhance predictive accuracy and support the evolution of sustainable urbanmobility solutions.Meanwhile,the TMDI approach exploits temporal patterns to estimate missing values reliably whenever empty fields are encountered.The proposed method was evaluated using hourly traffic flow data from a major U.S.roadway spanning 2012-2018,incorporating temporal features(e.g.,hour,day,month,year,weekday),holiday indicator,and weather conditions(temperature,rain,snow,and cloud coverage).Experimental results demonstrated that the REPTF-TMDI method outperformed conventional imputation techniques across various missing data ratios by achieving an average 11.76%improvement in terms of correlation coefficient(R).Furthermore,REPTree Forest achieved improvements of 68.62%in RMSE and 70.52%in MAE compared to existing state-of-the-art models.These findings highlight the method’s ability to significantly boost traffic flow prediction accuracy,even in the presence of missing data,thereby contributing to the broader objectives of sustainable urban transportation systems.展开更多
Rhododendron is famous for its high ornamental value.However,the genus is taxonomically difficult and the relationships within Rhododendron remain unresolved.In addition,the origin of key morphological characters with...Rhododendron is famous for its high ornamental value.However,the genus is taxonomically difficult and the relationships within Rhododendron remain unresolved.In addition,the origin of key morphological characters with high horticulture value need to be explored.Both problems largely hinder utilization of germplasm resources.Most studies attempted to disentangle the phylogeny of Rhododendron,but only used a few genomic markers and lacked large-scale sampling,resulting in low clade support and contradictory phylogenetic signals.Here,we used restriction-site associated DNA sequencing(RAD-seq)data and morphological traits for 144 species of Rhododendron,representing all subgenera and most sections and subsections of this species-rich genus,to decipher its intricate evolutionary history and reconstruct ancestral state.Our results revealed high resolutions at subgenera and section levels of Rhododendron based on RAD-seq data.Both optimal phylogenetic tree and split tree recovered five lineages among Rhododendron.Subg.Therorhodion(cladeⅠ)formed the basal lineage.Subg.Tsutsusi and Azaleastrum formed cladeⅡand had sister relationships.CladeⅢincluded all scaly rhododendron species.Subg.Pentanthera(cladeⅣ)formed a sister group to Subg.Hymenanthes(cladeⅤ).The results of ancestral state reconstruction showed that Rhododendron ancestor was a deciduous woody plant with terminal inflorescence,ten stamens,leaf blade without scales and broadly funnelform corolla with pink or purple color.This study shows significant distinguishability to resolve the evolutionary history of Rhododendron based on high clade support of phylogenetic tree constructed by RAD-seq data.It also provides an example to resolve discordant signals in phylogenetic trees and demonstrates the application feasibility of RAD-seq with large amounts of missing data in deciphering intricate evolutionary relationships.Additionally,the reconstructed ancestral state of six important characters provides insights into the innovation of key characters in Rhododendron.展开更多
With the increasing complexity of production processes,there has been a growing focus on online algorithms within the domain of multivariate statistical process control(SPC).Nonetheless,conventional methods,based on t...With the increasing complexity of production processes,there has been a growing focus on online algorithms within the domain of multivariate statistical process control(SPC).Nonetheless,conventional methods,based on the assumption of complete data obtained at uniform time intervals,exhibit suboptimal performance in the presence of missing data.In our pursuit of maximizing available information,we propose an adaptive exponentially weighted moving average(EWMA)control chart employing a weighted imputation approach that leverages the relationships between complete and incomplete data.Specifically,we introduce two recovery methods:an improved K-Nearest Neighbors imputing value and the conventional univariate EWMA statistic.We then formulate an adaptive weighting function to amalgamate these methods,assigning a diminished weight to the EWMA statistic when the sample information suggests an increased likelihood of the process being out of control,and vice versa.The robustness and sensitivity of the proposed scheme are shown through simulation results and an illustrative example.展开更多
Deformation monitoring is a critical measure for intuitively reflecting the operational behavior of a dam.However,the deformation monitoring data are often incomplete due to environmental changes,monitoring instrument...Deformation monitoring is a critical measure for intuitively reflecting the operational behavior of a dam.However,the deformation monitoring data are often incomplete due to environmental changes,monitoring instrument faults,and human operational errors,thereby often hindering the accurate assessment of actual deformation patterns.This study proposed a method for quantifying deformation similarity between measurement points by recognizing the spatiotemporal characteristics of concrete dam deformation monitoring data.It introduces a spatiotemporal clustering analysis of the concrete dam deformation behavior and employs the support vector machine model to address the missing data in concrete dam deformation monitoring.The proposed method was validated in a concrete dam project,with the model error maintaining within 5%,demonstrating its effectiveness in processing missing deformation data.This approach enhances the capability of early-warning systems and contributes to enhanced dam safety management.展开更多
Ethical statements were not included in the published version of the following articles that appeared in previous issues of Journal of Integrative Agriculture.The appropriate statements provided by the Authors are inc...Ethical statements were not included in the published version of the following articles that appeared in previous issues of Journal of Integrative Agriculture.The appropriate statements provided by the Authors are included below.展开更多
Ethical statements were not included in the published version of the following articles that appeared in previous issues of Journal of Integrative Agriculture.The appropriate statements provided by the Authors are inc...Ethical statements were not included in the published version of the following articles that appeared in previous issues of Journal of Integrative Agriculture.The appropriate statements provided by the Authors are included below.展开更多
Ethical statements were not included in the published version of the following articles that appeared in previous issues of Journal of Integrative Agriculture.The appropriate statements provided by the Authors are inc...Ethical statements were not included in the published version of the following articles that appeared in previous issues of Journal of Integrative Agriculture.The appropriate statements provided by the Authors are included below.展开更多
Ethical statements were not included in the published version of the following articles that appeared in previous issues of Journal of Integrative Agriculture.The appropriate statements provided by the Authors are inc...Ethical statements were not included in the published version of the following articles that appeared in previous issues of Journal of Integrative Agriculture.The appropriate statements provided by the Authors are included below.展开更多
基金supported by the National Natural Science Foundation of China(Grant Nos.51991395 and 42293355)geological survey project of China Geological Survey:Support for Geo-hazard monitoring,early warning and prevention(Grant No.DD20230085).
文摘Given the swift proliferation of structural health monitoring(SHM)technology within tunnel engineering,there is a demand on proficiently and precisely imputing the missing monitoring data to uphold the precision of disaster prediction.In contrast to other SHM datasets,the monitoring data specific to tunnel engineering exhibits pronounced spatiotemporal correlations.Nevertheless,most methodologies fail to adequately combine these types of correlations.Hence,the objective of this study is to develop spatiotemporal recurrent neural network(ST-RNN)model,which exploits spatiotemporal information to effectively impute missing data within tunnel monitoring systems.ST-RNN consists of two moduli:a temporal module employing recurrent neural network(RNN)to capture temporal dependencies,and a spatial module employing multilayer perceptron(MLP)to capture spatial correlations.To confirm the efficacy of the model,several commonly utilized methods are chosen as baselines for conducting comparative analyses.Furthermore,parametric validity experiments are conducted to illustrate the efficacy of the parameter selection process.The experimentation is conducted using original raw datasets wherein various degrees of continuous missing data are deliberately introduced.The experimental findings indicate that the ST-RNN model,incorporating both spatiotemporal modules,exhibits superior interpolation performance compared to other baseline methods across varying degrees of missing data.This affirms the reliability of the proposed model.
文摘Do you like animals?Animals are cute.Some people like loyal dogs,some like adorable cats,and others prefer fluffy bunnies.But my favorite animals are naughty hamsters because they are full of energy.With just a little food and water,they can thrive.Plus,they are really affordable,unlike cats and dogs that can cost several hundred or even over a thousand yuan.
基金supported by the National Natural Science Foundation of China(No.12475340 and 12375350)Special Branch project of South Taihu Lakethe Scientific Research Fund of Zhejiang Provincial Education Department(No.Y202456326).
文摘Missing values in radionuclide diffusion datasets can undermine the predictive accuracy and robustness of the machine learning(ML)models.In this study,regression-based missing data imputation method using a light gradient boosting machine(LGBM)algorithm was employed to impute more than 60%of the missing data,establishing a radionuclide diffusion dataset containing 16 input features and 813 instances.The effective diffusion coefficient(D_(e))was predicted using ten ML models.The predictive accuracy of the ensemble meta-models,namely LGBM-extreme gradient boosting(XGB)and LGBM-categorical boosting(CatB),surpassed that of the other ML models,with R^(2)values of 0.94.The models were applied to predict the D_(e)values of EuEDTA^(−)and HCrO_(4)^(−)in saturated compacted bentonites at compactions ranging from 1200 to 1800 kg/m^(3),which were measured using a through-diffusion method.The generalization ability of the LGBM-XGB model surpassed that of LGB-CatB in predicting the D_(e)of HCrO_(4)^(−).Shapley additive explanations identified total porosity as the most significant influencing factor.Additionally,the partial dependence plot analysis technique yielded clearer results in the univariate correlation analysis.This study provides a regression imputation technique to refine radionuclide diffusion datasets,offering deeper insights into analyzing the diffusion mechanism of radionuclides and supporting the safety assessment of the geological disposal of high-level radioactive waste.
基金supported by the National Key R&D Program of China(2022YFF1000100,2023YFD1300300)the National Natural Science Foundation of China(31572388,31972525)the China Agriculture Research System of MOF and MARA(CARS-41)。
文摘Background Chickens and ducks are vital sources of animal protein for humans.Recent pangenome studies suggest that a single genome is insufficient to represent the genetic information of a species,highlighting the need for more comprehensive genomes.The bird genome has more than tens of microchromosomes,but comparative genomics,annotations,and the discovery of variations are hindered by inadequate telomere-to-telomere level assemblies.We aim to complete the chicken and duck genomes,recover missing genes,and reveal common and unique chromosomal features between birds.Results The near telomere-to-telomere genomes of Silkie Gallus gallus and Mallard Anas platyrhynchos were successfully assembled via multiple high-coverage complementary technologies,with quality values of 36.65 and 44.17 for Silkie and Mallard,respectively;and BUSCO scores of 96.55%and 96.97%for Silkie and Mallard,respectively;the mapping rates reached over 99.52%for both assembled genomes,these evaluation results ensured high completeness and accuracy.We successfully annotated 20,253 and 19,621 protein-coding genes for Silkie and Mallard,respectively,and assembled gap-free sex chromosomes in Mallard for the first time.Comparative analysis revealed that microchromosomes differ from macrochromosomes in terms of GC content,repetitive sequence abundance,gene density,and levels of 5mC methylation.Different types of arrangements of centromeric repeat sequence centromeres exist in both Silkie and the Mallard genomes,with Mallard centromeres being invaded by CR1.The highly heterochromatic W chromosome,which serves as a refuge for ERVs,contains disproportionately long ERVs.Both Silkie and the Mallard genomes presented relatively high 5mC methylation levels on sex chromosomes and microchromosomes,and the telomeres and centromeres presented significantly higher 5mC methylation levels than the whole genome.Finally,we recovered 325 missing genes via our new genomes and annotated TNFA in Mallard for the first time,revealing conserved protein structures and tissue-specific expression.Conclusions The near telomere-to-telomere assemblies in Mallard and Silkie,with the first gap-free sex chromosomes in ducks,significantly enhanced our understanding of genetic structures in birds,specifically highlighting the distinctive chromosome features between the chicken and duck genomes.This foundational work also provides a series of newly identified missing genes for further investigation.
基金the Key National Natural Science Foundation of China(No.U1864211)the National Natural Science Foundation of China(No.11772191)the Natural Science Foundation of Shanghai(No.21ZR1431500)。
文摘Industrial data mining usually deals with data from different sources.These heterogeneous datasets describe the same object in different views.However,samples from some of the datasets may be lost.Then the remaining samples do not correspond one-to-one correctly.Mismatched datasets caused by missing samples make the industrial data unavailable for further machine learning.In order to align the mismatched samples,this article presents a cooperative iteration matching method(CIMM)based on the modified dynamic time warping(DTW).The proposed method regards the sequentially accumulated industrial data as the time series.Mismatched samples are aligned by the DTW.In addition,dynamic constraints are applied to the warping distance of the DTW process to make the alignment more efficient.Then a series of models are trained with the cumulated samples iteratively.Several groups of numerical experiments on different missing patterns and missing locations are designed and analyzed to prove the effectiveness and the applicability of the proposed method.
基金supported by the Jiangxi Province Think Tank Research Project(ZK202406)the 2023 Jiangxi Provincial Health Commission Research Project(52524010)。
文摘Background:As the digital age progresses,fear of missing out(FoMO)is becoming increasingly common,and the impact factor of FOMO needs to be further investigated.This study aims to explore the relationship between psychological security(PS)and FoMO by analyzing the mediating role of social networking addiction(SNA)and the moderating role of social self-efficacy(SSE).Methods:We collected a sample of 1181 college students(with a mean age of 19.671.38 years)from five universities in a province of China's Mainland through cluster sampling.Data±were gathered using the psychological security questionnaire(PSQ),the FoMO scale,the SNA scale,and the perceived social self-efficacy(PSSE)scale.Data analysis employed independent-sample t-tests,one-way analysis of variance(ANOVA),Harman’s single-factor test,confirmatory factor analysis,and moderated mediation analysis.Results:The results of the mediation model and moderated mediation model analyses showed the following key findings:(1)PS is significantly negatively correlated with FoMO;(2)SNA mediates the relationship between PS and FoMO;(3)SSE positively moderates the relationship between PS and FoMO;and(4)SSE also positively moderates the relationship between PS and SNA.Conclusion:University students’PS not only directly impacts FoMO but also indirectly influences it through SNA.Additionally,SSE positively moderates both the direct path and the first half of the mediation path,indicating that enhancing students’PS and SSE can help alleviate their SNA and FoMO,promoting their psychological and behavioral well-being.
文摘Missing data handling is vital for multi-sensor information fusion fault diagnosis of motors to prevent the accuracy decay or even model failure,and some promising results have been gained in several current studies.These studies,however,have the following limitations:1)effective supervision is neglected for missing data across different fault types and 2)imbalance in missing rates among fault types results in inadequate learning during model training.To overcome the above limitations,this paper proposes a dynamic relative advantagedriven multi-fault synergistic diagnosis method to accomplish accurate fault diagnosis of motors under imbalanced missing data rates.Firstly,a cross-fault-type generalized synergistic diagnostic strategy is established based on variational information bottleneck theory,which is able to ensure sufficient supervision in handling missing data.Then,a dynamic relative advantage assessment technique is designed to reduce diagnostic accuracy decay caused by imbalanced missing data rates.The proposed method is validated using multi-sensor data from motor fault simulation experiments,and experimental results demonstrate its effectiveness and superiority in improving diagnostic accuracy and generalization under imbalanced missing data rates.
基金supported by the Natural Science Foundation of Xinjiang Uigur Autonomous Region−Science Fund for Distinguished Young Scholars(2022D01E105)the Natural Key Research and Development Program(Inter-governmental Key and Special Project,2023YFE0102700)+2 种基金Tianshan Talent Training Program−Young Scientific and Technological Innovation Talent(2023TSYCCX0076)the Science and Technology Development Fund Project of Institute of Desert Meteorology,China Meteorological Administration(KJFZ202306,KJFZ202406)Xinjiang Regional collaborative innovation project(2022E01045)。
文摘The physiological structure and growth of trees in extreme environments(freezing temperatures,prolonged drought,wildfires,pest infestations,and diseases)can be inhibited,including radial growth,and stagnant growth or missing annual rings is highly possible.In this study,we analyzed the radial growth of Siberian larch(Larix sibirica)in the Hongshanzui area of the Altai Mountains,China.The overall missing ring rate at the sampling point was 2.39%,with years with the highest missing rings since meteorological site data were available(1960)identified as 1960,1961,1971,1973,1985,1987,and 1995.Radial growth in high altitudes was mainly affected by temperatures in May and June(average temperature,average minimum temperature,and average maximum temperature).Frequent periods of freezing may lead to missing annual rings.However,while Larix sibirica shows resilience after prolonged freezing temperatures,it still requires time for the trees to return to normal growth levels.
基金support from the National Natural Science Foundation of China(No.42276049)。
文摘0 INTRODUCTION Changbaishan volcanism,located on the border of China and North Korea,has been a subject of extensive research due to its unique geological features and active volcanic history(Wan et al.,2024).Two primary models have been proposed to explain the origin of Changbaishan volcanism(CV).
基金supported by the Intelligent System Research Group(ISysRG)supported by Universitas Sriwijaya funded by the Competitive Research 2024.
文摘Handling missing data accurately is critical in clinical research, where data quality directly impacts decision-making and patient outcomes. While deep learning (DL) techniques for data imputation have gained attention, challenges remain, especially when dealing with diverse data types. In this study, we introduce a novel data imputation method based on a modified convolutional neural network, specifically, a Deep Residual-Convolutional Neural Network (DRes-CNN) architecture designed to handle missing values across various datasets. Our approach demonstrates substantial improvements over existing imputation techniques by leveraging residual connections and optimized convolutional layers to capture complex data patterns. We evaluated the model on publicly available datasets, including Medical Information Mart for Intensive Care (MIMIC-III and MIMIC-IV), which contain critical care patient data, and the Beijing Multi-Site Air Quality dataset, which measures environmental air quality. The proposed DRes-CNN method achieved a root mean square error (RMSE) of 0.00006, highlighting its high accuracy and robustness. We also compared with Low Light-Convolutional Neural Network (LL-CNN) and U-Net methods, which had RMSE values of 0.00075 and 0.00073, respectively. This represented an improvement of approximately 92% over LL-CNN and 91% over U-Net. The results showed that this DRes-CNN-based imputation method outperforms current state-of-the-art models. These results established DRes-CNN as a reliable solution for addressing missing data.
文摘The Central Institute of Forensic Science(CIFS)has been providing DNA testing services to Thai people since 2002.Bone accounts for majority of the biological specimens tested,constituting approximately 26%in total evidence.DNA recovery from the bone is challenging owing to degradation and the presence of inhibitors.Therefore,guidelines for bone selection,extraction,and DNA typing are essential for the routine laboratory of CIFS to maximize DNA yield,and minimize time and cost.In this study,we extracted three types of bones:femur,occipital,and petrous,from 12 bodies using a modified organic extraction and silica-based method.The success rate of the Short Tandem Repeat(STR)typing was determined through the number of reportable loci.Furthermore,analysis of mitochondrial DNA(mtDNA)was performed using the massively parallel sequencing technique.Coverage and variant analyses of all samples were evaluated.The results indicate that the femur exhibits the highest success rate in STR typing.The results,in decreasing order,are as follows:femur>petrous>occipital.We determined that silica-based extraction is the most efficient technique for the STR typing;however,modified organic extraction can be used as an alternative method in obtaining mtDNA.The outcome from this study could serve as a guide for identifying human remains and missing persons in the CIFS laboratory,as well as other Thai forensic laboratories.
文摘Amyotrophic lateral sclerosis (ALS) is a fatal neurodegenerative disorder characterized by the progressive degeneration of upper and lower motor neurons in the brainstem and spinal cord,leading to muscle weakness,paralysis,and respiratory failure (Morgan and Orrell,2016).
文摘Accurate traffic flow prediction(TFP)is vital for efficient and sustainable transportation management and the development of intelligent traffic systems.However,missing data in real-world traffic datasets poses a significant challenge to maintaining prediction precision.This study introduces REPTF-TMDI,a novel method that combines a Reduced Error Pruning Tree Forest(REPTree Forest)with a newly proposed Time-based Missing Data Imputation(TMDI)approach.The REP Tree Forest,an ensemble learning approach,is tailored for time-related traffic data to enhance predictive accuracy and support the evolution of sustainable urbanmobility solutions.Meanwhile,the TMDI approach exploits temporal patterns to estimate missing values reliably whenever empty fields are encountered.The proposed method was evaluated using hourly traffic flow data from a major U.S.roadway spanning 2012-2018,incorporating temporal features(e.g.,hour,day,month,year,weekday),holiday indicator,and weather conditions(temperature,rain,snow,and cloud coverage).Experimental results demonstrated that the REPTF-TMDI method outperformed conventional imputation techniques across various missing data ratios by achieving an average 11.76%improvement in terms of correlation coefficient(R).Furthermore,REPTree Forest achieved improvements of 68.62%in RMSE and 70.52%in MAE compared to existing state-of-the-art models.These findings highlight the method’s ability to significantly boost traffic flow prediction accuracy,even in the presence of missing data,thereby contributing to the broader objectives of sustainable urban transportation systems.
基金supported by Ten Thousand Talent Program of Yunnan Province(Grant No.YNWR-QNBJ-2018-174)the Key Basic Research Program of Yunnan Province,China(Grant No.202101BC070003)+3 种基金National Natural Science Foundation of China(Grant No.31901237)Conservation Program for Plant Species with Extremely Small Populations in Yunnan Province(Grant No.2022SJ07X-03)Key Technologies Research for the Germplasmof Important Woody Flowers in Yunnan Province(Grant No.202302AE090018)Natural Science Foundation of Guizhou Province(Grant No.Qiankehejichu-ZK2021yiban 089&Qiankehejichu-ZK2023yiban 035)。
文摘Rhododendron is famous for its high ornamental value.However,the genus is taxonomically difficult and the relationships within Rhododendron remain unresolved.In addition,the origin of key morphological characters with high horticulture value need to be explored.Both problems largely hinder utilization of germplasm resources.Most studies attempted to disentangle the phylogeny of Rhododendron,but only used a few genomic markers and lacked large-scale sampling,resulting in low clade support and contradictory phylogenetic signals.Here,we used restriction-site associated DNA sequencing(RAD-seq)data and morphological traits for 144 species of Rhododendron,representing all subgenera and most sections and subsections of this species-rich genus,to decipher its intricate evolutionary history and reconstruct ancestral state.Our results revealed high resolutions at subgenera and section levels of Rhododendron based on RAD-seq data.Both optimal phylogenetic tree and split tree recovered five lineages among Rhododendron.Subg.Therorhodion(cladeⅠ)formed the basal lineage.Subg.Tsutsusi and Azaleastrum formed cladeⅡand had sister relationships.CladeⅢincluded all scaly rhododendron species.Subg.Pentanthera(cladeⅣ)formed a sister group to Subg.Hymenanthes(cladeⅤ).The results of ancestral state reconstruction showed that Rhododendron ancestor was a deciduous woody plant with terminal inflorescence,ten stamens,leaf blade without scales and broadly funnelform corolla with pink or purple color.This study shows significant distinguishability to resolve the evolutionary history of Rhododendron based on high clade support of phylogenetic tree constructed by RAD-seq data.It also provides an example to resolve discordant signals in phylogenetic trees and demonstrates the application feasibility of RAD-seq with large amounts of missing data in deciphering intricate evolutionary relationships.Additionally,the reconstructed ancestral state of six important characters provides insights into the innovation of key characters in Rhododendron.
文摘With the increasing complexity of production processes,there has been a growing focus on online algorithms within the domain of multivariate statistical process control(SPC).Nonetheless,conventional methods,based on the assumption of complete data obtained at uniform time intervals,exhibit suboptimal performance in the presence of missing data.In our pursuit of maximizing available information,we propose an adaptive exponentially weighted moving average(EWMA)control chart employing a weighted imputation approach that leverages the relationships between complete and incomplete data.Specifically,we introduce two recovery methods:an improved K-Nearest Neighbors imputing value and the conventional univariate EWMA statistic.We then formulate an adaptive weighting function to amalgamate these methods,assigning a diminished weight to the EWMA statistic when the sample information suggests an increased likelihood of the process being out of control,and vice versa.The robustness and sensitivity of the proposed scheme are shown through simulation results and an illustrative example.
基金supported by the National Key R&D Program of China(Grant No.2022YFC3005401)the Fundamental Research Funds for the Central Universities(Grant No.B230201013)+2 种基金the National Natural Science Foundation of China(Grants No.52309152,U2243223,and U23B20150)the Natural Science Foundation of Jiangsu Province(Grant No.BK20220978)the Open Fund of National Dam Safety Research Center(Grant No.CX2023B03).
文摘Deformation monitoring is a critical measure for intuitively reflecting the operational behavior of a dam.However,the deformation monitoring data are often incomplete due to environmental changes,monitoring instrument faults,and human operational errors,thereby often hindering the accurate assessment of actual deformation patterns.This study proposed a method for quantifying deformation similarity between measurement points by recognizing the spatiotemporal characteristics of concrete dam deformation monitoring data.It introduces a spatiotemporal clustering analysis of the concrete dam deformation behavior and employs the support vector machine model to address the missing data in concrete dam deformation monitoring.The proposed method was validated in a concrete dam project,with the model error maintaining within 5%,demonstrating its effectiveness in processing missing deformation data.This approach enhances the capability of early-warning systems and contributes to enhanced dam safety management.
文摘Ethical statements were not included in the published version of the following articles that appeared in previous issues of Journal of Integrative Agriculture.The appropriate statements provided by the Authors are included below.
文摘Ethical statements were not included in the published version of the following articles that appeared in previous issues of Journal of Integrative Agriculture.The appropriate statements provided by the Authors are included below.
文摘Ethical statements were not included in the published version of the following articles that appeared in previous issues of Journal of Integrative Agriculture.The appropriate statements provided by the Authors are included below.
文摘Ethical statements were not included in the published version of the following articles that appeared in previous issues of Journal of Integrative Agriculture.The appropriate statements provided by the Authors are included below.