In their recent paper, Pereira et al. (2025) claim that validation is overlooked in mapping and modelling of ecosystem services (ES). They state that “many studies lack critical evaluation of the results and no validation is provided” and that “the validation step is largely overlooked”. This assertion may have been true several years ago, for example, when Ochoa and Urbina-Cardona (2017) made a similar observation. However, there has been much work on ES model validation over the last decade.
Objective: This study aimed to examine the reliability and validity of the Chinese version of the Behavioral Inhibition System/Behavioral Activation System (BIS/BAS) scales among stroke survivors. Methods: The cross-sectional study was conducted at four comprehensive hospitals in Taizhou, Jiangsu, China. A sample of 232 first-ever stroke survivors was recruited from June to August 2023. Validity was examined using face validity and construct validity, the latter assessed with confirmatory factor analysis (CFA) and known-group analysis. Reliability was evaluated by internal consistency and test-retest reliability. Results: The BIS/BAS scales demonstrated satisfactory face validity. The findings of the CFAs supported the original four-factor structure of BAS-reward, BAS-drive, BAS-fun seeking, and BIS with acceptable model fit indices. Discriminative validity, assessed via known-group analysis, indicated that stroke survivors with probable depression had significantly lower mean BAS-reward, BAS-drive, and BAS-fun seeking scores (P < 0.001) and a higher mean BIS score (P = 0.028) compared to those without probable depression. The internal consistency, measured by Cronbach's α coefficients for the subscales, ranged from 0.669 to 0.964. Test-retest reliability, assessed using intra-class correlation coefficients, ranged from 0.61 to 0.93. Conclusions: The Chinese version of the BIS/BAS scales could be a reliable and valid instrument for measuring behavioral activation among stroke survivors.
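The internal-consistency figures reported above are Cronbach's α values. As a minimal sketch of how such a coefficient is computed from item-level responses, the following uses the standard formula α = k/(k−1) · (1 − Σ item variances / variance of total scores). The function name `cronbach_alpha` and the example responses are hypothetical and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(item_scores):
    """Cronbach's alpha for one subscale.

    item_scores: list of k lists, one per item, each holding the scores
    of the same n respondents in the same order.
    """
    k = len(item_scores)
    n = len(item_scores[0])
    # Total score of each respondent across all items
    totals = [sum(item[i] for item in item_scores) for i in range(n)]
    # Sum of the sample variances of the individual items
    sum_item_var = sum(variance(item) for item in item_scores)
    return (k / (k - 1)) * (1 - sum_item_var / variance(totals))


# Hypothetical 4-item subscale answered by 5 respondents (illustrative only)
example = [
    [3, 4, 2, 4, 1],
    [3, 3, 2, 4, 2],
    [4, 4, 1, 3, 1],
    [2, 4, 2, 4, 1],
]
alpha = cronbach_alpha(example)
print(round(alpha, 3))
```

Values closer to 1 indicate that the items of a subscale vary together, which is what the reported range of 0.669 to 0.964 summarizes per subscale.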
Ecosystem services (ES) mapping and models have advanced in recent years. Improvements were made, and the assessments have transitioned from qualitative to quantitative. Although this is an important advancement, the validation step in ES mapping and modelling has been overlooked, which raises an important question about the credibility of the outcomes. This has been an important and unsolved issue in the ES research community that needs to be tackled. This highlight paper discusses the importance of validating single ES mapping and models. It is important to conduct this validation using field or proximal/remote sensing raw data and not data from other models or stakeholder evaluation. A validation step should be mandatory in ES frameworks since it can assess the models' veracity, contribute to identifying the models' weaknesses and strengths, and ultimately represent a scientific advance in the field. This is easier to apply to the biophysical mapping and models of regulating and provisioning ES than to cultural ES, as the latter rely more on perception and cultural contexts. Also, ES supply models are easier to validate than demand and flow models. Robust and well-grounded models are essential for ensuring the reliability of individual ES maps and models and should be integrated into decision-making processes. Although several challenges arise related to the costs of data collection, which are in several cases prohibitive, and the time and expertise needed to conduct this sampling and analysis, this is likely an imperative step that needs to be considered in the future. This will be beneficial in establishing ES research and improving decision-making and wellbeing.
Background: Sport climbing is becoming incredibly popular both in the general population and among athletes. No consensus exists regarding evidence-based sport-specific performance evaluation; therefore, this systematic review was aimed at analyzing determinants of sport climbing performance and evaluation methods by comparing climbers of different levels. Methods: PubMed, Scopus, and Web of Science were searched up to December 20, 2022. Studies providing the self-reported climbing ability associated with different functional outcomes in groups of climbers of contiguous performance levels were eligible. Results: 74 studies were finally included. Various methods have been proposed to evaluate determinants of sport climbing performance. Climbing-specific assessments were able to discriminate climbers of different levels when compared to general functional tests. Test validity was high for climbing-specific cardiorespiratory endurance as well as muscular strength, endurance, and power; similarly, reliability was good except for cardiorespiratory endurance. Climbing-specific flexibility assessment showed high reliability but moderate validity, whereas balance showed low validity. Considerable conflicting evidence was found regarding anthropometric characteristics. Conclusion: The present analysis identified cardiorespiratory endurance as well as muscular strength, endurance, and power as determinants of sport climbing performance. In contrast, balance, flexibility, and anthropometric characteristics seem to count less. This review also proposes an evidence-based Functional Sport Climbing test battery for assessing performance determinants, which includes tests that have been identified to be valid, reliable, and feasible. While athletes and coaches should rely on evidence-based and standardized evaluation methods, researchers may design specific large-scale trials as a resource for providing additional, homogenous, and comparable data to improve scientific evidence and professionalism in this popular sport discipline.
The pramāṇa system, outlined by Dignāga (c. 480-540 CE) and further elaborated upon by Dharmakīrti (c. 600-660 CE), is an important part of the Buddhist logico-epistemological tradition. Within this pramāṇa system, self-awareness (svasaṃvedana) is considered a hallmark of the access to mental states and factors. However, key research often interprets the valid cognition system and self-awareness separately and lacks specific descriptions of their interrelationship. This paper argues that self-awareness is not merely a byproduct of valid cognition but is intimately connected to it. Specifically, I posit that self-awareness should be regarded as the ultimate result of valid cognition within the pramāṇa system.
A self-centering bridge bent equipped with energy-dissipation (ED) beams is proposed. Quasi-static tests are conducted on self-centering bridge bents, both with and without ED beams, to validate the accuracy of the corresponding numerical models. The effects of various parameters, such as the web area of ED beams, prestressing force of tendons, tendon arrangements, and number of column segments, on the seismic performance of self-centering bridge bents with ED beams are evaluated using the validated numerical model. The results demonstrate that the numerical models accurately replicate the quasi-static test results, with average errors in the lateral force remaining below 9.6%. The web area of ED beams significantly affects the strength, cumulative energy dissipation, and relative self-centering index (RSI) of the self-centering bridge bents. Increasing the prestressing force enhances the lateral force and self-centering capability of the bridge bents but has minimal effect on their ED capacity. Reducing the number of segments in each column enhances the lateral force and cumulative hysteretic energy dissipation of the self-centering bridge bents while exerting an insignificant effect on the RSI. Thus, the proposed novel system is highly suitable for double- or multicolumn piers supporting bridges in regions prone to strong earthquakes.
It is fundamental and useful to investigate how deep learning forecasting models (DLMs) perform compared to operational oceanography forecast systems (OFSs). However, few studies have intercompared their performances using an identical reference. In this study, three physically reasonable DLMs are implemented for the forecasting of the sea surface temperature (SST), sea level anomaly (SLA), and sea surface velocity in the South China Sea. The DLMs are validated against both the testing dataset and the “OceanPredict” Class 4 dataset. Results show that the DLMs' RMSEs against the latter increase by 44%, 245%, 302%, and 109% for SST, SLA, current speed, and direction, respectively, compared to those against the former. Therefore, different references have significant influences on the validation, and it is necessary to use an identical and independent reference to intercompare the DLMs and OFSs. Against the Class 4 dataset, the DLMs present significantly better performance for SLA than the OFSs, and slightly better performances for other variables. The error patterns of the DLMs and OFSs show a high degree of similarity, which is reasonable from the viewpoint of predictability, facilitating further applications of the DLMs. For extreme events, the DLMs and OFSs both present large but similar forecast errors for SLA and current speed, while the DLMs are likely to give larger errors for SST and current direction. This study provides an evaluation of the forecast skills of commonly used DLMs and provides an example to objectively intercompare different DLMs.
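The percentage increases quoted above compare the RMSE of the same forecasts against two different references. A minimal sketch of that arithmetic follows; the helper names `rmse` and `relative_increase_percent` and the sample values are hypothetical, and the study's actual processing chain (gridding, matching to Class 4 observations) is not reproduced here.

```python
import math


def rmse(forecast, reference):
    """Root-mean-square error between paired forecast and reference values."""
    squared = [(f - r) ** 2 for f, r in zip(forecast, reference)]
    return math.sqrt(sum(squared) / len(squared))


def relative_increase_percent(rmse_baseline, rmse_other):
    """Percentage by which rmse_other exceeds rmse_baseline."""
    return 100.0 * (rmse_other - rmse_baseline) / rmse_baseline


# Hypothetical SST forecasts scored against two references (degrees Celsius)
forecast = [28.1, 28.4, 27.9, 28.6]
testing_reference = [28.0, 28.5, 28.0, 28.5]      # e.g. the held-out testing dataset
independent_reference = [27.8, 28.7, 28.2, 28.3]  # e.g. independent observations

rmse_test = rmse(forecast, testing_reference)
rmse_indep = rmse(forecast, independent_reference)
print(relative_increase_percent(rmse_test, rmse_indep))
```

Scoring every model against the same independent reference, as in the second call, is what makes the resulting RMSEs comparable between systems.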
Background: Electrocardiogram (ECG) analysis has emerged as a promising tool for detecting physiological changes linked to non-cardiac disorders. Given the close connection between cardiovascular and neurocognitive health, ECG abnormalities may be present in individuals with co-occurring neurocognitive conditions. This highlights the potential of ECG as a biomarker to improve detection, therapy monitoring and risk stratification in patients with neurocognitive disorders, an area that remains underexplored. Aims: We aimed to demonstrate the feasibility of predicting neurocognitive disorders from ECG features across diverse patient populations. Methods: ECG features and demographic data were used to predict neurocognitive disorders, as defined by the International Classification of Diseases 10th revision, focusing on dementia, delirium and Parkinson's disease. Internal and external validations were performed using the Medical Information Mart for Intensive Care IV and ECG-View datasets. Predictive performance was assessed by the area under the receiver operating characteristic curve (AUROC) scores, and Shapley values were used to interpret feature contributions. Results: Significant predictive performance was observed for several neurocognitive disorders. The highest predictive performance was observed for F03: dementia, with an internal AUROC of 0.848 (95% confidence interval (CI) 0.848 to 0.848) and an external AUROC of 0.865 (95% CI 0.864 to 0.965), followed by G30: Alzheimer's disease, with an internal AUROC of 0.809 (95% CI 0.808 to 0.810) and an external AUROC of 0.863 (95% CI 0.863 to 0.864). Feature importance analysis revealed both established and novel ECG correlates. Conclusions: These findings suggest that ECG holds promise as a non-invasive, explainable biomarker for selected neurocognitive disorders. This study demonstrates robust performance across cohorts and lays the groundwork for future clinical applications, including early detection and personalised monitoring.
Pressure-preserved coring technologies are critical for deep-earth resource exploration but are constrained by the inability to achieve multidirectional coring, restricting exploration range while escalating costs and environmental impacts. We developed a multidirectional pressure-preserved coring system based on magnetic control for deep-earth environments up to 5000 m. The system integrates a magnetically controlled method and key pressure-preserved components to ensure precise self-triggering and self-sealing. It is supported by geometric control equations for optimizing structural stability. The structure was verified and optimized through theoretical and numerical calculations to meet design objectives. To clarify the self-triggering mechanism in complex environments, a dynamic interference model was established, verifying stability during multidirectional coring. The prototype was fabricated, and functional tests confirmed that it met its design objectives. In a 300-meter-deep inclined test well, 10 coring operations were completed with a 100% pressure-preserved success rate, confirming the accuracy of the dynamic interference model analysis. Field trials in a 1970-meter-deep inclined petroleum well, representative of complex environments, demonstrated an in-situ pressure preservation efficiency of 92.18% at 22 MPa. This system innovatively expands the application scope of pressure-preserved coring, providing technical support for efficient and sustainable deep resources exploration and mining.
BACKGROUND: The rising global prevalence of gastroesophageal reflux disease (GERD) has been closely linked to lifestyle changes driven by globalization. GERD imposes a substantial public health burden, affecting quality of life and leading to potential complications. Early intervention through lifestyle modification can prevent disease onset; however, there is a lack of effective risk prediction models that emphasize primary prevention. AIM: To develop and validate a GERD Risk Scoring System (GRSS) aimed at identifying high-risk individuals and promoting primary prevention strategies. METHODS: A 45-item questionnaire encompassing major lifestyle and demographic risk factors was developed and validated. It was administered to healthy controls and GERD patients. Two regression models (one using continuous variables and another using categorized variables) were used to develop a computational prediction equation and a clinically applicable scoring scale. An independent validation cohort of 355 participants was used to assess model performance in terms of discrimination (C-index), calibration, sensitivity, specificity, internal consistency (Cronbach's alpha), and test-retest reliability (intraclass correlation coefficient, Bland-Altman analysis). RESULTS: Significant associations were observed between GERD and key lifestyle factors. The derived GRSS equation and scoring scale demonstrated strong discriminative ability, with high sensitivity and specificity. The scoring system exhibited excellent internal consistency (Cronbach's alpha) and strong test-retest reliability. The C-index indicated excellent predictive accuracy in both derivation and validation cohorts. CONCLUSION: GRSS offers a novel and validated approach to GERD risk prediction, combining a robust equation for digital applications and a practical scale for clinical use. Its ability to accurately identify at-risk individuals supports a paradigm shift toward primary prevention, underscoring its significance in addressing the growing burden of GERD at the population level.
Objective: To develop and evaluate an automated system for digitizing audiograms and classifying hearing loss levels, and to compare its performance with traditional methods and otolaryngologists' interpretations. Design and Methods: We conducted a retrospective diagnostic study using 1,959 audiogram images from patients aged 7 years and older at the Faculty of Medicine, Vajira Hospital, Navamindradhiraj University. We employed an object detection approach to digitize audiograms and developed multiple machine learning models to classify six hearing loss levels. The dataset was split into 70% training (1,407 images) and 30% testing (352 images) sets. We compared our model's performance with classifications based on manually extracted audiogram values and otolaryngologists' interpretations. Result: Our object detection-based model achieved an F1-score of 94.72% in classifying hearing loss levels, comparable to the 96.43% F1-score obtained using manually extracted values. The Light Gradient Boosting Machine (LGBM) model was used as the classifier for the manually extracted data and achieved top performance with 94.72% accuracy, 94.72% F1-score, 94.72% recall, and 94.72% precision. In the object detection-based model, the Random Forest Classifier (RFC) model showed the highest accuracy (96.43%) in predicting hearing loss level, with an F1-score of 96.43%, recall of 96.43%, and precision of 96.45%. Conclusion: Our proposed automated approach for audiogram digitization and hearing loss classification performs comparably to traditional methods and otolaryngologists' interpretations. This system can potentially assist otolaryngologists in providing more timely and effective treatment by quickly and accurately classifying hearing loss.
In order to investigate the alternate operation characteristics of a solar-ground source heat pump system (SGSHPS), various alternate operation modes are put forward and defined. A two-dimensional mathematical model with freezing/melting phase changes is developed for the heat transfer analysis of the soil. Based on the numerical solution of the model, the variation trends of underground soil temperature of the SGSHPS operated in various alternate operation modes are discussed. The results indicate that, for the day-night and short-time interval alternate operation modes without solar energy, the operation time fraction of a solar heat source should be confined to between 50% and 58% when operated in an alternate period of 24 h. Meanwhile, the disadvantages of a natural resumption of soil temperature can be overcome effectively by solar energy filling, and an optimal operation effect can be achieved by integrating the mode of solar energy filling with other alternate modes. In addition, the accuracy of the presented model is verified by the experimental data of borehole wall temperatures. The conclusions can provide a reference for the optimization operation of the SGSHPS.
This study explored the material basis and mechanisms of the anti-inflammatory effects of Hibiscus mutabilis L. The active ingredients and potential targets of Hibiscus mutabilis L. were obtained through a literature review and the SwissADME platform. Genes related to inflammation were collected using the Genecards and OMIM databases, and the intersection genes were submitted to the STRING and DAVID websites. Then, the protein interaction network (PPI), gene ontology (GO) and pathway (KEGG) were analyzed. Cytoscape 3.7.2 software was used to construct the “Hibiscus mutabilis L.-active ingredient-target-inflammation” network diagram, and AutoDockTools-1.5.6 software was used for the molecular docking verification. The anti-inflammatory effect of the active ingredients of Hibiscus mutabilis L. was verified in the RAW264.7 inflammatory cell model. The results showed that 11 active components and 94 potential targets, 1029 inflammatory targets and 24 intersection targets were obtained from Hibiscus mutabilis L. The key anti-inflammatory active ingredients of Hibiscus mutabilis L. are quercetin, apigenin and luteolin. Its action pathway is mainly related to NF-κB, the cancer pathway and the TNF signaling pathway. Cell experiments showed that total flavonoids of Hibiscus mutabilis L. could effectively inhibit the expression of tumor necrosis factor (TNF-α), interleukin 8 (IL-8) and epidermal growth factor receptor (EGFR) in LPS-induced RAW 264.7 inflammatory cells. They also downregulated the phosphorylation of human nuclear factor κB inhibitory protein α (IκBα) and NF-κB p65 subunit protein (p65). Overall, the anti-inflammatory effect of Hibiscus mutabilis L. is related to many active components, many signal pathways and targets, which provides a theoretical basis for its further development and application.
Background: Investigators from low-, middle-, and high-income countries representing 6 continents contributed to the development of the Global Adolescent and Child Physical Activity Questionnaire (GAC-PAQ). The GAC-PAQ is designed to assess physical activity (PA) across all key domains (i.e., school, chores, work/volunteering, transport, free time, outdoor time). It aimed to address multiple gaps in global PA surveillance (e.g., omission of important PA domains, insufficient cultural adaptation, underrepresentation of rural areas in questionnaire validation studies). The purpose of this study was to assess the content validity of the GAC-PAQ among PA experts, 8- to 17-year-olds, and one of their parents/guardians, and to discuss changes made to the questionnaire based on participants' feedback. Methods: Sixty-two experts in PA measurement and/or surveillance from 24 countries completed an online survey that included both closed- and open-ended questions about the content validity of the GAC-PAQ. The proportion of experts who agreed or strongly agreed with the items was calculated. Child-parent/guardian dyads from 15 countries (n = 250; 10-40 per country) participated in a structured cognitive interview to assess the clarity of the questions and response options, and they were encouraged to provide suggestions to improve clarity and facilitate completion of the questionnaire. The participating countries were Aotearoa New Zealand, Brazil, Canada, China, Colombia, Czech Republic, India, Malawi, Mexico, Nepal, Nigeria, Spain, Sweden, Thailand, and the United Arab Emirates. Interviews were conducted in 13 different languages and structured by PA domain. Generic images were included to help participants in answering questions about PA intensity. Results: Expert agreement with the items for each domain exceeded 75%, and their qualitative feedback was used to revise the questionnaire before cognitive interviews. In general, participants found the questionnaire to be comprehensive. Adolescents (12-17 years) found it easier than children (8-11 years) to answer the questions. Several children struggled to answer questions about the duration and intensity of activities and/or concepts related to travel modes, active trips, and organized activities. Many parents/guardians were unsure about the frequency, duration, and intensity of their children's or adolescents' PA at school and/or recommended using more culturally relevant and appropriate images. Some participants misunderstood the concept of activities that “make you stronger” (intended to assess resistance activities) and/or struggled to differentiate between work, volunteering, and chores. Conclusion: Participants' feedback was used to develop a revised, simplified, and culturally adapted GAC-PAQ, which will be pilot-tested in all 15 countries in an App that will include country-specific images and narration in local languages. Further research is needed to assess the reliability and validity of the revised GAC-PAQ.
This study examines the reliability and validity of AI-generated scoring for continuation writing tasks. By comparing GPT-4 with eight experienced human raters across 21 student responses, it evaluates AI's consistency, severity, and alignment with human scoring criteria. Results show that AI exhibits high self-consistency and adapts effectively to different scoring roles (e.g., teacher vs. high-stakes rater). However, AI scores were more lenient than those of human raters and reflected a different evaluation focus: AI prioritized narrative coherence and emotional depth, while teachers emphasized linguistic accuracy and richness of detail. The findings suggest AI's potential as a supplementary assessment tool, offering rapid, holistic feedback, but highlight the need for further calibration to align with educational standards. Implications include exploring hybrid evaluation models that leverage the strengths of both AI and human raters to achieve more equitable, efficient, and pedagogically meaningful writing assessments.
China launched its first spaceborne Precipitation Measurement Radar (PMR) on the FY-3G satellite in April 2023. To achieve the scientific goal of measuring the three-dimensional precipitation structure, evaluating the quantitative measurement ability of the PMR is critical. China operates more than 250 weather radars over the mainland. Consistency of the spaceborne radar with ground-based radars will enhance precipitation measurement ability, especially over oceans and mountains where observations are sparse. Additionally, the spaceborne radar can be used to evaluate the spatial and temporal homogeneity of the ground-based radar network. This paper focuses on comparing the PMR onboard the FY-3G satellite with S-band China New Generation Weather Radars (CINRADs). A comparison algorithm between the PMR and CINRADs has been developed, incorporating detailed quality control, attenuation correction, data optimization, spatiotemporal matching, non-uniform beam filling constraint, uniformity constraint, and frequency correction. The matched data in typical months of four seasons were selected to carry out the comparison. The data consistency between the PMR and CINRADs was analyzed. The correlation coefficient is 0.87, the deviation is 0.89 dB, and the standard deviation is 2.50 dB, based on 98,226 matching samples. The results show the radar reflectivity of the PMR is quite comparable to that of the CINRADs, demonstrating that the PMR data quality is satisfactory and can be used to verify and correct data consistency among multiple ground-based radars. This work also paves the way for data fusion and joint application of satellite and ground radars in the future.
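The three consistency figures above (correlation coefficient, deviation, standard deviation) can be illustrated on matched reflectivity pairs. The sketch below assumes that "deviation" means the mean spaceborne-minus-ground difference and that the standard deviation is taken over those differences; the abstract does not define either, so both are assumptions. The function names and sample values are hypothetical, and none of the matching, quality-control, or frequency-correction steps of the actual algorithm are reproduced.

```python
import math
from statistics import mean, stdev


def pearson_r(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def consistency_stats(spaceborne_dbz, ground_dbz):
    """Correlation, mean difference, and spread of differences (all in dB)."""
    diffs = [s - g for s, g in zip(spaceborne_dbz, ground_dbz)]
    return {
        "r": pearson_r(spaceborne_dbz, ground_dbz),
        "bias_db": mean(diffs),      # assumed meaning of "deviation"
        "std_db": stdev(diffs),      # sample standard deviation of differences
    }


# Hypothetical matched reflectivity samples in dBZ (illustrative only)
pmr = [22.5, 30.1, 27.4, 35.0, 18.9]
cinrad = [21.0, 29.8, 25.9, 34.6, 19.2]
print(consistency_stats(pmr, cinrad))
```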
Background: The World Health Organization Disability Assessment Schedule 2.0 (WHODAS 2.0) is a popular tool for evaluating functioning and disability in a range of population demographics and medical situations. However, very little is known about the WHODAS 2.0's validity and reliability, particularly when dealing with potentially life-threatening maternal conditions (PLTCs). The aim of this study was to evaluate the validity of the WHODAS 2.0 Tigrigna version. Methods: This cross-sectional study was conducted in Tigray, northern Ethiopia, from December 15 to 20, 2023. Following translation and back translation, women who had experienced PLTCs during a recent pregnancy, childbirth, or postpartum period were administered the 36-item WHODAS 2.0 in the Tigrigna version 6 months after childbirth. In total, 121 women with a history of PLTCs participated. Cronbach's α was used to evaluate internal consistency in all six WHODAS 2.0 domains, while Spearman's correlation coefficient was used to evaluate convergent validity. Construct validity was also examined with confirmatory factor analysis. Results: All domain scores of the Tigrigna version of the WHODAS 2.0 indicated excellent internal consistency (α = 0.917-0.978 for 36 items and α = 0.874-0.940 for 12 items), while the Cronbach's α coefficients for the summary score were 0.981 and 0.952 for 36 and 12 items, respectively. The convergent validity between the 36-item and 12-item WHODAS 2.0 showed a strong correlation between similar constructs (r = 0.909-0.981). Conclusion: Despite the small sample limitation, the WHODAS 2.0 tool adapted to the Tigrigna version indicated acceptable reliability and validity and therefore could be applied to women with a history of PLTCs at 6 months postpartum.
It remains difficult to automate the creation and validation of Unified Modeling Language (UML) diagrams due to unstructured requirements, limited automated pipelines, and the lack of reliable evaluation methods. This study introduces a cohesive architecture that amalgamates requirement development, UML synthesis, and multimodal validation. First, LLaMA-3.2-1B-Instruct was utilized to generate user-focused requirements. Then, DeepSeek-R1-Distill-Qwen-32B applied its reasoning skills to transform these requirements into PlantUML code. Using this dual-LLM pipeline, we constructed a synthetic dataset of 11,997 UML diagrams spanning six major diagram families. Rendering analysis showed that 89.5% of the generated diagrams compile correctly, while invalid cases were detected automatically. To assess quality, we employed a multimodal scoring method that combines Qwen2.5-VL-3B, LLaMA-3.2-11B-Vision-Instruct and Aya-Vision-8B, with weights based on MMMU performance. A study with 94 experts revealed strong alignment between automatic and manual evaluations, yielding a Pearson correlation of r = 0.82 and a Fleiss' Kappa of 0.78. This indicates a high degree of concordance between automated metrics and human judgment. Overall, the results demonstrated that our scoring system is effective and that the proposed generation pipeline produces UML diagrams that are both syntactically correct and semantically coherent. More broadly, the system provides a scalable and reproducible foundation for future work in AI-driven software modeling and multimodal verification.
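The abstract describes combining three vision-language judges with weights based on MMMU performance but does not give the weighting formula. One plausible reading, shown here purely as an assumption, is a weighted average in which each judge's weight is its benchmark score normalized to sum to one. The function name, the benchmark numbers, and the per-diagram scores below are all hypothetical placeholders, not values from the study.

```python
def weighted_ensemble_score(judge_scores, benchmark_scores):
    """Combine per-judge scores using weights proportional to benchmark results.

    judge_scores: mapping judge name -> score that judge gave one diagram
    benchmark_scores: mapping judge name -> benchmark result used as raw weight
    """
    total = sum(benchmark_scores[name] for name in judge_scores)
    return sum(
        judge_scores[name] * benchmark_scores[name] / total
        for name in judge_scores
    )


# Hypothetical benchmark results (placeholders, NOT real MMMU figures)
benchmark = {"judge_a": 50.0, "judge_b": 45.0, "judge_c": 40.0}

# Hypothetical quality scores (0-10) the three judges gave one rendered diagram
scores = {"judge_a": 8.0, "judge_b": 7.0, "judge_c": 9.0}

print(weighted_ensemble_score(scores, benchmark))
```

Normalizing the weights keeps the combined score on the same scale as the individual judges, which makes it directly comparable with human ratings when computing an agreement statistic such as Pearson's r.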
Accurate precipitation estimation in semiarid,topographically complicated areas is critical for water resource management and climate risk monitoring.This work provides a detailed,multi-scale evaluation of four major ...Accurate precipitation estimation in semiarid,topographically complicated areas is critical for water resource management and climate risk monitoring.This work provides a detailed,multi-scale evaluation of four major satellite precipitation products(CHIRPS,PERSIANN-CDR,IMERG-F v07,and GSMaP)over Isfahan province,Iran,over a 9-year period(2015-2023).The performance of these products was benchmarked against a dense network of 98 rain gauges using a suite of continuous and categorical statistical metrics,following a two-stage quality control protocol to remove outliers and false alarms.The results revealed that the performance of all products improves with temporal aggregation.At the daily level,GSMaP performed marginally better,although all products were linked with considerable uncertainty.At the monthly and annual levels,the GPM-era products(IMERG and GSMaP)clearly beat the other two,establishing themselves as dependable tools for long-term hydro-climatological studies.Error analysis revealed that topography is the dominant regulating factor,creating a systematic elevationdependent bias,largely characterized by underestimation from most products in high-elevation areas,though the PERSIANN-CDR product exhibited a contrasting overestimation tendency.Finally,the findings highlight the importance of implementing local,elevation-dependent calibration before deploying these products in hydrological modeling.展开更多
We present the first systematic experimental validation of return-current-driven cylindrical implosion scaling in micrometer-sized Cu and Al wires irradiated by J-class femtosecond laser pulses. Employing XFEL-based imaging with sub-micrometer spatial and femtosecond temporal resolution, supported by hydrodynamic and particle-in-cell simulations, we reveal how return current density depends precisely on wire diameter, material properties, and incident laser energy. We identify deviations from simple theoretical predictions due to geometrically influenced electron escape dynamics. These results refine and confirm the scaling laws essential for predictive modeling in high-energy-density physics and inertial fusion research.
Abstract: In their recent paper, Pereira et al. (2025) claim that validation is overlooked in mapping and modelling of ecosystem services (ES). They state that “many studies lack critical evaluation of the results and no validation is provided” and that “the validation step is largely overlooked”. This assertion may have been true several years ago, for example, when Ochoa and Urbina-Cardona (2017) made a similar observation. However, there has been much work on ES model validation over the last decade.
Abstract: Objective: This study aimed to examine the reliability and validity of the Chinese version of the Behavioral Inhibition System/Behavioral Activation System (BIS/BAS) scales among stroke survivors. Methods: The cross-sectional study was conducted at four comprehensive hospitals in Taizhou, Jiangsu, China. A sample of 232 first-ever stroke survivors was recruited from June to August 2023. Validity was examined using face validity and construct validity, the latter assessed by confirmatory factor analysis (CFA) and known-group analysis. Reliability was evaluated by internal consistency and test-retest reliability. Results: The BIS/BAS scales demonstrated satisfactory face validity. The CFA findings supported the original four-factor structure of BAS-reward, BAS-drive, BAS-fun seeking, and BIS with acceptable model fit indices. Discriminative validity, assessed via known-group analysis, indicated that stroke survivors with probable depression had significantly lower mean BAS-reward, BAS-drive, and BAS-fun seeking scores (P < 0.001) and a higher mean BIS score (P = 0.028) compared to those without probable depression. The internal consistency, measured by Cronbach’s α coefficients for the subscales, ranged from 0.669 to 0.964. Test-retest reliability, assessed using intra-class correlation coefficients, ranged from 0.61 to 0.93. Conclusions: The Chinese version of the BIS/BAS scales could be a reliable and valid instrument for measuring behavioral activation among stroke survivors.
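The internal-consistency statistic reported in the abstract above, Cronbach’s α, can be illustrated with a minimal sketch. This is not the authors’ analysis code: the function name `cronbach_alpha` and the input layout (one list of item scores per respondent) are assumptions made for illustration, and the standard formula α = k/(k−1) · (1 − Σ item variances / variance of total scores) is used with sample variances.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items score table.

    scores: list of rows, one row per respondent, each row holding
    that respondent's scores on the same k items (hypothetical layout).
    """
    n_respondents = len(scores)
    k = len(scores[0])
    if k < 2 or n_respondents < 2:
        raise ValueError("need at least 2 items and 2 respondents")

    # Sample variance of each item (column) across respondents.
    item_columns = list(zip(*scores))
    item_variance_sum = sum(variance(col) for col in item_columns)

    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    if total_variance == 0:
        raise ValueError("total scores have zero variance")

    return k / (k - 1) * (1 - item_variance_sum / total_variance)


if __name__ == "__main__":
    # Toy data: three respondents, two items (illustrative only).
    print(cronbach_alpha([[1, 2], [2, 1], [3, 3]]))
```

Values close to 1 indicate that the items vary together; the toy data here are illustrative only and unrelated to the study’s sample.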
Funding: Supported by the project “Monetary valuation of soil ecosystem services and creation of initiatives to invest in soil health: setting a framework for the inclusion of soil health in business and in the policy making process” (InBestSoil) (Horizon Europe), Grant agreement ID: 101091099.
Abstract: Ecosystem services (ES) mapping and models have advanced in recent years. Improvements were made, and the assessments have transitioned from qualitative to quantitative. Although this is an important advancement, the ES mapping and modelling validation step has been overlooked, and this raises an important question about the credibility of the outcomes. This has been an important and unsolved issue in the ES research community that needs to be tackled. This highlight paper discusses the importance of validating single ES mapping and models. It is important to conduct this validation using raw field or proximal/remote sensing data, and not data from other models or stakeholder evaluation. A validation step should be mandatory in ES frameworks since it can assess the models’ veracity, contribute to identifying the models’ weaknesses and strengths, and ultimately represent a scientific advance in the field. This is easier to apply to the biophysical mapping and models of regulating and provisioning ES than to cultural ES, as the latter rely more on perception and cultural contexts. Also, ES supply models are easier to validate than demand and flow models. Robust and well-grounded models are essential for ensuring the reliability of individual ES maps and models and should be integrated into decision-making processes. Although several challenges arise related to the costs of data collection, which are prohibitive in several cases, and to the time and expertise needed to conduct this sampling and analysis, this is likely an imperative step that needs to be considered in the future. This will be beneficial in establishing ES research and improving decision-making and wellbeing.
Abstract: Background: Sport climbing is becoming incredibly popular both in the general population and among athletes. No consensus exists regarding evidence-based sport-specific performance evaluation; therefore, this systematic review aimed to analyze determinants of sport climbing performance and evaluation methods by comparing climbers of different levels. Methods: PubMed, Scopus, and Web of Science were searched up to December 20, 2022. Studies providing the self-reported climbing ability associated with different functional outcomes in groups of climbers of contiguous performance levels were eligible. Results: 74 studies were finally included. Various methods have been proposed to evaluate determinants of sport climbing performance. Climbing-specific assessments were able to discriminate climbers of different levels when compared to general functional tests. Test validity was high for climbing-specific cardiorespiratory endurance as well as muscular strength, endurance, and power; similarly, reliability was good except for cardiorespiratory endurance. Climbing-specific flexibility assessment showed high reliability but moderate validity, whereas balance showed low validity. Considerable conflicting evidence was found regarding anthropometric characteristics. Conclusion: The present analysis identified cardiorespiratory endurance as well as muscular strength, endurance, and power as determinants of sport climbing performance. In contrast, balance, flexibility, and anthropometric characteristics seem to count less. This review also proposes an evidence-based Functional Sport Climbing test battery for assessing performance determinants, which includes tests that have been identified to be valid, reliable, and feasible. While athletes and coaches should rely on evidence-based and standardized evaluation methods, researchers may design specific large-scale trials as a resource for providing additional, homogeneous, and comparable data to improve scientific evidence and professionalism in this popular sport discipline.
Abstract: The nature of the pramāṇa system, outlined by Dignāga (c. 480-540 CE) and further elaborated upon by Dharmakīrti (c. 600-660 CE), is an important part of the Buddhist logico-epistemological tradition. Within this pramāṇa system, self-awareness (svasaṃvedana) is considered a hallmark of access to mental states and factors. However, key research often interprets the valid cognition system and self-awareness separately, without specifically describing their interrelationship. This paper argues that self-awareness is not merely a byproduct of valid cognition but is intimately connected to it. Specifically, I posit that self-awareness should be regarded as the ultimate result of valid cognition within the pramāṇa system.
Funding: The National Natural Science Foundation of China (No. 52278189); Zhejiang Provincial Natural Science Foundation of China (No. LY24E080002).
Abstract: A self-centering bridge bent equipped with energy-dissipation (ED) beams is proposed. Quasi-static tests are conducted on self-centering bridge bents, both with and without ED beams, to validate the accuracy of the corresponding numerical models. The effects of various parameters, such as the web area of ED beams, prestressing force of tendons, tendon arrangements, and number of column segments, on the seismic performance of self-centering bridge bents with ED beams are evaluated using the validated numerical model. The results demonstrate that the numerical models accurately replicate the quasi-static test results, with average errors in the lateral force remaining below 9.6%. The web area of ED beams significantly affects the strength, cumulative energy dissipation, and relative self-centering index (RSI) of the self-centering bridge bents. Increasing the prestressing force enhances the lateral force and self-centering capability of the bridge bents but has minimal effect on their ED capacity. Reducing the number of segments in each column enhances the lateral force and cumulative hysteretic energy dissipation of the self-centering bridge bents while exerting an insignificant effect on the RSI. Thus, the proposed novel system is highly suitable for double- or multicolumn piers supporting bridges in regions prone to strong earthquakes.
Funding: Supported by the National Natural Science Foundation of China (Grant Nos. 42375062 and 42275158); the National Key Scientific and Technological Infrastructure project “Earth System Science Numerical Simulator Facility” (EarthLab); the Natural Science Foundation of Gansu Province (Grant No. 22JR5RF1080).
Abstract: It is fundamental and useful to investigate how deep learning forecasting models (DLMs) perform compared to operational oceanography forecast systems (OFSs). However, few studies have intercompared their performances using an identical reference. In this study, three physically reasonable DLMs are implemented for the forecasting of the sea surface temperature (SST), sea level anomaly (SLA), and sea surface velocity in the South China Sea. The DLMs are validated against both the testing dataset and the “OceanPredict” Class 4 dataset. Results show that the DLMs’ RMSEs against the latter increase by 44%, 245%, 302%, and 109% for SST, SLA, current speed, and direction, respectively, compared to those against the former. Therefore, different references have significant influences on the validation, and it is necessary to use an identical and independent reference to intercompare the DLMs and OFSs. Against the Class 4 dataset, the DLMs present significantly better performance for SLA than the OFSs, and slightly better performances for other variables. The error patterns of the DLMs and OFSs show a high degree of similarity, which is reasonable from the viewpoint of predictability, facilitating further applications of the DLMs. For extreme events, the DLMs and OFSs both present large but similar forecast errors for SLA and current speed, while the DLMs are likely to give larger errors for SST and current direction. This study provides an evaluation of the forecast skills of commonly used DLMs and provides an example to objectively intercompare different DLMs.
Abstract: Background: Electrocardiogram (ECG) analysis has emerged as a promising tool for detecting physiological changes linked to non-cardiac disorders. Given the close connection between cardiovascular and neurocognitive health, ECG abnormalities may be present in individuals with co-occurring neurocognitive conditions. This highlights the potential of ECG as a biomarker to improve detection, therapy monitoring and risk stratification in patients with neurocognitive disorders, an area that remains underexplored. Aims: We aimed to demonstrate the feasibility of predicting neurocognitive disorders from ECG features across diverse patient populations. Methods: ECG features and demographic data were used to predict neurocognitive disorders, as defined by the International Classification of Diseases 10th revision, focusing on dementia, delirium and Parkinson's disease. Internal and external validations were performed using the Medical Information Mart for Intensive Care IV and ECG-View datasets. Predictive performance was assessed by the area under the receiver operating characteristic curve (AUROC) scores, and Shapley values were used to interpret feature contributions. Results: Significant predictive performance was observed for several neurocognitive disorders. The highest predictive performance was observed for F03: dementia, with an internal AUROC of 0.848 (95% confidence interval (CI) 0.848 to 0.848) and an external AUROC of 0.865 (95% CI 0.864 to 0.965), followed by G30: Alzheimer's disease, with an internal AUROC of 0.809 (95% CI 0.808 to 0.810) and an external AUROC of 0.863 (95% CI 0.863 to 0.864). Feature importance analysis revealed both established and novel ECG correlates. Conclusions: These findings suggest that ECG holds promise as a non-invasive, explainable biomarker for selected neurocognitive disorders. This study demonstrates robust performance across cohorts and lays the groundwork for future clinical applications, including early detection and personalised monitoring.
Funding: Supported by the National Key Research and Development Program of China (No. 2023YFF0615401); Joint Funds of the National Natural Science Foundation of China (No. U24A2087); Research Fund of State Key Laboratory of Geomechanics and Geotechnical Engineering, Institute of Rock and Soil Mechanics, Chinese Academy of Sciences (No. SKLGME022009); the National Natural Science Foundation of China (No. 42477191).
Abstract: Pressure-preserved coring technologies are critical for deep-earth resource exploration but are constrained by the inability to achieve multidirectional coring, restricting exploration range while escalating costs and environmental impacts. We developed a multidirectional pressure-preserved coring system based on magnetic control for deep-earth environments up to 5000 m. The system integrates a magnetically controlled method and key pressure-preserved components to ensure precise self-triggering and self-sealing. It is supported by geometric control equations for optimizing structural stability. The structure was verified and optimized through theoretical and numerical calculations to meet design objectives. To clarify the self-triggering mechanism in complex environments, a dynamic interference model was established, verifying stability during multidirectional coring. The prototype was fabricated, and functional tests confirmed that it met its design objectives. In a 300-meter-deep inclined test well, 10 coring operations were completed with a 100% pressure-preserved success rate, confirming the accuracy of the dynamic interference model analysis. Field trials in a 1970-meter-deep inclined petroleum well, representative of complex environments, demonstrated an in-situ pressure preservation efficiency of 92.18% at 22 MPa. This system innovatively expands the application scope of pressure-preserved coring, providing technical support for efficient and sustainable deep resource exploration and mining.
Abstract: BACKGROUND: The rising global prevalence of gastroesophageal reflux disease (GERD) has been closely linked to lifestyle changes driven by globalization. GERD imposes a substantial public health burden, affecting quality of life and leading to potential complications. Early intervention through lifestyle modification can prevent disease onset; however, there is a lack of effective risk prediction models that emphasize primary prevention. AIM: To develop and validate a GERD Risk Scoring System (GRSS) aimed at identifying high-risk individuals and promoting primary prevention strategies. METHODS: A 45-item questionnaire encompassing major lifestyle and demographic risk factors was developed and validated. It was administered to healthy controls and GERD patients. Two regression models (one using continuous variables and another using categorized variables) were used to develop a computational prediction equation and a clinically applicable scoring scale. An independent validation cohort of 355 participants was used to assess model performance in terms of discrimination (C-index), calibration, sensitivity, specificity, internal consistency (Cronbach's alpha), and test-retest reliability (intraclass correlation coefficient, Bland-Altman analysis). RESULTS: Significant associations were observed between GERD and key lifestyle factors. The derived GRSS equation and scoring scale demonstrated strong discriminative ability, with high sensitivity and specificity. The scoring system exhibited excellent internal consistency (Cronbach’s alpha) and strong test-retest reliability. The C-index indicated excellent predictive accuracy in both derivation and validation cohorts. CONCLUSION: GRSS offers a novel and validated approach to GERD risk prediction, combining a robust equation for digital applications and a practical scale for clinical use. Its ability to accurately identify at-risk individuals supports a paradigm shift toward primary prevention, underscoring its significance in addressing the growing burden of GERD at the population level.
Abstract: Objective: To develop and evaluate an automated system for digitizing audiograms and classifying hearing loss levels, and to compare its performance with traditional methods and otolaryngologists' interpretations. Design and Methods: We conducted a retrospective diagnostic study using 1,959 audiogram images from patients aged 7 years and older at the Faculty of Medicine, Vajira Hospital, Navamindradhiraj University. We employed an object detection approach to digitize audiograms and developed multiple machine learning models to classify six hearing loss levels. The dataset was split into 70% training (1,407 images) and 30% testing (352 images) sets. We compared our model's performance with classifications based on manually extracted audiogram values and otolaryngologists' interpretations. Results: Our object detection-based model achieved an F1-score of 94.72% in classifying hearing loss levels, comparable to the 96.43% F1-score obtained using manually extracted values. The Light Gradient Boosting Machine (LGBM) model was used as the classifier for the manually extracted data and achieved top performance with 94.72% accuracy, 94.72% F1-score, 94.72% recall, and 94.72% precision. In the object detection-based model, the Random Forest Classifier (RFC) model showed the highest accuracy in predicting hearing loss level, 96.43%, with an F1-score of 96.43%, recall of 96.43%, and precision of 96.45%. Conclusion: Our proposed automated approach for audiogram digitization and hearing loss classification performs comparably to traditional methods and otolaryngologists' interpretations. This system can potentially assist otolaryngologists in providing more timely and effective treatment by quickly and accurately classifying hearing loss.
Funding: The National Key Technology R&D Program of China during the 11th Five-Year Plan Period (No. 2008BAJ12B04); China Postdoctoral Science Foundation (No. 20090461050); the Project of Research and Development of Ministry of Housing and Urban-Rural Development of China (No. 2008-K1-26); the New Century Talent Project of Yangzhou University for Excellent Young Backbone Teacher (2008).
Abstract: In order to investigate the alternate operation characteristics of a solar-ground source heat pump system (SGSHPS), various alternate operation modes are put forward and defined. A two-dimensional mathematical model with freezing/melting phase changes is developed for the heat transfer analysis of the soil. Based on the numerical solution of the model, the variation trends of underground soil temperature of the SGSHPS operated in various alternate operation modes are discussed. The results indicate that, for the day-night and short-time interval alternate operation modes without solar energy, the operation time fraction of a solar heat source should be confined to between 50% and 58% when operated in an alternate period of 24 h. Meanwhile, the disadvantages of a natural resumption of soil temperature can be overcome effectively by solar energy filling, and an optimal operation effect can be achieved by integrating the mode of solar energy filling with other alternate modes. In addition, the accuracy of the presented model is verified by the experimental data of borehole wall temperatures. The conclusions can provide a reference for optimizing the operation of the SGSHPS.
Abstract: This study explores the material basis and mechanisms of the anti-inflammatory effects of Hibiscus mutabilis L. The active ingredients and potential targets of Hibiscus mutabilis L. were obtained through a literature review and the SwissADME platform. Genes related to inflammation were collected using the GeneCards and OMIM databases, and the intersection genes were submitted to the STRING and DAVID websites. Then, the protein-protein interaction (PPI) network, Gene Ontology (GO) terms, and KEGG pathways were analyzed. Cytoscape 3.7.2 software was used to construct the “Hibiscus mutabilis L.-active ingredient-target-inflammation” network diagram, and AutoDockTools-1.5.6 software was used for the molecular docking verification. The anti-inflammatory effect of the active ingredients of Hibiscus mutabilis L. was verified in the RAW264.7 inflammatory cell model. The results showed that 11 active components and 94 potential targets, 1029 inflammatory targets, and 24 intersection targets were obtained for Hibiscus mutabilis L. The key anti-inflammatory active ingredients of Hibiscus mutabilis L. are quercetin, apigenin, and luteolin. Its action is mainly related to the NF-κB pathway, cancer pathways, and the TNF signaling pathway. Cell experiments showed that total flavonoids of Hibiscus mutabilis L. could effectively inhibit the expression of tumor necrosis factor (TNF-α), interleukin 8 (IL-8), and epidermal growth factor receptor (EGFR) in LPS-induced RAW 264.7 inflammatory cells. They also downregulated the phosphorylation of human nuclear factor κB inhibitory protein α (IκBα) and NF-κB p65 subunit protein (p65). Overall, the anti-inflammatory effect of Hibiscus mutabilis L. involves multiple active components, signaling pathways, and targets, which provides a theoretical basis for its further development and application.
Funding: Supported by a Project Grant (Grant No. PJT183705) and an Early Career Investigator Prize (Grant No. ECP 184184) from the Canadian Institutes of Health Research; a Prentice Institute Research Affiliate Fund Grant from the Prentice Institute for Global Population and Economy (Grant No. G00004116); and a Te Herenga Waka Victoria University of Wellington Division of Science Health Engineering Architecture and Design Innovation Faculty Strategic Research Grant (Grant No. FSRG-SHEADI-10724). The Thailand Physical Activity Knowledge Development Centre (TPAK)/Thai Health Promotion Foundation provided funding for the cognitive interviews and pilot study in Thailand (Grant No. 66-P1-0473). The University Pablo de Olavide provided a scholarship for 2 undergraduate students working on the project (codes PPI2207 and PPI2308). In the Czech Republic, the study was supported by Palacky University IGA (Grant No. IGA_FTK_2023_017). The study was also supported by the Division of Intramural Research at the National Institute on Minority Health and Health Disparities of the National Institutes of Health and by the Key Project of the National Philosophy and Social Science Foundation of China (23&ZD197).
Abstract: Background: Investigators from low-, middle-, and high-income countries representing 6 continents contributed to the development of the Global Adolescent and Child Physical Activity Questionnaire (GAC-PAQ). The GAC-PAQ is designed to assess physical activity (PA) across all key domains (i.e., school, chores, work/volunteering, transport, free time, outdoor time). It aims to address multiple gaps in global PA surveillance (e.g., omission of important PA domains, insufficient cultural adaptation, underrepresentation of rural areas in questionnaire validation studies). The purpose of this study was to assess the content validity of the GAC-PAQ among PA experts, 8- to 17-year-olds, and one of their parents/guardians, and to discuss changes made to the questionnaire based on participants' feedback. Methods: Sixty-two experts in PA measurement and/or surveillance from 24 countries completed an online survey that included both closed- and open-ended questions about the content validity of the GAC-PAQ. The proportion of experts who agreed or strongly agreed with the items was calculated. Child-parent/guardian dyads from 15 countries (n = 250; 10-40 per country) participated in a structured cognitive interview to assess the clarity of the questions and response options, and they were encouraged to provide suggestions to improve clarity and facilitate completion of the questionnaire. The participating countries were Aotearoa New Zealand, Brazil, Canada, China, Colombia, Czech Republic, India, Malawi, Mexico, Nepal, Nigeria, Spain, Sweden, Thailand, and the United Arab Emirates. Interviews were conducted in 13 different languages and structured by PA domain. Generic images were included to help participants answer questions about PA intensity. Results: Expert agreement with the items for each domain exceeded 75%, and their qualitative feedback was used to revise the questionnaire before cognitive interviews. In general, participants found the questionnaire to be comprehensive. Adolescents (12-17 years) found it easier than children (8-11 years) to answer the questions. Several children struggled to answer questions about the duration and intensity of activities and/or concepts related to travel modes, active trips, and organized activities. Many parents/guardians were unsure about the frequency, duration, and intensity of their children's or adolescents' PA at school and/or recommended using more culturally relevant and appropriate images. Some participants misunderstood the concept of activities that “make you stronger” (intended to assess resistance activities) and/or struggled to differentiate between work, volunteering, and chores. Conclusion: Participants' feedback was used to develop a revised, simplified, and culturally adapted GAC-PAQ, which will be pilot-tested in all 15 countries in an App that will include country-specific images and narration in local languages. Further research is needed to assess the reliability and validity of the revised GAC-PAQ.
Abstract: This study examines the reliability and validity of AI-generated scoring for continuation writing tasks. By comparing GPT-4 with eight experienced human raters across 21 student responses, it evaluates AI’s consistency, severity, and alignment with human scoring criteria. Results show that AI exhibits high self-consistency and adapts effectively to different scoring roles (e.g., teacher vs. high-stakes rater). However, AI scores were more lenient than those of human raters and reflected a different evaluation focus: AI prioritized narrative coherence and emotional depth, while teachers emphasized linguistic accuracy and richness of detail. The findings suggest AI’s potential as a supplementary assessment tool, offering rapid, holistic feedback, but highlight the need for further calibration to align with educational standards. Implications include exploring hybrid evaluation models that leverage the strengths of both AI and human raters to achieve more equitable, efficient, and pedagogically meaningful writing assessments.
Funding: Jointly supported by the National Natural Science Foundation of China (Grant U2442214); the China Meteorological Administration Youth Innovation Team (Grant No. CMA2024QN10); the National Defense Science and Technology Bureau’s 14th Five-Year Civil Aerospace Preresearch Project (Grant Nos. D030303 and D040204); the International Space Water Cycle Observation Constellation Program (Grant No. 183311KYSB20200015).
Abstract: China launched its first spaceborne Precipitation Measurement Radar (PMR) on the FY-3G satellite in April 2023. To achieve the scientific goal of measuring the three-dimensional precipitation structure, evaluating the quantitative measurement ability of the PMR is critical. China operates more than 250 weather radars over the mainland. Consistency of the spaceborne radar with ground-based radars will enhance precipitation measurement ability, especially over oceans and mountains where observations are sparse. Additionally, the spaceborne radar can be used to evaluate the spatial and temporal homogeneity of the ground-based radar network. This paper focuses on comparing the PMR onboard the FY-3G satellite with S-band China New Generation Weather Radars (CINRADs). A comparison algorithm between the PMR and CINRADs has been developed, incorporating detailed quality control, attenuation correction, data optimization, spatiotemporal matching, non-uniform beam filling constraint, uniformity constraint, and frequency correction. The matched data in typical months of four seasons were selected to carry out the comparison. The data consistency between the PMR and CINRADs was analyzed. The correlation coefficient is 0.87, the deviation is 0.89 dB, and the standard deviation is 2.50 dB, based on 98,226 matching samples. The results show the radar reflectivity of the PMR is quite comparable to that of the CINRADs, demonstrating that the PMR data quality is satisfactory and can be used to verify and correct data consistency among multiple ground-based radars. This work also paves the way for data fusion and joint application of satellite and ground radars in the future.
Abstract: Background: The World Health Organization Disability Assessment Schedule 2.0 (WHODAS 2.0) is a popular tool for evaluating functioning and disability in a range of population demographics and medical situations. However, very little is known about the WHODAS 2.0's validity and reliability, particularly when dealing with potentially life-threatening maternal conditions (PLTCs). The aim of this study was to evaluate the validity of the WHODAS 2.0 Tigrigna version. Methods: This cross-sectional study was conducted in Tigray, northern Ethiopia, from December 15 to 20, 2023. Following translation and back translation, women who had experienced PLTCs during a recent pregnancy, childbirth, or postpartum period were administered the 36-item WHODAS 2.0 in the Tigrigna version 6 months after childbirth. In total, 121 women with a history of PLTCs participated. Cronbach's α was used to evaluate internal consistency in all six WHODAS 2.0 domains, while Spearman's correlation coefficient was used to evaluate convergent validity. Construct validity was also examined with confirmatory factor analysis. Results: All domain scores of the Tigrigna version of the WHODAS 2.0 indicated excellent internal consistency (α = 0.917-0.978 for 36 items and α = 0.874-0.940 for 12 items), while the Cronbach's α coefficients for the summary score were 0.981 and 0.952 for 36 and 12 items, respectively. The convergent validity between the 36-item and 12-item WHODAS 2.0 showed a strong correlation between similar constructs (r = 0.909-0.981). Conclusion: Despite the small sample limitation, the WHODAS 2.0 tool adapted to the Tigrigna version indicated acceptable reliability and validity and therefore could be applied to women with a history of PLTCs at 6 months postpartum.
Funding: Supported by the DH2025-TN07-07 project conducted at the Thai Nguyen University of Information and Communication Technology, Thai Nguyen, Vietnam, with additional support from the AI in Software Engineering Lab.
Abstract: It remains difficult to automate the creation and validation of Unified Modeling Language (UML) diagrams due to unstructured requirements, limited automated pipelines, and the lack of reliable evaluation methods. This study introduces a cohesive architecture that combines requirement development, UML synthesis, and multimodal validation. First, LLaMA-3.2-1B-Instruct was used to generate user-focused requirements. Then, DeepSeek-R1-Distill-Qwen-32B applied its reasoning skills to transform these requirements into PlantUML code. Using this dual-LLM pipeline, we constructed a synthetic dataset of 11,997 UML diagrams spanning six major diagram families. Rendering analysis showed that 89.5% of the generated diagrams compile correctly, while invalid cases were detected automatically. To assess quality, we employed a multimodal scoring method that combines Qwen2.5-VL-3B, LLaMA-3.2-11B-Vision-Instruct, and Aya-Vision-8B, with weights based on MMMU performance. A study with 94 experts revealed strong alignment between automatic and manual evaluations, yielding a Pearson correlation of r = 0.82 and a Fleiss' Kappa of 0.78. This indicates a high degree of concordance between automated metrics and human judgment. Overall, the results demonstrated that our scoring system is effective and that the proposed generation pipeline produces UML diagrams that are both syntactically correct and semantically coherent. More broadly, the system provides a scalable and reproducible foundation for future work in AI-driven software modeling and multimodal verification.
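The abstract above states that the three vision-language judges are combined with weights based on MMMU performance but does not give the weights or the combination rule. The Python sketch below shows one plausible reading, a weighted average with weights proportional to each judge's benchmark score. The function names and the idea of proportional weighting are assumptions, not the authors' published method, and any benchmark values passed in are supplied by the caller.

```python
def normalized_weights(benchmark_scores):
    """Turn per-model benchmark scores into weights that sum to 1.

    Assumption: weights are proportional to the benchmark score; the paper's
    actual weighting rule is not specified in the abstract.
    """
    total = sum(benchmark_scores.values())
    if total <= 0:
        raise ValueError("benchmark scores must sum to a positive value")
    return {model: score / total for model, score in benchmark_scores.items()}


def ensemble_score(model_scores, weights):
    """Weighted average of the per-model quality scores for one diagram."""
    if set(model_scores) != set(weights):
        raise ValueError("model_scores and weights must cover the same models")
    return sum(weights[m] * model_scores[m] for m in model_scores)
```

A caller would compute the weights once from the judges' benchmark results and then apply `ensemble_score` to each diagram's per-judge scores.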
Abstract: Accurate precipitation estimation in semiarid, topographically complex areas is critical for water resource management and climate risk monitoring. This work provides a detailed, multi-scale evaluation of four major satellite precipitation products (CHIRPS, PERSIANN-CDR, IMERG-F v07, and GSMaP) over Isfahan province, Iran, over a 9-year period (2015-2023). The performance of these products was benchmarked against a dense network of 98 rain gauges using a suite of continuous and categorical statistical metrics, following a two-stage quality control protocol to remove outliers and false alarms. The results revealed that the performance of all products improves with temporal aggregation. At the daily level, GSMaP performed marginally better, although all products were associated with considerable uncertainty. At the monthly and annual levels, the GPM-era products (IMERG and GSMaP) clearly outperformed the other two, establishing themselves as dependable tools for long-term hydro-climatological studies. Error analysis revealed that topography is the dominant regulating factor, creating a systematic elevation-dependent bias, largely characterized by underestimation from most products in high-elevation areas, though the PERSIANN-CDR product exhibited a contrasting overestimation tendency. Finally, the findings highlight the importance of implementing local, elevation-dependent calibration before deploying these products in hydrological modeling.
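The abstract above mentions categorical metrics without naming them. As an illustration only, the Python sketch below computes three metrics that are widely used in satellite precipitation validation: probability of detection (POD), false alarm ratio (FAR), and critical success index (CSI). Whether the study used exactly these metrics is an assumption, as are the function name `categorical_metrics` and the rain/no-rain threshold of 1.0 mm per day.

```python
def categorical_metrics(gauge_mm, satellite_mm, threshold=1.0):
    """Return (POD, FAR, CSI) for paired daily gauge and satellite totals.

    Assumption: a day counts as rainy when precipitation >= threshold (mm);
    the 1.0 mm default is illustrative, not taken from the study.
    """
    if len(gauge_mm) != len(satellite_mm):
        raise ValueError("gauge and satellite series must have equal length")

    hits = misses = false_alarms = 0
    for g, s in zip(gauge_mm, satellite_mm):
        observed, estimated = g >= threshold, s >= threshold
        if observed and estimated:
            hits += 1
        elif observed:
            misses += 1
        elif estimated:
            false_alarms += 1

    def ratio(num, den):
        # Undefined ratios (no events in the denominator) are reported as NaN.
        return num / den if den else float("nan")

    pod = ratio(hits, hits + misses)
    far = ratio(false_alarms, hits + false_alarms)
    csi = ratio(hits, hits + misses + false_alarms)
    return pod, far, csi
```

A perfect product gives POD = 1, FAR = 0, and CSI = 1; days on which both gauge and satellite are dry do not enter any of the three ratios.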
Funding: Partially supported by the Center for Advanced Systems Understanding (CASUS), financed by Germany's Federal Ministry of Education and Research (BMBF) and the Saxon State Government out of the State Budget approved by the Saxon State Parliament; funding from the European Union's Just Transition Fund (JTF) within the project Röntgenlaser-Optimierung der Laserfusion (ROLF), Contract No. 5086999001, co-financed by the Saxon State Government out of the State Budget approved by the Saxon State Parliament.
Abstract: We present the first systematic experimental validation of return-current-driven cylindrical implosion scaling in micrometer-sized Cu and Al wires irradiated by J-class femtosecond laser pulses. Employing XFEL-based imaging with sub-micrometer spatial and femtosecond temporal resolution, supported by hydrodynamic and particle-in-cell simulations, we reveal how return current density depends precisely on wire diameter, material properties, and incident laser energy. We identify deviations from simple theoretical predictions due to geometrically influenced electron escape dynamics. These results refine and confirm the scaling laws essential for predictive modeling in high-energy-density physics and inertial fusion research.