Traditional grade-centered evaluation models are inadequate for cultivating high-quality software engineering talent in the digital and AI era. This study develops an academic development monitoring system to address shortcomings in dynamics, interdisciplinary integration, and industry adaptability. It builds a multi-dimensional dynamic model covering seven core dimensions with quantitative scoring, non-linear weighting, and DivClust grouping. An intelligent platform with real-time monitoring, early warning, and personalized recommendations integrates AI techniques such as multi-modal fusion and large-model diagnosis. The “monitoring-warning-improvement” loop helps optimize training programs, support personalized planning, and bridge talent-industry gaps, enabling digital transformation in software engineering education evaluation.
In their recent paper, Pereira et al. (2025) claim that validation is overlooked in the mapping and modelling of ecosystem services (ES). They state that “many studies lack critical evaluation of the results and no validation is provided” and that “the validation step is largely overlooked”. This assertion may have been true several years ago, for example, when Ochoa and Urbina-Cardona (2017) made a similar observation. However, there has been much work on ES model validation over the last decade.
BACKGROUND: Timely and accurate evaluation of mental disorders in adolescents using appropriate mental health literacy assessment tools is essential for improving their mental health literacy levels. AIM: To develop an evaluation index system for the mental health literacy of adolescent patients with mental disorders, providing a scientific, comprehensive, and reliable tool for the monitoring and intervention of mental health literacy of such patients. METHODS: From December 2022 to June 2023, the evaluation index system for mental health literacy of adolescents with mental disorders was developed through literature reviews, semi-structured interviews, expert letter consultations, and the analytic hierarchy process. Based on this index system, a self-assessment questionnaire was compiled and administered to 305 adolescents with mental disorders to test the reliability and validity of the index system. RESULTS: The final evaluation index system for mental health literacy of adolescents with mental disorders included 4 first-level indicators, 10 second-level indicators, and 52 third-level indicators. The overall Cronbach’s α coefficient of the index system was 0.957, with a partial reliability of 0.826 and a content validity index of 0.975. The cumulative variance contribution rate of 10 common factors was 66.491%. The correlation coefficients between each dimension and the total questionnaire ranged from 0.672 to 0.724, while the correlation coefficients in each dimension ranged from 0.389 to 0.705. CONCLUSION: The evaluation index system for mental health literacy of adolescents with mental disorders, developed in this study, demonstrated notable reliability and validity, making it a valuable tool for evaluating mental health literacy in this population.
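For readers unfamiliar with the reliability statistic reported in the abstract above, the following is a minimal sketch of how Cronbach's α is conventionally computed from a respondents-by-items score matrix. It is not the authors' analysis code: the function name `cronbach_alpha` and the toy scores are illustrative assumptions, and the study's actual questionnaire data are not reproduced here.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    Uses the standard formula
        alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores))
    with sample variances (n - 1 denominator).
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("need at least two items")
    # Transpose so that each tuple holds one item's scores across respondents.
    items = list(zip(*scores))
    item_var_sum = sum(variance(col) for col in items)
    # Variance of each respondent's total score across all items.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_var_sum / total_var)


# Hypothetical toy data: four respondents, two perfectly consistent items.
toy_scores = [[1, 1], [2, 2], [3, 3], [4, 4]]
alpha = cronbach_alpha(toy_scores)
```

With perfectly consistent items the statistic approaches its upper bound of 1; real questionnaires, such as the one summarized above, fall below that.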
The exponential growth of video content has driven significant advancements in video summarization techniques in recent years. Breakthroughs in deep learning have been particularly transformative, enabling more effective detection of key information and creating new possibilities for video synopsis. To summarize recent progress and accelerate research in this field, this paper provides a comprehensive review of deep learning-based video summarization methods developed over the past decade. We begin by examining the research landscape of video abstraction technologies and identifying core challenges in video summarization. Subsequently, we systematically analyze prevailing deep learning frameworks and methodologies employed in current video summarization systems, offering researchers a clear roadmap of the field's evolution. Unlike previous review works, we first classify research papers based on the structural hierarchy of the video (from frame-level to shot-level to video-level), then further categorize them according to the summary backbone model (feature extraction and spatiotemporal modeling). This approach provides a more systematic and hierarchical organization of the literature. Following this comprehensive review, we summarize the benchmark datasets and evaluation metrics commonly employed in the field. Finally, we analyze persistent challenges and propose directions for future research, providing a forward-looking perspective on video summarization technologies. This systematic literature review is of great reference value to new researchers exploring the fields of deep learning and video summarization.
The incidence of benign airway stenosis (BAS) is on the rise, and current treatment options are associated with a significant risk of restenosis. Therefore, there is an urgent need to explore new and effective prevention and treatment methods. Animal models serve as essential tools for investigating disease mechanisms and assessing novel therapeutic strategies, and the scientific rigor of their construction and validation significantly impacts the reliability of research findings. This paper systematically reviews the research progress and evaluation systems of BAS animal models over the past decade, aiming to provide a robust foundation for the optimized construction of BAS models, intervention studies, and clinical translation. This effort is intended to facilitate innovation and advancement in BAS prevention and treatment strategies.
Large Language Models (LLMs) are increasingly applied in the field of code translation. However, existing evaluation methodologies suffer from two major limitations: (1) the high overlap between test data and pretraining corpora, which introduces significant bias in performance evaluation; and (2) mainstream metrics focus primarily on surface-level accuracy, failing to uncover the underlying factors that constrain model capabilities. To address these issues, this paper presents TCode (Translation-Oriented Code Evaluation benchmark), a complexity-controllable, contamination-free benchmark dataset for code translation, alongside a dedicated static feature sensitivity evaluation framework. The dataset is carefully designed to control complexity along multiple dimensions, including syntactic nesting and expression intricacy, enabling both broad coverage and fine-grained differentiation of sample difficulty. This design supports precise evaluation of model capabilities across a wide spectrum of translation challenges. The proposed evaluation framework introduces a correlation-driven analysis mechanism based on static program features, enabling predictive modeling of translation success from two perspectives: Code Form Complexity (e.g., code length and character density) and Semantic Modeling Complexity (e.g., syntactic depth, control-flow nesting, and type system complexity). Empirical evaluations across representative LLMs, including Qwen2.5-72B and Llama3.3-70B, demonstrate that although state-of-the-art models achieve over 80% compilation success on simple samples, their accuracy drops sharply to below 40% on complex cases. Further correlation analysis indicates that Semantic Modeling Complexity alone is correlated with up to 60% of the variance in translation success, with static program features exhibiting nonlinear threshold effects that highlight clear capability boundaries. This study departs from the traditional accuracy-centric evaluation paradigm and, for the first time, systematically characterizes the capabilities of large language models in translation tasks through the lens of static program features. The findings provide actionable insights for model refinement and training strategy development.
The construction of spot electricity markets plays a pivotal role in power system reforms, where market clearing systems profoundly influence market efficiency and security. Current clearing systems predominantly adopt a single-system architecture, with research focusing primarily on accelerating solution algorithms through techniques such as high-efficiency parallel solvers and staggered decomposition of mixed-integer programming models. Notably absent are systematic studies evaluating the adaptability of primary-backup clearing systems in contingency scenarios, a critical gap given the expanding application of redundant systems in operational environments. This paper proposes a comprehensive evaluation framework for analyzing dual-system adaptability, demonstrated through an in-depth case study of the Inner Mongolia power market. First, we establish the “Dual-Active Heterogeneous” architecture, which enables independent parallelized operation and fault-isolated redundancy. Subsequently, key performance indices are quantitatively evaluated across four critical dimensions: unit commitment decisions, generator output constraints, transmission section congestion patterns, and clearing price formation mechanisms. An integrated fuzzy evaluation methodology incorporating grey relational analysis is employed for objective indicator weighting, enabling systematic quantification of system superiority under specific grid operating states. Empirical results based on actual operational data from 200 generation units demonstrate the framework’s efficacy in guiding optimal system selection, with particularly strong performance observed during peak load periods. The proposed approach shows high generalization potential for other regional markets employing redundant clearing mechanisms, particularly those with increasing renewable penetration and associated uncertainty.
Online Public Opinion Reports consolidate news and social media for timely crisis management by governments and enterprises. While large language models (LLMs) enable automated report generation, this specific domain lacks formal task definitions and corresponding benchmarks. To bridge this gap, we define the Automated Online Public Opinion Report Generation (OPOR-Gen) task and construct OPOR-Bench, an event-centric dataset with 463 crisis events across 108 countries (comprising 8.8K news articles and 185K tweets). To evaluate report quality, we propose OPOR-Eval, a novel agent-based framework that simulates human expert evaluation. Validation experiments show OPOR-Eval achieves a high Spearman’s correlation (ρ = 0.70) with human judgments, though challenges in temporal reasoning persist. This work establishes an initial foundation for advancing automated public opinion reporting research.
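The agreement statistic cited in the abstract above, Spearman's ρ, is the Pearson correlation of the two rank vectors. The sketch below shows one conventional way to compute it, using average ranks for ties. It is illustrative only: the function names and the toy score lists are assumptions, and it does not reproduce OPOR-Eval or its validation data.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks, assigning tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman_rho(x, y):
    """Spearman's rank correlation: Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical toy scores: an automatic evaluator versus human judges.
auto_scores = [3.0, 4.5, 2.0, 5.0]
human_scores = [2.5, 4.0, 1.0, 4.8]
rho = spearman_rho(auto_scores, human_scores)
```

Because only ranks matter, ρ rewards an evaluator for ordering reports the same way human judges do, even when the absolute scores differ.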
In the present study, researchers examined a solar off-grid photovoltaic system for a family house in the city of Baghdad. The design was created with the help of the “How to Design PV Program” and the “Renewable Energy Investment Calculator (REICAL)” software (Version 1.1). In Iraq, the national grid provides around 71% of the overall electricity demand, though this drops to nearly 50% during extremely hot and cold months, when the supply alternates between four hours on and four hours off. During the off periods, power is generated by local generators at high costs. To promote the adoption of photovoltaic solar systems among Iraqi citizens through loans, three options for meeting 100% of electricity needs have been proposed: an on-grid solution, a hybrid system that supplies 24 h, and an off-grid solution for a 24-h supply. The 12-h off-grid system (hybrid) is both economical and efficient for delivering electricity. Findings reveal that, over 20 years, the system’s output will amount to 141,176.71 kWh, with a payback period of 5.85 years and a performance ratio of 86.2%. Investment outcome data showed a net present value of $6445 and a profitability index of 6.16, indicating the project’s profitability. Additionally, the system could result in a net reduction of CO₂ emissions totaling 132,810.24 kg.
Based on the generalized reduced R-matrix theory, the R-matrix analysis code (RAC program) was used to analyze the experimental data of all the nuclear reaction channels related to the ⁵He system. The current calculations provide accurate and reliable evaluation data and are in good agreement with the experimental data. In this study, self-consistent evaluation data for each reaction were obtained using multi-channel and multi-energy fitting. In particular, the error propagation theory of generalized least squares was used to determine the error of the evaluation data and the covariance matrix of the integral cross section. This R-matrix analysis for the ⁵He system has three features. First, for the first time, the error in the evaluation data of the T(d,n)⁴He reaction cross section and the covariance matrix of the integral cross section are provided. Second, we used only one set of R-matrix parameters to describe the reaction cross section of each reaction channel of the ⁵He system over the entire energy region. Third, in this evaluation, we considered some of the latest measured experimental data, especially those obtained after 2000. The T(d,n)⁴He reaction cross section at 0.1 MeV and below was carefully studied. The effect of different energy levels in T(d,n)⁴He was analyzed, with the 3/2⁺ energy level making a major contribution to the cross section, and the S-wave and P-wave from the 3/2⁻ level determining the forward-leaning trend of the angular distributions at 0.01–0.1 MeV.
In this paper, we present an analytical method for evaluating the stress field within a casing-cement-formation system of oil/gas wells under anisotropic in-situ stresses in the rock formation and uniform pressure within the casing. The present method treats the in-situ stresses in the formation as initial stresses since the in-situ stresses have already developed in the formation before placement of cement and casing into the well. It is demonstrated that, via this treatment, the present method excludes additional displacements within the formation predicted by the existing method, and gives more reasonable stress results. An actual tight-oil well is analyzed using the present and existing analytical methods, as well as the finite element method. Good agreement between the analytical results and the finite element analysis (FEA) results is obtained, validating the present method. It is also evident that, compared with the present method, the existing method overestimates the compressive stress level within the casing and the cement. Finally, the effects of elastic properties of the formation, cement, and inner pressure of casing on stresses within the casing and cement are illustrated with a series of sensitivity analyses.
Indicator systems of environmental sustainable development in the Poyang Lake Basin are established from 51 elementary indexes by factor analysis, which comprises four steps: the factor model, parameter estimation, factor rotation, and factor scoring. Under the condition that the cumulative proportion is greater than 85%, five explicit factors of environmental sustainable development and their factor scores by region are obtained. The results indicate that the factors affecting the basin environment are, in descending order of impact: volume of water; volume of waste gas discharge; volume of solid wastes; degree of comprehensive utilization of waste gas, waste water, and solid wastes; and emission volume of waste gas, waste water, and solid wastes. These results provide decision support for formulating sustainable development strategies and for evaluating the sustainable development status of each city.
The Integrated Marine Observing System (IMOS) is an Australian national program for observing the oceans around Australia. As one of its important nodes, the New South Wales Integrated Marine Observing System (NSW-IMOS) aims to provide more accurate descriptions of the East Australian Current (EAC). The purpose of this paper is to evaluate the potential economic benefits from NSW-IMOS. Six related sectors that can potentially be among its main beneficiaries are considered: beach recreation, commercial fishing, recreational fishing, recreational boating, natural hazard predictions, and oil spill mitigation. The 1% constant percentage increase evaluation method is used to estimate the potential economic benefits to these six beneficiaries. Using this method, our study estimates the total potential economic benefit for these sectors at $6.07 million per year. We consider that this is indicative but not conclusive in demonstrating some of the potential economic benefits that can be provided from information gathered by NSW-IMOS facilities. We conclude with further evaluative approaches that could be used to provide more accurate estimates of potential economic benefits.
The recent emergence of adaptive language learning systems calls for conceptual work to guide the design of assessment and learning in an adaptive environment. Although adaptive learning might have been touted as a universal cure for learning problems, many adaptive language learning systems fall short of educators’ expectations, partly due to a lack of standards and best practices in this area. To fill this gap, this paper proposes some major considerations in designing a high-quality assessment and learning experience in adaptive learning and ways to evaluate an adaptive learning system. The architecture of adaptive learning is decomposed, with a chain of inferences supporting the overall efficacy of an adaptive learning system presented, including user property representation, user property estimation, content representation, user interaction representation, and user interaction impact. A detailed analysis of key validity issues is provided for each inference, which motivates the major considerations in designing and evaluating assessment and learning. The paper first provides an overview of different types of assessment used in adaptive learning and an analysis of the assessment approach, priorities, and design considerations of each to optimize its use in adaptive learning. Then it proposes a framework for evaluating different aspects of an adaptive learning system. Some special connections are made to models, techniques, designs, and technologies specific to language learning and assessment, bringing more relevance to adaptive language learning solutions. Through establishing some guidelines on key aspects to evaluate and how to evaluate them, the work intends to bring more rigor to the field of adaptive language learning systems.
Ecological municipal solid waste (MSW) treatment systems are complex engineered systems involving multiple objectives and hierarchical levels. By combining an extension method with fuzzy logic theory, this paper investigated key technologies required for the comprehensive evaluation of ecological health. The method includes the construction of an evaluation system, quantification of evaluation indices, development of a matter-element model, development of an extension evaluation method, and assignment of a blended weight that combines subjectively and objectively estimated weights. This approach was used to develop a comprehensive model for evaluating the ecological health of an ecological treatment system for MSW. The model was then applied to a case study, and the results demonstrated that the model is reasonable and effective.
The impact trend of the rock-coal system was studied in terms of the accumulation and release of deformation energy and the interactions within the rock-coal system. A system model of roof-coal-floor was established. Using RFPA, a rock fracture process analysis system, numerical tests of the deformation, fracture, and energy transmission of the nonlinear, nonhomogeneous rock-coal system were carried out, and a numerical test and evaluation method for the impact trend of the rock-coal system was developed. When the same coal seam is under different roof and floor conditions, the fracture process of the rock-coal system can be classified into three kinds (gradual, sudden, and delayed fracture), and the impact trend can correspondingly be classified as void, intense, and medium. The rock-coal system's impact trend is evaluated by the system impact index μ and the burst expanding forms. The criteria are μ < 1.0, 1.0 ≤ μ < 1.5, and μ ≥ 1.5 when the impact trend is void, intense, or medium, and they were tested and verified on the No. 2 and No. 4 coal seams in the Suncun mine.
With the growing application of welding technology in important structures, more attention is being paid to evaluating the safety of welded structures, predicting their service life, and deciding when to repair them. Based on the material fracture mechanism and the Chinese standard for safety evaluation of pressure vessels, an expert system was developed to evaluate the safety of welded pressure vessels. The system can analyze the weld defects in a pressure vessel, convert different kinds of defects into equivalent cracks, and obtain their equivalent sizes. Furthermore, the system can calculate the stress and strain at the positions of weld defects and decide whether the defects are tolerable according to the code. When a defect is tolerable, the system calculates the safety margin. The fatigue life can also be predicted if the defects are subjected to fatigue loading. Moreover, databases are built for storing the mechanical properties of materials and the evaluation results.
A computer-aided decision support system, the merger and acquisition analyzing and evaluating decision support system (MAAE-DSS), was proposed for analyzing and evaluating corporate merger and acquisition (M&A) strategies. Strategic management tools such as the scale index-market growth rate matrix (S-M matrix), industrial attraction-corporate strength matrix (I-S matrix), market growth rate-market occupancy matrix (G-O matrix), and life cycle-competitive position matrix (L-C matrix) were applied in the MAAE-DSS, which has its own database (DB), model base (MB), method base (MeB), and knowledge base (KB), to support management in formulating M&A strategies.
Evaluating the capacity of lecturers is key to improving the quality of education in higher education institutions. At Assosa University, Ethiopia, lecturers' capacity has been evaluated using different parameters. Lecturers are mandatorily evaluated using a printed checklist. Over the last few years, we observed that lecturer efficiency scores sit on the shelf and are not checked by anyone to identify gaps or to establish a follow-up system. An intranet-based web system is a better way to use lecturers' efficiency results to monitor their performance and to establish a follow-up mechanism. In this study, a prototype web-based yearbook efficiency management system for evaluating and monitoring the performance of lecturers was designed and developed. The evaluation process in the system follows the university evaluation format. The output generated by the proposed system can be used by lecturers, HoD, HRM, and academic managers to monitor teaching performance.
Funding (software engineering academic development monitoring study): Supported by the Research Funding Project for Graduate Education and Teaching Reform of Beijing University of Posts and Telecommunications (No. 2024Y036), the Postgraduate Education and Teaching Reform Research Fund Project of Beijing University of Posts and Telecommunications (No. 2024Z007), and the Postgraduate Education and Teaching Reform Project of Beijing University of Posts and Telecommunications (2025).
Funding (adolescent mental health literacy index study): Supported by the Inter Disciplinary Direction Cultivation Project of Hunan University of Chinese Medicine, No. 2025JC0103; the 2025 Hunan Province Science and Technology Innovation Plan Project, No. 2025RC9012; the 2022 "Unveiling and Leading" Project of Discipline Construction at Hunan University of Chinese Medicine, No. 22JBZ044; the Changsha Municipal Natural Science Foundation, No. kq2402174; and the Hunan Provincial Science Popularization Fund Project, No. 2025ZK4223.
Funding (video summarization review): Supported by UKRI (EP/Z000025/1) and the Horizon Europe Programme under the MSCA grant for the ACMod project (101130271).
Funding: National Natural Science Foundation of China, Grant/Award Numbers: 82000102 and 82270112.
Abstract: The incidence of benign airway stenosis (BAS) is on the rise, and current treatment options are associated with a significant risk of restenosis. Therefore, there is an urgent need to explore new and effective prevention and treatment methods. Animal models serve as essential tools for investigating disease mechanisms and assessing novel therapeutic strategies, and the scientific rigor of their construction and validation significantly impacts the reliability of research findings. This paper systematically reviews the research progress and evaluation systems of BAS animal models over the past decade, aiming to provide a robust foundation for the optimized construction of BAS models, intervention studies, and clinical translation. This effort is intended to facilitate innovation and advancement in BAS prevention and treatment strategies.
Abstract: Large Language Models (LLMs) are increasingly applied in the field of code translation. However, existing evaluation methodologies suffer from two major limitations: (1) the high overlap between test data and pretraining corpora, which introduces significant bias in performance evaluation; and (2) mainstream metrics focus primarily on surface-level accuracy, failing to uncover the underlying factors that constrain model capabilities. To address these issues, this paper presents TCode (Translation-Oriented Code Evaluation benchmark), a complexity-controllable, contamination-free benchmark dataset for code translation, alongside a dedicated static feature sensitivity evaluation framework. The dataset is carefully designed to control complexity along multiple dimensions, including syntactic nesting and expression intricacy, enabling both broad coverage and fine-grained differentiation of sample difficulty. This design supports precise evaluation of model capabilities across a wide spectrum of translation challenges. The proposed evaluation framework introduces a correlation-driven analysis mechanism based on static program features, enabling predictive modeling of translation success from two perspectives: Code Form Complexity (e.g., code length and character density) and Semantic Modeling Complexity (e.g., syntactic depth, control-flow nesting, and type system complexity). Empirical evaluations across representative LLMs (including Qwen2.5-72B and Llama3.3-70B) demonstrate that state-of-the-art models achieve over 80% compilation success on simple samples, but their accuracy drops sharply to below 40% on complex cases. Further correlation analysis indicates that Semantic Modeling Complexity alone is correlated with up to 60% of the variance in translation success, with static program features exhibiting nonlinear threshold effects that highlight clear capability boundaries. This study departs from the traditional accuracy-centric evaluation paradigm and, for the first time, systematically characterizes the capabilities of large language models in translation tasks through the lens of program static features. The findings provide actionable insights for model refinement and training strategy development.
Funding: Supported by NARI Relays Electric Co., Ltd. under the project "Research on Evaluation of Clearing Results and Switching Criteria for Primary-Backup Systems in Electricity Spot Markets" (Project No. CGSQ240800443).
Abstract: The construction of spot electricity markets plays a pivotal role in power system reforms, where market clearing systems profoundly influence market efficiency and security. Current clearing systems predominantly adopt a single-system architecture, with research focusing primarily on accelerating solution algorithms through techniques such as high-efficiency parallel solvers and staggered decomposition of mixed-integer programming models. Notably absent are systematic studies evaluating the adaptability of primary-backup clearing systems in contingency scenarios, a critical gap given the expanding applications of redundant systems in operational environments. This paper proposes a comprehensive evaluation framework for analyzing dual-system adaptability, demonstrated through an in-depth case study of the Inner Mongolia power market. First, we establish the "Dual-Active Heterogeneous" architecture, which enables independent parallelized operation and fault-isolated redundancy. Subsequently, key performance indices are quantitatively evaluated across four critical dimensions: unit commitment decisions, generator output constraints, transmission section congestion patterns, and clearing price formation mechanisms. An integrated fuzzy evaluation methodology incorporating grey relational analysis is employed for objective indicator weighting, enabling systematic quantification of system superiority under specific grid operating states. Empirical results based on actual operational data from 200 generation units demonstrate the framework's efficacy in guiding optimal system selection, with particularly strong performance observed during peak load periods. The proposed approach shows high generalization potential for other regional markets employing redundant clearing mechanisms, particularly those with increasing renewable penetration and associated uncertainty.
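The grey relational analysis mentioned above can be illustrated with a minimal sketch. It is not the paper's implementation: the indicator scores, the system names, the equal default weights, and the distinguishing coefficient rho = 0.5 are all assumptions chosen for illustration. Each candidate system is compared with an ideal reference sequence, and a higher grade means the candidate is closer to that reference.

```python
def grey_relational_grades(reference, candidates, rho=0.5, weights=None):
    """Return a grey relational grade per candidate (higher = closer to reference).

    reference  : list of normalized indicator values for the ideal system
    candidates : dict mapping a system name to its normalized indicator values
    rho        : distinguishing coefficient (0.5 is a common default, assumed here)
    weights    : optional indicator weights summing to 1 (equal weights if omitted)
    """
    n = len(reference)
    if any(len(values) != n for values in candidates.values()):
        raise ValueError("every candidate needs one value per indicator")
    if weights is None:
        weights = [1.0 / n] * n

    # Absolute deviation of each candidate from the reference, per indicator.
    deltas = {
        name: [abs(r - v) for r, v in zip(reference, values)]
        for name, values in candidates.items()
    }
    flat = [d for row in deltas.values() for d in row]
    d_min, d_max = min(flat), max(flat)
    if d_max == 0:
        # Every candidate coincides with the reference.
        return {name: 1.0 for name in candidates}

    grades = {}
    for name, row in deltas.items():
        # Grey relational coefficient for each indicator.
        coeffs = [(d_min + rho * d_max) / (d + rho * d_max) for d in row]
        grades[name] = sum(w * c for w, c in zip(weights, coeffs))
    return grades


# Hypothetical normalized scores on four indicators (commitment, output,
# congestion, price); the values are invented for illustration only.
ideal = [1.0, 1.0, 1.0, 1.0]
systems = {
    "primary": [0.9, 0.8, 1.0, 0.7],
    "backup": [0.6, 1.0, 0.7, 0.9],
}
grades = grey_relational_grades(ideal, systems)
```

In a real study the indicator values would first be normalized to a common scale, and the weights would come from the chosen weighting scheme instead of the equal weights assumed here.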
Funding: Supported by the Fundamental Research Funds for the Central Universities (No. CUC25SG013) and the Foundation of Key Laboratory of Education Informatization for Nationalities (Yunnan Normal University), Ministry of Education (No. EIN2024C006).
Abstract: Online Public Opinion Reports consolidate news and social media for timely crisis management by governments and enterprises. While large language models (LLMs) enable automated report generation, this specific domain lacks formal task definitions and corresponding benchmarks. To bridge this gap, we define the Automated Online Public Opinion Report Generation (OPOR-Gen) task and construct OPOR-Bench, an event-centric dataset with 463 crisis events across 108 countries (comprising 8.8K news articles and 185K tweets). To evaluate report quality, we propose OPOR-Eval, a novel agent-based framework that simulates human expert evaluation. Validation experiments show that OPOR-Eval achieves a high Spearman's correlation (ρ = 0.70) with human judgments, though challenges in temporal reasoning persist. This work establishes an initial foundation for advancing automated public opinion reporting research.
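For readers unfamiliar with the agreement metric reported above, the following is a minimal sketch of how Spearman's ρ between automated and human scores can be computed. The score lists are hypothetical and do not come from OPOR-Bench; the function names are illustrative, and the paper's own evaluation code may differ.

```python
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho, computed as the Pearson correlation of the rank vectors."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = sum((a - mx) ** 2 for a in rx) ** 0.5
    sy = sum((b - my) ** 2 for b in ry) ** 0.5
    if sx == 0 or sy == 0:
        raise ValueError("rho is undefined when one sequence is constant")
    return cov / (sx * sy)


# Hypothetical report-quality scores from an automated judge and a human expert.
agent_scores = [4.0, 3.5, 2.0, 5.0, 3.0]
human_scores = [4.5, 3.0, 2.5, 4.0, 3.5]
rho = spearman_rho(agent_scores, human_scores)
```

Because ρ depends only on rank order, it rewards an automated judge for ordering reports the way human experts do, even when the two use different score scales.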
Abstract: This study examines an off-grid solar photovoltaic system for a family house in the city of Baghdad. The design was created with the help of the "How to Design PV Program" and the "Renewable Energy Investment Calculator (REICAL)" software (Version 1.1). In Iraq, the national grid provides around 71% of the overall electricity demand, though this drops to nearly 50% during extremely hot and cold months, when the supply alternates between four hours on and four hours off. During the off periods, power is generated by local generators at high cost. To promote the adoption of photovoltaic solar systems among Iraqi citizens through loans, three options for meeting 100% of electricity needs have been proposed: an on-grid solution, a hybrid system that supplies 24 h, and an off-grid solution for a 24-h supply. The 12-h off-grid system (hybrid) is both economical and efficient for delivering electricity. Findings reveal that, over 20 years, the system's output will amount to 141,176.71 kWh, with a payback period of 5.85 years and a performance ratio of 86.2%. Investment outcome data showed a net present value of $6445 and a profitability index of 6.16, indicating the project's profitability. Additionally, the system could result in a net reduction of CO₂ emissions totaling 132,810.24 kg.
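The investment indicators quoted above (net present value, profitability index, payback period) follow standard definitions, sketched below. This is not the REICAL calculation: the cash flows and discount rate are hypothetical, and the profitability index is assumed to be the present value of future cash flows divided by the initial cost, which may differ from the definition the authors used.

```python
def present_value(annual_cash_flows, discount_rate):
    """Discount end-of-year cash flows (years 1..n) back to year 0."""
    return sum(
        cf / (1 + discount_rate) ** year
        for year, cf in enumerate(annual_cash_flows, start=1)
    )


def net_present_value(initial_cost, annual_cash_flows, discount_rate):
    """NPV = present value of future cash flows minus the initial cost."""
    return present_value(annual_cash_flows, discount_rate) - initial_cost


def profitability_index(initial_cost, annual_cash_flows, discount_rate):
    """PI = present value of future cash flows / initial cost (assumed definition)."""
    if initial_cost <= 0:
        raise ValueError("initial_cost must be positive")
    return present_value(annual_cash_flows, discount_rate) / initial_cost


def simple_payback_years(initial_cost, annual_cash_flows):
    """Undiscounted payback time, interpolated linearly within the final year.

    Returns None if the cost is never recovered.
    """
    if initial_cost <= 0:
        raise ValueError("initial_cost must be positive")
    recovered = 0.0
    for year, cf in enumerate(annual_cash_flows, start=1):
        if recovered + cf >= initial_cost:
            return (year - 1) + (initial_cost - recovered) / cf
        recovered += cf
    return None


# Hypothetical project: 1000 currency units up front, 400 saved per year for 5 years.
cost = 1000.0
savings = [400.0] * 5
npv = net_present_value(cost, savings, discount_rate=0.05)
pi = profitability_index(cost, savings, discount_rate=0.05)
payback = simple_payback_years(cost, savings)
```

Under these definitions a project is worthwhile when the NPV is positive, which is equivalent to a profitability index above 1.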
Funding: Supported by the Science Challenge Project (No. TZ20180001).
Abstract: Based on the generalized reduced R-matrix theory, the R-matrix analysis code (RAC program) was used to analyze the experimental data of all nuclear reaction channels related to the ⁵He system. The current calculations provide accurate and reliable evaluation data and are in good agreement with the experimental data. In this study, self-consistent evaluation data for each reaction were obtained using multi-channel and multi-energy fitting. In particular, the error propagation theory of generalized least squares was used to determine the error of the evaluation data and the covariance matrix of the integral cross section. This R-matrix analysis of the ⁵He system has three features. First, the error in the evaluation data of the T(d,n)⁴He reaction cross section and the covariance matrix of the integral cross section are provided for the first time. Second, only one set of R-matrix parameters is used to describe the reaction cross section of each reaction channel of the ⁵He system over the entire energy region. Third, the evaluation takes into account some of the latest measured experimental data, especially those obtained after 2000. The T(d,n)⁴He reaction cross section at 0.1 MeV and below was carefully studied. The effect of different energy levels in T(d,n)⁴He was analyzed: the 3/2⁺ energy level makes the major contribution to the cross section, while the S-wave and P-wave from 3/2⁻ determine the forward-leaning trend of the angular distributions at 0.01–0.1 MeV.
Funding: Supported by the National Natural Science Foundation of China (Nos. 11502304 and 51521063) and the Science Foundation of China University of Petroleum (Nos. C201601 and 2462013YJRC023).
Abstract: In this paper, we present an analytical method for evaluating the stress field within a casing-cement-formation system of oil/gas wells under anisotropic in-situ stresses in the rock formation and uniform pressure within the casing. The present method treats the in-situ stresses in the formation as initial stresses, since they have already developed in the formation before the cement and casing are placed in the well. It is demonstrated that, via this treatment, the present method excludes the additional displacements within the formation predicted by the existing method and gives more reasonable stress results. An actual tight-oil well is analyzed using the present and existing analytical methods, as well as the finite element method. Good agreement between the analytical results and the finite element analysis (FEA) results is obtained, validating the present method. It is also evident that, compared with the present method, the existing method overestimates the compressive stress level within the casing and the cement. Finally, the effects of the elastic properties of the formation and cement and of the inner pressure of the casing on the stresses within the casing and cement are illustrated with a series of sensitivity analyses.
Abstract: Indicator systems of environmentally sustainable development in the Poyang Lake Basin are established from 51 elementary indexes by factor analysis, which consists of four steps: the factor model, parameter estimation, factor rotation, and factor scoring. Under the condition that the cumulative proportion is greater than 85%, five explicit factors of environmentally sustainable development and their factor scores by region are obtained. The results indicate that the factors affecting the basin environment are, in descending order: volume of water; volume of waste gas discharge; volume of solid wastes; the degree of comprehensive utilization of waste gas, waste water, and solid wastes; and the emission volume of waste gas, waste water, and solid wastes. The results provide useful decision support for formulating sustainable development strategies and for evaluating the sustainable development status of each city.
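The 85% cumulative-proportion criterion described above can be sketched as follows. The eigenvalues are hypothetical and are not the Poyang Lake Basin results; the function name is illustrative, and the sketch covers only the retention rule, not factor estimation or rotation.

```python
def factors_needed(eigenvalues, threshold=0.85):
    """Return (k, proportion): the smallest number of factors whose cumulative
    share of total variance is strictly greater than the threshold."""
    ordered = sorted(eigenvalues, reverse=True)
    total = sum(ordered)
    if total <= 0:
        raise ValueError("eigenvalues must have a positive sum")
    cumulative = 0.0
    for count, value in enumerate(ordered, start=1):
        cumulative += value
        if cumulative / total > threshold:
            return count, cumulative / total
    # Threshold never exceeded (e.g. threshold >= 1): keep every factor.
    return len(ordered), cumulative / total


# Hypothetical eigenvalues of a correlation matrix, largest first.
example_eigenvalues = [5.0, 3.0, 1.0, 0.5, 0.5]
k, share = factors_needed(example_eigenvalues)
```

With these invented eigenvalues the first two factors explain 80% of the variance, so a third factor is needed to pass the 85% threshold.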
Abstract: The Integrated Marine Observing System (IMOS) is an Australian national program for observing the oceans around Australia. As one of its important nodes, the New South Wales Integrated Marine Observing System (NSW-IMOS) aims to provide more accurate descriptions of the East Australian Current (EAC). The purpose of this paper is to evaluate the potential economic benefits of NSW-IMOS. Six related sectors that could be among its main beneficiaries are considered: beach recreation, commercial fishing, recreational fishing, recreational boating, natural hazard predictions, and oil spill mitigation. The 1% constant percentage increase evaluation method is used to estimate the potential economic benefits to these six beneficiaries. Using this method, our study estimates the total potential economic benefit for these sectors at $6.07 million per year. We consider this indicative but not conclusive in demonstrating some of the potential economic benefits that can be derived from information gathered by NSW-IMOS facilities. We conclude with further evaluative approaches that could be used to provide more accurate estimates of potential economic benefits.
Abstract: The recent emergence of adaptive language learning systems calls for conceptual work to guide the design of assessment and learning in an adaptive environment. Although adaptive learning might have been touted as a universal cure for learning problems, many adaptive language learning systems fall short of educators' expectations, partly due to a lack of standards and best practices in this area. To fill this gap, this paper proposes some major considerations in designing a high-quality assessment and learning experience in adaptive learning and ways to evaluate an adaptive learning system. The architecture of adaptive learning is decomposed, and a chain of inferences supporting the overall efficacy of an adaptive learning system is presented, including user property representation, user property estimation, content representation, user interaction representation, and user interaction impact. A detailed analysis of key validity issues is provided for each inference, which motivates the major considerations in designing and evaluating assessment and learning. The paper first provides an overview of the different types of assessment used in adaptive learning and an analysis of the assessment approach, priorities, and design considerations of each, to optimize its use in adaptive learning. It then proposes a framework for evaluating different aspects of an adaptive learning system. Some special connections are made to models, techniques, designs, and technologies specific to language learning and assessment, bringing more relevance to adaptive language learning solutions. By establishing guidelines on which key aspects to evaluate and how to evaluate them, the work intends to bring more rigor to the field of adaptive language learning systems.
Abstract: Ecological municipal solid waste (MSW) treatment systems are complex engineering systems involving multiple objectives and hierarchical levels. By combining an extension method with fuzzy logic theory, this paper investigates the key technologies required for the comprehensive evaluation of ecological health. The method includes the construction of an evaluation system, quantification of evaluation indices, development of a matter-element model, development of an extension evaluation method, and assignment of a blended weight that combines subjectively and objectively estimated weights. This approach was used to develop a comprehensive model for evaluating the ecological health of an ecological treatment system for MSW. The model was then applied to a case study, and the results demonstrated that the model is reasonable and effective.
Funding: Supported by the Ministry of Education Backbone Teachers Funded Projects.
Abstract: The impact trend of the rock-coal system was studied through the accumulation and release of deformation energy and the interaction within the rock-coal system. A roof-coal-floor system model was established. Using the RFPA (rock fracture process analysis) software, numerical tests of the deformation, fracture, and energy transmission of the nonlinear and nonhomogeneous rock-coal system were carried out, and a numerical test and evaluation method for the impact trend of the rock-coal system was obtained. When the same coal seam is under different roof and floor conditions, the fracture process of the rock-coal system can be classified into three kinds (gradual, sudden, and delayed fracture), and the corresponding impact trends are void, intense, and medium. The impact trend of the rock-coal system is evaluated by the system impact index μ and the burst expanding forms. The criteria are μ < 1.0, 1.0 ≤ μ < 1.5, and μ ≥ 1.5 when the impact trend is void, intense, or medium, respectively, as tested and verified on the No. 2 and No. 4 coal seams in the Suncun mine.
Funding: Supported by the China Postdoctoral Science Foundation (No. 20080430129) and the National Key Technology R&D Program (No. 2007BAE07B07).
Abstract: As welding technology is applied more widely in important structures, more attention is being paid to evaluating the safety of welded structures, predicting their life, and deciding whether to repair them. Based on material fracture mechanisms and the Chinese standard for the safety evaluation of pressure vessels, an expert system was developed to evaluate the safety of welded pressure vessels. The system can analyze the weld defects in a pressure vessel, convert different kinds of defects into equivalent cracks, and obtain their equivalent sizes. Furthermore, the system can calculate the stress and strain at the positions of weld defects and decide whether the defects are tolerable according to the code. When a defect is tolerable, the system calculates the safety margin. The fatigue life can also be predicted if the defects are subject to fatigue loading. Moreover, databases are built for storing the mechanical properties of materials and the evaluation results.
Abstract: A computer-aided decision support system for analyzing and evaluating corporate merger and acquisition (M&A) strategies, the merger and acquisition analyzing and evaluating decision support system (MAAE-DSS), was proposed. Strategic management tools such as the scale index-market growth rate matrix (S-M matrix), industrial attraction-corporate strength matrix (I-S matrix), market growth rate-market occupancy matrix (G-O matrix), and life cycle-competitive position matrix (L-C matrix) were applied in the MAAE-DSS, which has its own database (DB), model base (MB), method base (MeB), and knowledge base (KB), in order to support the management bureau in formulating M&A strategies.
Abstract: Evaluating lecturers' capacity is key to improving the quality of education in higher education institutions. At Assosa University, Ethiopia, lecturers' capacity has been evaluated using different parameters, and evaluation with a printed checklist is mandatory. Over the last few years, we observed that lecturer efficiency scores are left on the shelf and are not checked by anyone to identify gaps or to establish a follow-up system. An intranet-based web system is a better way to use lecturers' efficiency results to monitor their performance and to establish a follow-up mechanism. In this study, a prototype of a web-based yearbook efficiency management system for evaluating and monitoring the performance of lecturers was designed and developed. The evaluation process in the system follows the university's evaluation format. The output generated by the proposed system can be used by lecturers, heads of department (HoD), human resource management (HRM), and academic managers to monitor teaching performance.