The incidence of benign airway stenosis(BAS)is on the rise,and current treatment options are associated with a significant risk of restenosis.Therefore,there is an urgent need to explore new and effective prevention a...The incidence of benign airway stenosis(BAS)is on the rise,and current treatment options are associated with a significant risk of restenosis.Therefore,there is an urgent need to explore new and effective prevention and treatment methods.Animal models serve as essential tools for investigating disease mechanisms and assessing novel therapeutic strategies,and the scientific rigor of their construction and validation significantly impacts the reliability of research findings.This paper systematically reviews the research progress and evaluation systems of BAS animal models over the past decade,aiming to provide a robust foundation for the optimized construction of BAS models,intervention studies,and clinical translation.This effort is intended to facilitate the innovation and advancement in BAS prevention and treatment strategies.展开更多
Online Public Opinion Reports consolidate news and social media for timely crisis management by governments and enterprises.While large language models(LLMs)enable automated report generation,this specific domain lack...Online Public Opinion Reports consolidate news and social media for timely crisis management by governments and enterprises.While large language models(LLMs)enable automated report generation,this specific domain lacks formal task definitions and corresponding benchmarks.To bridge this gap,we define the Automated Online Public Opinion Report Generation(OPOR-Gen)task and construct OPOR-Bench,an event-centric dataset with 463 crisis events across 108 countries(comprising 8.8 K news articles and 185 K tweets).To evaluate report quality,we propose OPOR-Eval,a novel agent-based framework that simulates human expert evaluation.Validation experiments show OPOR-Eval achieves a high Spearman’s correlation(ρ=0.70)with human judgments,though challenges in temporal reasoning persist.This work establishes an initial foundation for advancing automated public opinion reporting research.展开更多
LargeLanguageModels(LLMs)are increasingly appliedinthe fieldof code translation.However,existing evaluation methodologies suffer from two major limitations:(1)the high overlap between test data and pretraining corpora...LargeLanguageModels(LLMs)are increasingly appliedinthe fieldof code translation.However,existing evaluation methodologies suffer from two major limitations:(1)the high overlap between test data and pretraining corpora,which introduces significant bias in performance evaluation;and(2)mainstream metrics focus primarily on surface-level accuracy,failing to uncover the underlying factors that constrain model capabilities.To address these issues,this paper presents TCode(Translation-Oriented Code Evaluation benchmark)—a complexity-controllable,contamination-free benchmark dataset for code translation—alongside a dedicated static feature sensitivity evaluation framework.The dataset is carefully designed to control complexity along multiple dimensions—including syntactic nesting and expression intricacy—enabling both broad coverage and fine-grained differentiation of sample difficulty.This design supports precise evaluation of model capabilities across a wide spectrum of translation challenges.The proposed evaluation framework introduces a correlation-driven analysis mechanism based on static program features,enabling predictive modeling of translation success from two perspectives:Code Form Complexity(e.g.,code length and character density)and Semantic Modeling Complexity(e.g.,syntactic depth,control-flow nesting,and type system complexity).Empirical evaluations across representative LLMs—including Qwen2.5-72B and Llama3.3-70B—demonstrate that even state-of-the-art models achieve over 80% compilation success on simple samples,but their accuracy drops sharply below 40% on complex cases.Further correlation analysis indicates that Semantic Modeling Complexity alone is correlated with up to 60% of the variance in translation success,with static program features exhibiting nonlinear threshold effects that highlight clear capability boundaries.This study departs fromthe traditional accuracy-centric evaluation paradigm and,for the first time,systematically characterizes the capabilities of large languagemodels in translation tasks through the lens of programstatic features.The findings provide actionable insights for model refinement and training strategy development.展开更多
Although previous studies have demonstrated that transcranial focused ultrasound stimulation protects the ischemic brain,clear criteria for the stimulation time window and intensity are lacking.Electrical impedance to...Although previous studies have demonstrated that transcranial focused ultrasound stimulation protects the ischemic brain,clear criteria for the stimulation time window and intensity are lacking.Electrical impedance tomography enables real-time monitoring of changes in cerebral blood perfusion within the ischemic brain,but investigating the feasibility of using this method to assess post-stroke rehabilitation in vivo remains critical.In this study,ischemic stroke was induced in rats through middle cerebral artery occlusion surgery.Transcranial focused ultrasound stimulation was used to treat the rat model of ischemia,and electrical impedance tomography was used to measure impedance during both the acute stage of ischemia and the rehabilitation stage following the stimulation.Electrical impedance tomography results indicated that cerebral impedance increased after the onset of ischemia and decreased following transcranial focused ultrasound stimulation.Furthermore,the stimulation promoted motor function recovery,reduced cerebral infarction volume in the rat model of ischemic stroke,and induced the expression of brain-derived neurotrophic factor in the ischemic brain.Our results also revealed a significant correlation between the impedance of the ischemic brain post-intervention and improvements in behavioral scores and infarct volume.This study shows that daily administration of transcranial focused ultrasound stimulation for 20 minutes to the ischemic hemisphere 24 hours after cerebral ischemia enhanced motor recovery in a rat model of ischemia.Additionally,our findings indicate that electrical impedance tomography can serve as a valuable tool for quantitatively evaluating rehabilitation after ischemic stroke in vivo.These findings suggest the feasibility of using impedance data collected via electrical impedance tomography to clinically assess the effects of rehabilitatory interventions for patients with ischemic stroke.展开更多
We propose an integrated method of data-driven and mechanism models for well logging formation evaluation,explicitly focusing on predicting reservoir parameters,such as porosity and water saturation.Accurately interpr...We propose an integrated method of data-driven and mechanism models for well logging formation evaluation,explicitly focusing on predicting reservoir parameters,such as porosity and water saturation.Accurately interpreting these parameters is crucial for effectively exploring and developing oil and gas.However,with the increasing complexity of geological conditions in this industry,there is a growing demand for improved accuracy in reservoir parameter prediction,leading to higher costs associated with manual interpretation.The conventional logging interpretation methods rely on empirical relationships between logging data and reservoir parameters,which suffer from low interpretation efficiency,intense subjectivity,and suitability for ideal conditions.The application of artificial intelligence in the interpretation of logging data provides a new solution to the problems existing in traditional methods.It is expected to improve the accuracy and efficiency of the interpretation.If large and high-quality datasets exist,data-driven models can reveal relationships of arbitrary complexity.Nevertheless,constructing sufficiently large logging datasets with reliable labels remains challenging,making it difficult to apply data-driven models effectively in logging data interpretation.Furthermore,data-driven models often act as“black boxes”without explaining their predictions or ensuring compliance with primary physical constraints.This paper proposes a machine learning method with strong physical constraints by integrating mechanism and data-driven models.Prior knowledge of logging data interpretation is embedded into machine learning regarding network structure,loss function,and optimization algorithm.We employ the Physically Informed Auto-Encoder(PIAE)to predict porosity and water saturation,which can be trained without labeled reservoir parameters using self-supervised learning techniques.This approach effectively achieves automated interpretation and facilitates generalization across diverse datasets.展开更多
The emergence of Medical Large Language Models has significantly transformed healthcare.Medical Large Language Models(Med-LLMs)serve as transformative tools that enhance clinical practice through applications in decis...The emergence of Medical Large Language Models has significantly transformed healthcare.Medical Large Language Models(Med-LLMs)serve as transformative tools that enhance clinical practice through applications in decision support,documentation,and diagnostics.This evaluation examines the performance of leading Med-LLMs,including GPT-4Med,Med-PaLM,MEDITRON,PubMedGPT,and MedAlpaca,across diverse medical datasets.It provides graphical comparisons of their effectiveness in distinct healthcare domains.The study introduces a domain-specific categorization system that aligns these models with optimal applications in clinical decision-making,documentation,drug discovery,research,patient interaction,and public health.The paper addresses deployment challenges of Medical-LLMs,emphasizing trustworthiness and explainability as essential requirements for healthcare AI.It presents current evaluation techniques that improve model transparency in high-stakes medical contexts and analyzes regulatory frameworks using benchmarking datasets such asMedQA,MedMCQA,PubMedQA,and MIMIC.By identifying ongoing challenges in biasmitigation,reliability,and ethical compliance,thiswork serves as a resource for selecting appropriate Med-LLMs and outlines future directions in the field.This analysis offers a roadmap for developing Med-LLMs that balance technological innovation with the trust and transparency required for clinical integration,a perspective often overlooked in existing literature.展开更多
An acute skin injury model using continuous tape tearing was established,and studies the application of this model in the evaluation of soothing effects through instrument evaluation.30 healthy adult subjects were sel...An acute skin injury model using continuous tape tearing was established,and studies the application of this model in the evaluation of soothing effects through instrument evaluation.30 healthy adult subjects were selected as the research subjects,and an acute skin model was established on the forearm flexion side of the subjects.The skin color a^(*)value and transdermal water loss rate(TEWL)value of the blank and experimental groups were tested using instruments.The results showed that at 15 and 30 minutes after using the sample,the growth values of a^(*)value and TEWL value in the experimental group were significantly lower than those in the blank group(P<0.05),the acute skin lesion model can effectively evaluate the soothing effect of cosmetics.展开更多
In the context of advancing towards dual carbon goals,numerous factories are actively engaging in energy efficiency upgrades and transformations.To accurately pinpoint energy efficiency bottlenecks within factories an...In the context of advancing towards dual carbon goals,numerous factories are actively engaging in energy efficiency upgrades and transformations.To accurately pinpoint energy efficiency bottlenecks within factories and prioritize renovation sequences,it is crucial to conduct comprehensive evaluations of the energy performance across various workshops.Therefore,this paper proposes an evaluation model for workshop energy efficiency based on the drive-state-response(DSR)framework combined with the fuzzy BORDA method.Firstly,an in-depth analysis of the relationships between different energy efficiency indicators was conducted.Based on the DSR model,evaluation criteria were selected from three dimensions-drive factors,state characteristics,and response measures-to establish a robust energy efficiency indicator system.Secondly,three distinct assessment techniques were selected:Grey Relational Analysis(GRA),Entropy Weight Method(EWM),and Technique for Order Preference by Similarity to Ideal Solution(TOPSIS)forming a diversified set of evaluation methods.Subsequently,by introducing the fuzzy BORDA method,a comprehensive energy efficiency evaluation model was developed,aimed at quantitatively ranking the energy performance status of each workshop.Using a real-world factory as a case study,applying our proposed evaluationmodel yielded detailed scores and rankings for each workshop.Furthermore,post hoc testing was performed using the Spearman correlation coefficient,revealing a statistic value of 10.209,which validates the effectiveness and reliability of the proposed evaluation model.This model not only assists in identifying underperforming workshops within the factory but also provides solid data support and a decision-making basis for future energy efficiency optimization strategies.展开更多
Objectives:Valid estimation of energy expenditure remains a challenge,particularly when using ankle-and thighworn devices.The Move 4 is a research-grade accelerometer previously tested for predicting metabolic equival...Objectives:Valid estimation of energy expenditure remains a challenge,particularly when using ankle-and thighworn devices.The Move 4 is a research-grade accelerometer previously tested for predicting metabolic equivalents(METs)when worn at the waist or wrist.This study aimed to calibrate and evaluate regression models to estimate METs from Move 4 data when worn at the ankle and thigh.Methods:Participants completed walking and jogging tasks under laboratory conditions while wearing Move 4 sensors and with indirect calorimetry as a reference measure.Models were calibrated using study 1(n=160)and evaluated in an independent dataset(study 2;n=15).Performance was assessed using mean absolute error(MAE),root mean square error(RMSE),and Bland-Altman analyses.Results:The MET models demonstrated strong agreement across both locations and datasets.For the thigh position,the MAE ranged from 0.60 METs(walking)to 1.38 METs(jogging),with RMSE of 0.82 and 1.70 in the evaluation data.Calibration metrics were comparable(jogging:MAE=1.24,RMSE=1.63).The ankle models showed similar accuracy,with MAEs of 0.66(walking)and 1.39(jogging),and RMSEs of 0.85 and 1.67,respectively.Systematic bias remained low(mean differences between−0.34 and−0.01 METs).Conclusions:This study provides the first calibration and evaluation for estimating METs from ankle-and thigh-worn Move 4 accelerometers.The model indicated accurate,highresolution MET estimation for walking and jogging.Future work should expand independent performance evaluations,including diverse activities such as static activities,and diverse samples under free-living conditions.展开更多
With the rapid development of generative artificial intelligence technologies,represented by large language models,university-level computer science education is undergoing a critical transition-from knowledge-based i...With the rapid development of generative artificial intelligence technologies,represented by large language models,university-level computer science education is undergoing a critical transition-from knowledge-based instruction to competency-oriented teaching.A postgraduate student competency evaluation model can serve as a framework to organize and guide both teaching and research activities at the postgraduate level.A number of relevant research efforts have already been conducted in this area.Graduate education plays a vital role not only as a continuation and enhancement of undergraduate education but also as essential preparation for future research endeavors.An analysis of the acceptance of competency evaluation models refers to the assessment of how various stakeholders perceive the importance of different components within the model.Investigating the degree of acceptance among diverse groups-such as current undergraduate students,current postgraduate students,graduates with less than three years of work experience,and those with more than three years of work experience-can offer valuable insights for improving and optimizing postgraduate education and training practices.展开更多
With the widespread application of Internet of Things(IoT)technology,the processing of massive realtime streaming data poses significant challenges to the computational and data-processing capabilities of systems.Alth...With the widespread application of Internet of Things(IoT)technology,the processing of massive realtime streaming data poses significant challenges to the computational and data-processing capabilities of systems.Although distributed streaming data processing frameworks such asApache Flink andApache Spark Streaming provide solutions,meeting stringent response time requirements while ensuring high throughput and resource utilization remains an urgent problem.To address this,the study proposes a formal modeling approach based on Performance Evaluation Process Algebra(PEPA),which abstracts the core components and interactions of cloud-based distributed streaming data processing systems.Additionally,a generic service flow generation algorithmis introduced,enabling the automatic extraction of service flows fromthe PEPAmodel and the computation of key performance metrics,including response time,throughput,and resource utilization.The novelty of this work lies in the integration of PEPA-based formal modeling with the service flow generation algorithm,bridging the gap between formal modeling and practical performance evaluation for IoT systems.Simulation experiments demonstrate that optimizing the execution efficiency of components can significantly improve system performance.For instance,increasing the task execution rate from 10 to 100 improves system performance by 9.53%,while further increasing it to 200 results in a 21.58%improvement.However,diminishing returns are observed when the execution rate reaches 500,with only a 0.42%gain.Similarly,increasing the number of TaskManagers from 10 to 20 improves response time by 18.49%,but the improvement slows to 6.06% when increasing from 20 to 50,highlighting the importance of co-optimizing component efficiency and resource management to achieve substantial performance gains.This study provides a systematic framework for analyzing and optimizing the performance of IoT systems for large-scale real-time streaming data processing.The proposed approach not only identifies performance bottlenecks but also offers insights into improving system efficiency under different configurations and workloads.展开更多
The average stiffness performance indices throughout the workspace are commonly used as global stiffness performance indices to evaluate the overall stiffness performance of parallel mechanisms,which involves an analy...The average stiffness performance indices throughout the workspace are commonly used as global stiffness performance indices to evaluate the overall stiffness performance of parallel mechanisms,which involves an analysis of the stiffness performance of numerous discrete points in the workspace.This necessitates time-consuming and inefficient calculation,which is particularly pronounced in the optimization design stage of the mechanism,where the variations in the global stiffness performance indices versus various dimensional and structural parameters need to be analyzed.This paper presents a semi-analytical approach for stiffness modeling of the novel(R(RPS&RP))&2-UPS parallel mechanism(referred to as the Trifree mechanism)and proposes“local”stiffness performance indices as alternatives to global indices.Drawing on the screw theory,the Cartesian stiffness matrix of the Trifree mechanism is formulated explicitly by considering the compliances of all elastic elements and the over-constraint characteristics inherent in the mechanism.Based on the spherical motion pattern of the Trifree mechanism,four special reference configurations are extracted within the workspace.This yields“local”stiffness performance indices capable of accurately evaluating the overall stiffness performance of the mechanism and effectively improving the computational efficiency.The variations in global and“local”stiffness performance indices versus key design parameters are investigated.Furthermore,the proposed indices are applied to the Tricept and Trimule mechanisms.The results demonstrate that the proposed indices exhibit excellent computational accuracy and efficiency in evaluating the overall stiffness performance of these spherical parallel mechanisms.Moreover,the stiffness performance of the novel parallel mechanism investigated in this study closely resembles that of the well-known Tricept and Trimule mechanisms.This research proposes a semi-analytic stiffness model of the Trifree mechanism and“local”stiffness performance indices to evaluate the overall stiffness performance,thereby substantially improving the computational efficiency without sacrificing accuracy.展开更多
The rapid development of evolutionary deep learning has led to the emergence of various Neural Architecture Search(NAS)algorithms designed to optimize neural network structures.However,these algorithms often face sign...The rapid development of evolutionary deep learning has led to the emergence of various Neural Architecture Search(NAS)algorithms designed to optimize neural network structures.However,these algorithms often face significant computational costs due to the time-consuming process of training neural networks and evaluating their performance.Traditional NAS approaches,which rely on exhaustive evaluations and large training datasets,are inefficient for solving complex image classification tasks within limited time frames.To address these challenges,this paper proposes a novel NAS algorithm that integrates a hierarchical evaluation strategy based on Surrogate models,specifically using supernet to pre-trainweights and randomforests as performance predictors.This hierarchical framework combines rapid Surrogate model evaluations with traditional,precise evaluations to balance the trade-off between performance accuracy and computational efficiency.The algorithm significantly reduces the time required for model evaluation by predicting the fitness of candidate architectures using a random forest Surrogate model,thus alleviating the need for full training cycles for each architecture.The proposed method also incorporates evolutionary operations such as mutation and crossover to refine the search process and improve the accuracy of the resulting architectures.Experimental evaluations on the CIFAR-10 and CIFAR-100 datasets demonstrate that the proposed hierarchical evaluation strategy reduces the search time and costs compared to traditional methods,while achieving comparable or even superior model performance.The results suggest that this approach can efficiently handle resourceconstrained tasks,providing a promising solution for accelerating the NAS process without compromising the quality of the generated architectures.展开更多
Offline policy evaluation,evaluating and selecting complex policies for decision-making by only using offline datasets is important in reinforcement learning.At present,the model-based offline policy evaluation(MBOPE)...Offline policy evaluation,evaluating and selecting complex policies for decision-making by only using offline datasets is important in reinforcement learning.At present,the model-based offline policy evaluation(MBOPE)is widely welcomed because of its easy to implement and good performance.MBOPE directly approximates the unknown value of a given policy using the Monte Carlo method given the estimated transition and reward functions of the environment.Usually,multiple models are trained,and then one of them is selected to be used.However,a challenge remains in selecting an appropriate model from those trained for further use.The authors first analyse the upper bound of the difference between the approximated value and the unknown true value.Theoretical results show that this difference is related to the trajectories generated by the given policy on the learnt model and the prediction error of the transition and reward functions at these generated data points.Based on the theoretical results,a new criterion is proposed to tell which trained model is better suited for evaluating the given policy.At last,the effectiveness of the proposed criterion is demonstrated on both benchmark and synthetic offline datasets.展开更多
With the continuous development of the nursing discipline,standardized nurse training has always been a crucial link in the development of nursing science and plays an irreplaceable role in talent cultivation.However,...With the continuous development of the nursing discipline,standardized nurse training has always been a crucial link in the development of nursing science and plays an irreplaceable role in talent cultivation.However,in the current standardized training for some nurses,there are problems such as the simplification of nursing skill evaluation models and insufficient post competence of nurses.Therefore,optimizing the training model for nursing talents has become an inevitable measure.The problem-based learning(PBL)method and the Direct Observation of Procedural Skills(DOPS)evaluation model provide new directions and guidance for the development of training.Against this background,this paper explores effective approaches for standardized nurse training,starting from basic concepts and gradually delving into specific practical paths,aiming to improve the quality of talent cultivation and provide valuable references for other researchers.展开更多
The comprehensive status of blast furnaces was one of the most important factors affecting their economy,quality,and longev-ity.The blast furnace comprehensive status had the nature of“black box,”and it was“unpredi...The comprehensive status of blast furnaces was one of the most important factors affecting their economy,quality,and longev-ity.The blast furnace comprehensive status had the nature of“black box,”and it was“unpredictable.”In this study,a blast furnace com-prehensive status score and prediction method based on a cascade system and a combined model were proposed to address this issue.A dual cascade evaluation system was developed by integrating subjective and objective weighting methods.The analytic hierarchy process,coefficient of variation,entropy weight method,and impart combinatorial games were jointly employed to determine the optimal weight distribution across indicators.Categorized statuses(raw material,gas flow,furnace body,furnace cylinder,and iron-slag)were evaluated.Based on the five categories of the status data,the second cascade was applied to upgrade the quantitative evaluation of the comprehens-ive status.The weights of the different categories were 0.22,0.15,0.22,0.21,and 0.20,respectively.According to the data analysis,the results of the comprehensive status score closely matched the on-site production logs.Based on the blast furnace smelting period,the maximal information coefficient method was applied to the 100 parameters that were most relevant to the comprehensive status.A com-bined prediction model for a comprehensive status score was designed using bidirectional long short-term memory(BiLSTM)and categorical boosting(CatBoost).The test results indicated that the combined model reduced the mean absolute error by an average of 0.275 and increased the hit rate by an average of 5.65 percentage points compared to BiLSTM or CatBoost alone.When the er-ror range was±2.5,the combined model predicted a hit rate of 91.66%for the next hour’s comprehensive status score,and its high accur-acy was deemed satisfactory for the field.SHapley Additive exPlanations(SHAP)and regression fitting were applied to analyze the lin-ear quantitative relationship between the key variables and the comprehensive status score.When the furnace bottom center temperature was increased by 10℃,the comprehensive status score increased by 0.44.This method contributes to a more precise management and control of the comprehensive status of the blast furnace on-site.展开更多
This paper proposes a multivariate data fusion based quality evaluation model for software talent cultivation.The model constructs a comprehensive ability and quality evaluation index system for college students from ...This paper proposes a multivariate data fusion based quality evaluation model for software talent cultivation.The model constructs a comprehensive ability and quality evaluation index system for college students from a perspective of engineering course,especially of software engineering.As for evaluation method,relying on the behavioral data of students during their school years,we aim to construct the evaluation model as objective as possible,effectively weakening the negative impact of personal subjective assumptions on the evaluation results.展开更多
This study explores the feasibility of constructing an intelligent educational evaluation system based on the CIPP model and artificial intelligence technology in the context of new engineering disciplines.By integrat...This study explores the feasibility of constructing an intelligent educational evaluation system based on the CIPP model and artificial intelligence technology in the context of new engineering disciplines.By integrating the CIPP model with AI technology,a novel intelligent educational evaluation system was designed.Through experimental validation and case studies,the system demonstrated significant effectiveness in improving teaching quality,facilitating personalized student development,and optimizing educational resource allocation.Additionally,the study predicts potential changes this system could bring to the education industry and proposes relevant policy recommendations.Although the current research has limitations,with technological advancements in the future,this system is expected to provide stronger support for innovations in engineering education models.展开更多
Existing sandstone rock structure evaluation methods rely on visual inspection,with low efficiency,semi-quantitative analysis of roundness,and inability to perform classified statistics in particle size analysis.This ...Existing sandstone rock structure evaluation methods rely on visual inspection,with low efficiency,semi-quantitative analysis of roundness,and inability to perform classified statistics in particle size analysis.This study presents an intelligent evaluation method for sandstone rock structure based on the Segment Anything Model(SAM).By developing a lightweight SAM fine-tuning method with rank-decomposition matrix adapters,a multispectral rock particle segmentation model named CoreSAM is constructed,which achieves rock particle edge extraction and type identification.Building upon this,we propose a comprehensive quantitative evaluation system for rock structure,assessing parameters including particle size,sorting,roundness,particle contact and cementation types.The experimental results demonstrate that CoreSAM outperforms existing methods in rock particle segmentation accuracy while showing excellent generalization across different image types such as CT scans and core photographs.The proposed method enables full-sample,classified particle size analysis and quantitative characterization of parameters like roundness,advancing reservoir evaluation towards more precise,quantitative,intuitive,and comprehensive development.展开更多
With the large-scale promotion of distributed photovoltaics,new challenges have emerged in the photovoltaic consumptionwithin distribution networks.Traditional photovoltaic consumption schemes have primarily focused o...With the large-scale promotion of distributed photovoltaics,new challenges have emerged in the photovoltaic consumptionwithin distribution networks.Traditional photovoltaic consumption schemes have primarily focused on static analysis.However,as the scale of photovoltaic power generation devices grows and the methods of integration diversify,a single consumption scheme is no longer sufficient to meet the actual needs of current distribution networks.Therefore,this paper proposes an optimal evaluation method for photovoltaic consumption schemes based on BASS model predictions of installed capacity,aiming to provide an effective tool for generating and evaluating photovoltaic consumption schemes in distribution networks.First,the BASS diffusion model,combined with existing photovoltaic capacity data and roof area information,is used to predict the trends in photovoltaic installed capacity for each substation area,providing a scientific basis for consumption evaluation.Secondly,an improved random scenario simulation method is proposed for assessing the photovoltaic consumption capacity in distribution networks.This method generates photovoltaic integration schemes based on the diffusion probabilities of different regions and evaluates the consumption capacity of each scheme.Finally,the Technique for Order Preference by Similarity to an Ideal Solution(TOPSIS)is used to comprehensively evaluate the generated schemes,ensuring that the selected scheme not only meets the consumption requirements but also offers high economic benefits and reliability.The effectiveness and feasibility of the proposedmethod are validated through simulations of the IEEE 33-node system,providing strong support for optimizing photovoltaic consumption schemes in distribution networks.展开更多
基金National Natural Science Foundation of China,Grant/Award Number:82000102 and 82270112。
文摘The incidence of benign airway stenosis(BAS)is on the rise,and current treatment options are associated with a significant risk of restenosis.Therefore,there is an urgent need to explore new and effective prevention and treatment methods.Animal models serve as essential tools for investigating disease mechanisms and assessing novel therapeutic strategies,and the scientific rigor of their construction and validation significantly impacts the reliability of research findings.This paper systematically reviews the research progress and evaluation systems of BAS animal models over the past decade,aiming to provide a robust foundation for the optimized construction of BAS models,intervention studies,and clinical translation.This effort is intended to facilitate the innovation and advancement in BAS prevention and treatment strategies.
基金supported by the Fundamental Research Funds for the Central Universities(No.CUC25SG013)the Foundation of Key Laboratory of Education Informatization for Nationalities(Yunnan Normal University),Ministry of Education(No.EIN2024C006).
文摘Online Public Opinion Reports consolidate news and social media for timely crisis management by governments and enterprises.While large language models(LLMs)enable automated report generation,this specific domain lacks formal task definitions and corresponding benchmarks.To bridge this gap,we define the Automated Online Public Opinion Report Generation(OPOR-Gen)task and construct OPOR-Bench,an event-centric dataset with 463 crisis events across 108 countries(comprising 8.8 K news articles and 185 K tweets).To evaluate report quality,we propose OPOR-Eval,a novel agent-based framework that simulates human expert evaluation.Validation experiments show OPOR-Eval achieves a high Spearman’s correlation(ρ=0.70)with human judgments,though challenges in temporal reasoning persist.This work establishes an initial foundation for advancing automated public opinion reporting research.
文摘LargeLanguageModels(LLMs)are increasingly appliedinthe fieldof code translation.However,existing evaluation methodologies suffer from two major limitations:(1)the high overlap between test data and pretraining corpora,which introduces significant bias in performance evaluation;and(2)mainstream metrics focus primarily on surface-level accuracy,failing to uncover the underlying factors that constrain model capabilities.To address these issues,this paper presents TCode(Translation-Oriented Code Evaluation benchmark)—a complexity-controllable,contamination-free benchmark dataset for code translation—alongside a dedicated static feature sensitivity evaluation framework.The dataset is carefully designed to control complexity along multiple dimensions—including syntactic nesting and expression intricacy—enabling both broad coverage and fine-grained differentiation of sample difficulty.This design supports precise evaluation of model capabilities across a wide spectrum of translation challenges.The proposed evaluation framework introduces a correlation-driven analysis mechanism based on static program features,enabling predictive modeling of translation success from two perspectives:Code Form Complexity(e.g.,code length and character density)and Semantic Modeling Complexity(e.g.,syntactic depth,control-flow nesting,and type system complexity).Empirical evaluations across representative LLMs—including Qwen2.5-72B and Llama3.3-70B—demonstrate that even state-of-the-art models achieve over 80% compilation success on simple samples,but their accuracy drops sharply below 40% on complex cases.Further correlation analysis indicates that Semantic Modeling Complexity alone is correlated with up to 60% of the variance in translation success,with static program features exhibiting nonlinear threshold effects that highlight clear capability boundaries.This study departs fromthe traditional accuracy-centric evaluation paradigm and,for the first time,systematically characterizes the capabilities of large languagemodels in translation tasks through the lens of programstatic features.The findings provide actionable insights for model refinement and training strategy development.
基金supported by the Fundamental Research Funds for the Central Universities,Nos.G2021KY05107,G2021KY05101the National Natural Science Foundation of China,Nos.32071316,32211530049+1 种基金the Natural Science Foundation of Shaanxi Province,No.2022-JM482the Education and Teaching Reform Funds for the Central Universities,No.23GZ230102(all to LL and HH).
文摘Although previous studies have demonstrated that transcranial focused ultrasound stimulation protects the ischemic brain,clear criteria for the stimulation time window and intensity are lacking.Electrical impedance tomography enables real-time monitoring of changes in cerebral blood perfusion within the ischemic brain,but investigating the feasibility of using this method to assess post-stroke rehabilitation in vivo remains critical.In this study,ischemic stroke was induced in rats through middle cerebral artery occlusion surgery.Transcranial focused ultrasound stimulation was used to treat the rat model of ischemia,and electrical impedance tomography was used to measure impedance during both the acute stage of ischemia and the rehabilitation stage following the stimulation.Electrical impedance tomography results indicated that cerebral impedance increased after the onset of ischemia and decreased following transcranial focused ultrasound stimulation.Furthermore,the stimulation promoted motor function recovery,reduced cerebral infarction volume in the rat model of ischemic stroke,and induced the expression of brain-derived neurotrophic factor in the ischemic brain.Our results also revealed a significant correlation between the impedance of the ischemic brain post-intervention and improvements in behavioral scores and infarct volume.This study shows that daily administration of transcranial focused ultrasound stimulation for 20 minutes to the ischemic hemisphere 24 hours after cerebral ischemia enhanced motor recovery in a rat model of ischemia.Additionally,our findings indicate that electrical impedance tomography can serve as a valuable tool for quantitatively evaluating rehabilitation after ischemic stroke in vivo.These findings suggest the feasibility of using impedance data collected via electrical impedance tomography to clinically assess the effects of rehabilitatory interventions for patients with ischemic stroke.
基金supported by National Key Research and Development Program (2019YFA0708301)National Natural Science Foundation of China (51974337)+2 种基金the Strategic Cooperation Projects of CNPC and CUPB (ZLZX2020-03)Science and Technology Innovation Fund of CNPC (2021DQ02-0403)Open Fund of Petroleum Exploration and Development Research Institute of CNPC (2022-KFKT-09)
文摘We propose an integrated method of data-driven and mechanism models for well logging formation evaluation,explicitly focusing on predicting reservoir parameters,such as porosity and water saturation.Accurately interpreting these parameters is crucial for effectively exploring and developing oil and gas.However,with the increasing complexity of geological conditions in this industry,there is a growing demand for improved accuracy in reservoir parameter prediction,leading to higher costs associated with manual interpretation.The conventional logging interpretation methods rely on empirical relationships between logging data and reservoir parameters,which suffer from low interpretation efficiency,intense subjectivity,and suitability for ideal conditions.The application of artificial intelligence in the interpretation of logging data provides a new solution to the problems existing in traditional methods.It is expected to improve the accuracy and efficiency of the interpretation.If large and high-quality datasets exist,data-driven models can reveal relationships of arbitrary complexity.Nevertheless,constructing sufficiently large logging datasets with reliable labels remains challenging,making it difficult to apply data-driven models effectively in logging data interpretation.Furthermore,data-driven models often act as“black boxes”without explaining their predictions or ensuring compliance with primary physical constraints.This paper proposes a machine learning method with strong physical constraints by integrating mechanism and data-driven models.Prior knowledge of logging data interpretation is embedded into machine learning regarding network structure,loss function,and optimization algorithm.We employ the Physically Informed Auto-Encoder(PIAE)to predict porosity and water saturation,which can be trained without labeled reservoir parameters using self-supervised learning techniques.This approach effectively achieves automated interpretation and facilitates generalization across diverse datasets.
文摘The emergence of Medical Large Language Models has significantly transformed healthcare.Medical Large Language Models(Med-LLMs)serve as transformative tools that enhance clinical practice through applications in decision support,documentation,and diagnostics.This evaluation examines the performance of leading Med-LLMs,including GPT-4Med,Med-PaLM,MEDITRON,PubMedGPT,and MedAlpaca,across diverse medical datasets.It provides graphical comparisons of their effectiveness in distinct healthcare domains.The study introduces a domain-specific categorization system that aligns these models with optimal applications in clinical decision-making,documentation,drug discovery,research,patient interaction,and public health.The paper addresses deployment challenges of Medical-LLMs,emphasizing trustworthiness and explainability as essential requirements for healthcare AI.It presents current evaluation techniques that improve model transparency in high-stakes medical contexts and analyzes regulatory frameworks using benchmarking datasets such asMedQA,MedMCQA,PubMedQA,and MIMIC.By identifying ongoing challenges in biasmitigation,reliability,and ethical compliance,thiswork serves as a resource for selecting appropriate Med-LLMs and outlines future directions in the field.This analysis offers a roadmap for developing Med-LLMs that balance technological innovation with the trust and transparency required for clinical integration,a perspective often overlooked in existing literature.
文摘An acute skin injury model using continuous tape tearing was established,and studies the application of this model in the evaluation of soothing effects through instrument evaluation.30 healthy adult subjects were selected as the research subjects,and an acute skin model was established on the forearm flexion side of the subjects.The skin color a^(*)value and transdermal water loss rate(TEWL)value of the blank and experimental groups were tested using instruments.The results showed that at 15 and 30 minutes after using the sample,the growth values of a^(*)value and TEWL value in the experimental group were significantly lower than those in the blank group(P<0.05),the acute skin lesion model can effectively evaluate the soothing effect of cosmetics.
基金funded by the National Social Science Fund of China(Grant No.23BGL234).
文摘In the context of advancing towards dual carbon goals,numerous factories are actively engaging in energy efficiency upgrades and transformations.To accurately pinpoint energy efficiency bottlenecks within factories and prioritize renovation sequences,it is crucial to conduct comprehensive evaluations of the energy performance across various workshops.Therefore,this paper proposes an evaluation model for workshop energy efficiency based on the drive-state-response(DSR)framework combined with the fuzzy BORDA method.Firstly,an in-depth analysis of the relationships between different energy efficiency indicators was conducted.Based on the DSR model,evaluation criteria were selected from three dimensions-drive factors,state characteristics,and response measures-to establish a robust energy efficiency indicator system.Secondly,three distinct assessment techniques were selected:Grey Relational Analysis(GRA),Entropy Weight Method(EWM),and Technique for Order Preference by Similarity to Ideal Solution(TOPSIS)forming a diversified set of evaluation methods.Subsequently,by introducing the fuzzy BORDA method,a comprehensive energy efficiency evaluation model was developed,aimed at quantitatively ranking the energy performance status of each workshop.Using a real-world factory as a case study,applying our proposed evaluationmodel yielded detailed scores and rankings for each workshop.Furthermore,post hoc testing was performed using the Spearman correlation coefficient,revealing a statistic value of 10.209,which validates the effectiveness and reliability of the proposed evaluation model.This model not only assists in identifying underperforming workshops within the factory but also provides solid data support and a decision-making basis for future energy efficiency optimization strategies.
基金funded by the German Research Foundation[Grant Number:496846758].
文摘Objectives:Valid estimation of energy expenditure remains a challenge,particularly when using ankle-and thighworn devices.The Move 4 is a research-grade accelerometer previously tested for predicting metabolic equivalents(METs)when worn at the waist or wrist.This study aimed to calibrate and evaluate regression models to estimate METs from Move 4 data when worn at the ankle and thigh.Methods:Participants completed walking and jogging tasks under laboratory conditions while wearing Move 4 sensors and with indirect calorimetry as a reference measure.Models were calibrated using study 1(n=160)and evaluated in an independent dataset(study 2;n=15).Performance was assessed using mean absolute error(MAE),root mean square error(RMSE),and Bland-Altman analyses.Results:The MET models demonstrated strong agreement across both locations and datasets.For the thigh position,the MAE ranged from 0.60 METs(walking)to 1.38 METs(jogging),with RMSE of 0.82 and 1.70 in the evaluation data.Calibration metrics were comparable(jogging:MAE=1.24,RMSE=1.63).The ankle models showed similar accuracy,with MAEs of 0.66(walking)and 1.39(jogging),and RMSEs of 0.85 and 1.67,respectively.Systematic bias remained low(mean differences between−0.34 and−0.01 METs).Conclusions:This study provides the first calibration and evaluation for estimating METs from ankle-and thigh-worn Move 4 accelerometers.The model indicated accurate,highresolution MET estimation for walking and jogging.Future work should expand independent performance evaluations,including diverse activities such as static activities,and diverse samples under free-living conditions.
文摘With the rapid development of generative artificial intelligence technologies,represented by large language models,university-level computer science education is undergoing a critical transition-from knowledge-based instruction to competency-oriented teaching.A postgraduate student competency evaluation model can serve as a framework to organize and guide both teaching and research activities at the postgraduate level.A number of relevant research efforts have already been conducted in this area.Graduate education plays a vital role not only as a continuation and enhancement of undergraduate education but also as essential preparation for future research endeavors.An analysis of the acceptance of competency evaluation models refers to the assessment of how various stakeholders perceive the importance of different components within the model.Investigating the degree of acceptance among diverse groups-such as current undergraduate students,current postgraduate students,graduates with less than three years of work experience,and those with more than three years of work experience-can offer valuable insights for improving and optimizing postgraduate education and training practices.
基金funded by the Joint Project of Industry-University-Research of Jiangsu Province(Grant:BY20231146).
文摘With the widespread application of Internet of Things(IoT)technology,the processing of massive realtime streaming data poses significant challenges to the computational and data-processing capabilities of systems.Although distributed streaming data processing frameworks such asApache Flink andApache Spark Streaming provide solutions,meeting stringent response time requirements while ensuring high throughput and resource utilization remains an urgent problem.To address this,the study proposes a formal modeling approach based on Performance Evaluation Process Algebra(PEPA),which abstracts the core components and interactions of cloud-based distributed streaming data processing systems.Additionally,a generic service flow generation algorithmis introduced,enabling the automatic extraction of service flows fromthe PEPAmodel and the computation of key performance metrics,including response time,throughput,and resource utilization.The novelty of this work lies in the integration of PEPA-based formal modeling with the service flow generation algorithm,bridging the gap between formal modeling and practical performance evaluation for IoT systems.Simulation experiments demonstrate that optimizing the execution efficiency of components can significantly improve system performance.For instance,increasing the task execution rate from 10 to 100 improves system performance by 9.53%,while further increasing it to 200 results in a 21.58%improvement.However,diminishing returns are observed when the execution rate reaches 500,with only a 0.42%gain.Similarly,increasing the number of TaskManagers from 10 to 20 improves response time by 18.49%,but the improvement slows to 6.06% when increasing from 20 to 50,highlighting the importance of co-optimizing component efficiency and resource management to achieve substantial performance gains.This study provides a systematic framework for analyzing and optimizing the performance of IoT systems for large-scale real-time streaming data processing.The proposed approach not only identifies performance bottlenecks but also offers insights into improving system efficiency under different configurations and workloads.
基金Supported by National High-quality Development Project of China(Grant No.2340STCZB193).
文摘The average stiffness performance indices throughout the workspace are commonly used as global stiffness performance indices to evaluate the overall stiffness performance of parallel mechanisms,which involves an analysis of the stiffness performance of numerous discrete points in the workspace.This necessitates time-consuming and inefficient calculation,which is particularly pronounced in the optimization design stage of the mechanism,where the variations in the global stiffness performance indices versus various dimensional and structural parameters need to be analyzed.This paper presents a semi-analytical approach for stiffness modeling of the novel(R(RPS&RP))&2-UPS parallel mechanism(referred to as the Trifree mechanism)and proposes“local”stiffness performance indices as alternatives to global indices.Drawing on the screw theory,the Cartesian stiffness matrix of the Trifree mechanism is formulated explicitly by considering the compliances of all elastic elements and the over-constraint characteristics inherent in the mechanism.Based on the spherical motion pattern of the Trifree mechanism,four special reference configurations are extracted within the workspace.This yields“local”stiffness performance indices capable of accurately evaluating the overall stiffness performance of the mechanism and effectively improving the computational efficiency.The variations in global and“local”stiffness performance indices versus key design parameters are investigated.Furthermore,the proposed indices are applied to the Tricept and Trimule mechanisms.The results demonstrate that the proposed indices exhibit excellent computational accuracy and efficiency in evaluating the overall stiffness performance of these spherical parallel mechanisms.Moreover,the stiffness performance of the novel parallel mechanism investigated in this study closely resembles that of the well-known Tricept and Trimule mechanisms.This research proposes a semi-analytic stiffness model of the Trifree mechanism and“local”stiffness performance indices to evaluate the overall stiffness performance,thereby substantially improving the computational efficiency without sacrificing accuracy.
文摘The rapid development of evolutionary deep learning has led to the emergence of various Neural Architecture Search(NAS)algorithms designed to optimize neural network structures.However,these algorithms often face significant computational costs due to the time-consuming process of training neural networks and evaluating their performance.Traditional NAS approaches,which rely on exhaustive evaluations and large training datasets,are inefficient for solving complex image classification tasks within limited time frames.To address these challenges,this paper proposes a novel NAS algorithm that integrates a hierarchical evaluation strategy based on Surrogate models,specifically using supernet to pre-trainweights and randomforests as performance predictors.This hierarchical framework combines rapid Surrogate model evaluations with traditional,precise evaluations to balance the trade-off between performance accuracy and computational efficiency.The algorithm significantly reduces the time required for model evaluation by predicting the fitness of candidate architectures using a random forest Surrogate model,thus alleviating the need for full training cycles for each architecture.The proposed method also incorporates evolutionary operations such as mutation and crossover to refine the search process and improve the accuracy of the resulting architectures.Experimental evaluations on the CIFAR-10 and CIFAR-100 datasets demonstrate that the proposed hierarchical evaluation strategy reduces the search time and costs compared to traditional methods,while achieving comparable or even superior model performance.The results suggest that this approach can efficiently handle resourceconstrained tasks,providing a promising solution for accelerating the NAS process without compromising the quality of the generated architectures.
文摘Offline policy evaluation,evaluating and selecting complex policies for decision-making by only using offline datasets is important in reinforcement learning.At present,the model-based offline policy evaluation(MBOPE)is widely welcomed because of its easy to implement and good performance.MBOPE directly approximates the unknown value of a given policy using the Monte Carlo method given the estimated transition and reward functions of the environment.Usually,multiple models are trained,and then one of them is selected to be used.However,a challenge remains in selecting an appropriate model from those trained for further use.The authors first analyse the upper bound of the difference between the approximated value and the unknown true value.Theoretical results show that this difference is related to the trajectories generated by the given policy on the learnt model and the prediction error of the transition and reward functions at these generated data points.Based on the theoretical results,a new criterion is proposed to tell which trained model is better suited for evaluating the given policy.At last,the effectiveness of the proposed criterion is demonstrated on both benchmark and synthetic offline datasets.
文摘With the continuous development of the nursing discipline,standardized nurse training has always been a crucial link in the development of nursing science and plays an irreplaceable role in talent cultivation.However,in the current standardized training for some nurses,there are problems such as the simplification of nursing skill evaluation models and insufficient post competence of nurses.Therefore,optimizing the training model for nursing talents has become an inevitable measure.The problem-based learning(PBL)method and the Direct Observation of Procedural Skills(DOPS)evaluation model provide new directions and guidance for the development of training.Against this background,this paper explores effective approaches for standardized nurse training,starting from basic concepts and gradually delving into specific practical paths,aiming to improve the quality of talent cultivation and provide valuable references for other researchers.
基金supported by the Youth Program of National Natural Science Foundation of China(No.52404343)the General Program of National Natural Science Foundation of China(No.52274326)+2 种基金the Fundamental Research Funds for the Central Universities,China(No.N2425031)the Seventh Batch of Ten Thousand Talents Plan of China(No.ZX20220553)the Liaoning Province Science and Technology Plan Joint Program,China(Key Research and Development Program Project)(No.2023JH2/101800058).
文摘The comprehensive status of blast furnaces was one of the most important factors affecting their economy,quality,and longev-ity.The blast furnace comprehensive status had the nature of“black box,”and it was“unpredictable.”In this study,a blast furnace com-prehensive status score and prediction method based on a cascade system and a combined model were proposed to address this issue.A dual cascade evaluation system was developed by integrating subjective and objective weighting methods.The analytic hierarchy process,coefficient of variation,entropy weight method,and impart combinatorial games were jointly employed to determine the optimal weight distribution across indicators.Categorized statuses(raw material,gas flow,furnace body,furnace cylinder,and iron-slag)were evaluated.Based on the five categories of the status data,the second cascade was applied to upgrade the quantitative evaluation of the comprehens-ive status.The weights of the different categories were 0.22,0.15,0.22,0.21,and 0.20,respectively.According to the data analysis,the results of the comprehensive status score closely matched the on-site production logs.Based on the blast furnace smelting period,the maximal information coefficient method was applied to the 100 parameters that were most relevant to the comprehensive status.A com-bined prediction model for a comprehensive status score was designed using bidirectional long short-term memory(BiLSTM)and categorical boosting(CatBoost).The test results indicated that the combined model reduced the mean absolute error by an average of 0.275 and increased the hit rate by an average of 5.65 percentage points compared to BiLSTM or CatBoost alone.When the er-ror range was±2.5,the combined model predicted a hit rate of 91.66%for the next hour’s comprehensive status score,and its high accur-acy was deemed satisfactory for the field.SHapley Additive exPlanations(SHAP)and regression fitting were applied to analyze the lin-ear quantitative relationship between the key variables and the comprehensive status score.When the furnace bottom center temperature was increased by 10℃,the comprehensive status score increased by 0.44.This method contributes to a more precise management and control of the comprehensive status of the blast furnace on-site.
基金supported in part by the Education Reform Key Projects of Heilongjiang Province(Grant No.SJGZ20220011,SJGZ20220012)the Excellent Project of Ministry of Education and China Higher Education Association on Digital Ideological and Political Education in Universities(Grant No.GXSZSZJPXM001)。
文摘This paper proposes a multivariate data fusion based quality evaluation model for software talent cultivation.The model constructs a comprehensive ability and quality evaluation index system for college students from a perspective of engineering course,especially of software engineering.As for evaluation method,relying on the behavioral data of students during their school years,we aim to construct the evaluation model as objective as possible,effectively weakening the negative impact of personal subjective assumptions on the evaluation results.
基金Liaoning Provincial Social Science Planning Fund“Research on the Educational Intelligent Evaluation System Based on the CIPP Model and Artificial Intelligence under the Background of New Engineering”(L22BTJ005)。
文摘This study explores the feasibility of constructing an intelligent educational evaluation system based on the CIPP model and artificial intelligence technology in the context of new engineering disciplines.By integrating the CIPP model with AI technology,a novel intelligent educational evaluation system was designed.Through experimental validation and case studies,the system demonstrated significant effectiveness in improving teaching quality,facilitating personalized student development,and optimizing educational resource allocation.Additionally,the study predicts potential changes this system could bring to the education industry and proposes relevant policy recommendations.Although the current research has limitations,with technological advancements in the future,this system is expected to provide stronger support for innovations in engineering education models.
基金Supported by the National Natural Science Foundation of China(42372175,72088101)PetroChina Science and Technology Project of(2023DJ84)Basic Research Cooperation Project between China National Petroleum Corporation and Peking University.
文摘Existing sandstone rock structure evaluation methods rely on visual inspection,with low efficiency,semi-quantitative analysis of roundness,and inability to perform classified statistics in particle size analysis.This study presents an intelligent evaluation method for sandstone rock structure based on the Segment Anything Model(SAM).By developing a lightweight SAM fine-tuning method with rank-decomposition matrix adapters,a multispectral rock particle segmentation model named CoreSAM is constructed,which achieves rock particle edge extraction and type identification.Building upon this,we propose a comprehensive quantitative evaluation system for rock structure,assessing parameters including particle size,sorting,roundness,particle contact and cementation types.The experimental results demonstrate that CoreSAM outperforms existing methods in rock particle segmentation accuracy while showing excellent generalization across different image types such as CT scans and core photographs.The proposed method enables full-sample,classified particle size analysis and quantitative characterization of parameters like roundness,advancing reservoir evaluation towards more precise,quantitative,intuitive,and comprehensive development.
基金supported in part by theThe Planning Subject Project of Guangdong Power Grid Co.,Ltd.(62273104).
文摘With the large-scale promotion of distributed photovoltaics,new challenges have emerged in the photovoltaic consumptionwithin distribution networks.Traditional photovoltaic consumption schemes have primarily focused on static analysis.However,as the scale of photovoltaic power generation devices grows and the methods of integration diversify,a single consumption scheme is no longer sufficient to meet the actual needs of current distribution networks.Therefore,this paper proposes an optimal evaluation method for photovoltaic consumption schemes based on BASS model predictions of installed capacity,aiming to provide an effective tool for generating and evaluating photovoltaic consumption schemes in distribution networks.First,the BASS diffusion model,combined with existing photovoltaic capacity data and roof area information,is used to predict the trends in photovoltaic installed capacity for each substation area,providing a scientific basis for consumption evaluation.Secondly,an improved random scenario simulation method is proposed for assessing the photovoltaic consumption capacity in distribution networks.This method generates photovoltaic integration schemes based on the diffusion probabilities of different regions and evaluates the consumption capacity of each scheme.Finally,the Technique for Order Preference by Similarity to an Ideal Solution(TOPSIS)is used to comprehensively evaluate the generated schemes,ensuring that the selected scheme not only meets the consumption requirements but also offers high economic benefits and reliability.The effectiveness and feasibility of the proposedmethod are validated through simulations of the IEEE 33-node system,providing strong support for optimizing photovoltaic consumption schemes in distribution networks.