Abstract: Objective: To evaluate the criterion validity of the Chronic Heart Failure (CHF) Quality of Life (QOL) Scale of Integrative Medicine (abbreviated as the Scale). Methods: Clinical data of 249 CHF in-patients were collected.
Funding: Supported by the Beijing Natural Science Foundation, IS23088.
Abstract: Assessing adolescent body image is crucial for mental health interventions, yet traditional methods suffer from limited dimensional coverage, poor dynamic tracking, and weak ecological validity. To address these gaps, this study proposes a multidimensional evaluation using large language models (LLMs) and compares its criterion validity against a dictionary-based method and expert ratings. We defined four dimensions (perception, positive attitude, negative attitude, and behavior) by reviewing the body-image literature and built a validated dictionary through expert ratings and iterative refinement. A four-step prompt-engineering process, incorporating role-playing and other optimization techniques, produced tailored prompts for LLM-based recognition. To validate these tools, we collected self-reported texts and scale scores from 194 university students, performed semantic analyses with Llama-3.1-70B, Qwen-Max, and DeepSeek-R1 using these prompts, and confirmed ecological validity on social media posts. Results indicate that our multidimensional dictionary correlated significantly with expert ratings across all four dimensions (r = 0.515 to 0.625), providing a solid benchmark. LLM-based assessments then outperformed both the dictionary and human ratings, with zero-shot LLMs achieving r = 0.664 in positive attitude (vs. expert r = 0.657) and DeepSeek-R1 reaching r = 0.722 in perception. Role-playing techniques significantly improved validity in the perception dimension (Δr = +0.117). Consistency checks revealed that the DeepSeek model reduced error dispersion in extreme score ranges by 48.4% compared to human ratings, with the 95% consistency limits covering the fluctuations of human scores. Incremental validity analysis showed that LLMs could replace human evaluations in the perception dimension (ΔR² = 0.220). In ecological validity checks, the Qwen model achieved a correlation of 0.651 in the social media behavior dimension, 53.1% higher than the dictionary method. We found that LLMs demonstrated significant advantages in the multidimensional assessment of body image, offering a new intelligent approach to mental health measurement.
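The abstract above reports two kinds of statistics: criterion validity as a Pearson correlation r between a method's scores and a criterion, and incremental validity as a change in explained variance, ΔR². The following is a minimal sketch of how such quantities are commonly computed. It is not the study's code. The function names `pearson_r` and `incremental_r2` and the toy numbers are hypothetical, and the ΔR² here assumes a simple two-predictor linear regression, which the abstract does not specify.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length score lists."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two lists of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant scores")
    return sxy / sqrt(sxx * syy)


def incremental_r2(criterion, baseline, added):
    """Gain in R^2 when `added` joins `baseline` as a predictor of `criterion`.

    Uses the closed form for a two-predictor linear regression:
    R^2 = (r_yb^2 + r_ya^2 - 2*r_yb*r_ya*r_ba) / (1 - r_ba^2).
    """
    r_yb = pearson_r(criterion, baseline)
    r_ya = pearson_r(criterion, added)
    r_ba = pearson_r(baseline, added)
    if abs(r_ba) >= 1.0:
        raise ValueError("predictors are perfectly collinear")
    r2_full = (r_yb**2 + r_ya**2 - 2 * r_yb * r_ya * r_ba) / (1 - r_ba**2)
    return r2_full - r_yb**2


if __name__ == "__main__":
    # Hypothetical toy scores, NOT data from the study.
    scale_scores = [1, 2, 3, 4, 5]    # criterion (e.g. questionnaire scale)
    expert_scores = [2, 1, 4, 3, 5]   # baseline predictor
    llm_scores = [1, 3, 2, 5, 4]      # added predictor
    print("criterion validity r:", pearson_r(scale_scores, llm_scores))
    print("incremental Delta R^2:",
          incremental_r2(scale_scores, expert_scores, llm_scores))
```

Under this reading, a larger ΔR² means the added predictor explains criterion variance that the baseline predictor does not. Which predictor the study treated as baseline is not stated in the abstract.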
Funding: The Association of River Improvement Fund and the JSPS Grant B for Young Researchers (2010-2013), both of which provided financial support for this work.
Abstract: While the true value of environmental goods may be captured in a one-off payment, it may be easier to add a smaller amount to a private good by means of a donation and collect the total environmental value over time. For that, however, we need to ensure that the smaller heritage conservation donation added to a private good is adequate, so that we can find retailers to participate in such fund-raising activities. We test the contingent valuation method's criterion validity by comparing consumers' stated purchasing behavior with their actual behavior. The price increase from the addition of the donation did not affect total sales of the commodity. Adding a donation to specialized private goods may be an effective way to collect landscape and agricultural heritage conservation donations. Furthermore, our findings suggest that funds can be collected without affecting commodity sales. This approach is effective in other environmental protection activities.