This study examines the reliability and validity of AI-generated scoring for continuation writing tasks.By comparing GPT-4 with eight experienced human raters across 21 student responses,it evaluates AI’s consistency...This study examines the reliability and validity of AI-generated scoring for continuation writing tasks.By comparing GPT-4 with eight experienced human raters across 21 student responses,it evaluates AI’s consistency,severity,and alignment with human scoring criteria.Results show that AI exhibits high self-consistency and adapts effectively to different scoring roles(e.g.,teacher vs.highstakes rater).However,AI scores were more lenient than human raters and demonstrated divergent evaluation focuses—prioritizing narrative coherence and emotional depth,while teachers emphasized linguistic accuracy and richness of detail.The findings suggest AI’s potential as a supplementary assessment tool,offering rapid,holistic feedback,but highlight the need for further calibration to align with educational standards.Implications include exploring hybrid evaluation models that leverage the strengths of both AI and human raters to achieve more equitable,efficient,and pedagogically meaningful writing assessments.展开更多
This study investigates whether and how exemplars can facilitate student engagement in self-assessment tasks.An intact class of 32 undergraduates majoring in Chinese-English translation participated in the study.After...This study investigates whether and how exemplars can facilitate student engagement in self-assessment tasks.An intact class of 32 undergraduates majoring in Chinese-English translation participated in the study.After some preliminary training,the students performed three translation self-assessment tasks,each involving comparison with authentic exemplars of varying quality and filling out a structured self-assessment report.Our analysis of multiple sources of data reveals the multi-dimensional nature of student engagement in self-assessment activities and the potential for using authentic exemplars of different qualities to enhance students’cognitive and behavioral engagement and mitigate negative emotions in the process.Pedagogical implications for implementing exemplars and self-assessment to promote student engagement and support student learning are discussed.展开更多
BACKGROUND During the gradual decline of physical and social functioning associated with end-stage renal disease,patients might experience a premonition of impending death,resulting in a series of pre-mourning grief r...BACKGROUND During the gradual decline of physical and social functioning associated with end-stage renal disease,patients might experience a premonition of impending death,resulting in a series of pre-mourning grief responses called preparatory grief.The preparatory grief in advanced cancer patients(PGAC)scale is the most widely used preparatory grief scale for patients on hemodialysis in China.AIM To verify the reliability and validity of the PGAC scale in patients on hemodialysis.METHODS In total,327 patients undergoing regular hemodialysis in the blood purification center of three grade-A tertiary hospitals in Guangdong and Guizhou provinces were selected by convenience sampling.The assessment was administered using the general information questionnaire and the Chinese version of PGAC.SPSS 25.0 and Amos 24.0 were used for item analysis,confirmatory factor analysis(CFA),convergent validity,and internal consistency reliability estimation.RESULTS In the modified Chinese version of PGAC,7 dimensions covering 27 total items were retained.CFA revealed a good fit of the factor model(chi-square degree of freedom=2.056,standardized root mean square residual=0.0479,root mean square error of approximation=0.0570,GFI=0.872,AGFI=0.841,IFI=0.931,CFI=0.930,TLI=0.919).The factor loadings of the items ranged 0.503-0.884.The composite reliability ranged 0.664-0.914,and the average variance extracted ranged 0.366-0.747.Cronbach’sαof the scale was 0.945,and Cronbach’sαfor various dimensions ranged 0.662-0.914.CONCLUSION The modified PGAC has good reliability and validity,and it can effectively measure preparatory grief in patients on hemodialysis.展开更多
This study aimed to determine the reliability,validity and measurement invariance of scores from the Difficulties in Emotion Regulation Scale-8 in Chinese context.A total of 1114 Chinese adolescents were participants ...This study aimed to determine the reliability,validity and measurement invariance of scores from the Difficulties in Emotion Regulation Scale-8 in Chinese context.A total of 1114 Chinese adolescents were participants in three phases:N=424 for the initial DERS-8 measure completion;N=586 the DERS-8,General Anxiety Disorder Scale,Depression Scale and Emotion Regulation Scale completion,with an interval of one month.Then an additional 104 adolescents also completed DERS-8,General Anxiety Disorder Scale,Depression Scale and Emotion Regulation Scale.Both exploratory and confirmatory factor analyses confirmed the one-factor model of the scale,and the fitness indicators wereχ^(2)/df=4.05,RMSEA=0.07,CFI=0.98,and TLI=0.97.Each item of the DERS-8 had good discrimination.The internal consistency reliability coefficient,split-half reliability coefficient and test-retest reliability coefficient of the scale scores were 0.90,0.87 and 0.66,respectively.The findings suggest the Chinese version of the DERS-8 is a reliable measure of difficulty of emotion regulation in Chinese adolescents.展开更多
Objective:This study aimed to translate the de Morton Mobility Index(DEMMI)into Thai and assess its measurement properties.Methods:The de Morton Mobility Index(DEMMI)was translated into Thai using a cross-cultural tra...Objective:This study aimed to translate the de Morton Mobility Index(DEMMI)into Thai and assess its measurement properties.Methods:The de Morton Mobility Index(DEMMI)was translated into Thai using a cross-cultural translation method.A cross-sectional study was conducted in four public hospitals in Thailand between January and March 2023.A total of 260 patients were recruited from outpatient clinics.Convergent and known-group validity were evaluated through hypothesis testing.Construct validity was examined using confirmatory factor analysis.Reliability was assessed using Cronbach’s a coefficient.We also employed the Rasch analysis to validate validity and person reliability.Results:Content validity was high(S-CVI=0.96,I-CVI range:0.80e1.00).Strong convergent validity was observed,with a significant correlation(r=0.761,P<0.001)between the Thai DEMMI and the Parker Mobility Scale(PMS).Known-group validity was evident,demonstrating differences in scores across various patient groups.A confirmatory factor analysis supported the hypothesized factor structure of the Thai DEMMI with good fit indices:χ^(2)(df=4)=5.101,P=0.2771;χ^(2)/df=1.275,RMSEA=0.033;CFI=0.998;TLI=0.995;SRMR=0.016.The Thai DEMMI exhibited high internal consistency(Cronbach’s a=0.88).Rasch analysis revealed good person reliability(0.91)and acceptable information-weighted fit means square statistic(0.73-1.06).However,most items showed good fit based on the outlier-sensitive fit means square statistics(Outfit MNSQ),one exhibited a high Outfit MNSQ value of 29.94,suggesting a potential misfit.Conclusion:This study demonstrated the acceptable validity and reliability of the Thai DEMMI.Further evaluation of its responsiveness to change is still recommended.展开更多
Objective:This study aims to develop an assessment tool for postoperative wound healing in adult patients with benign anal canal and rectal diseases and to validate its reliability and validity.Methods:Based on Levine...Objective:This study aims to develop an assessment tool for postoperative wound healing in adult patients with benign anal canal and rectal diseases and to validate its reliability and validity.Methods:Based on Levine’s Conservation Model as the theoretical framework,an item pool was formed through literature review,and the initial draft of the scale was refined through two rounds of Delphi expert consultation.A total of 200 postoperative patients were selected for item analysis,internal consistency testing,content validity,and structural validity analysis.Results:The final tool comprises four dimensions:energy conservation,structural integrity,personal integrity,and social integrity,with a total of 24 items.It demonstrates good content validity(I-CVI 0.82-1.00,S-CVI/Ave 0.95,S-CVI/UA 0.87)and excellent internal consistency(Cronbach’sαfor the overall scale was 0.934).Exploratory factor analysis revealed a KMO value of 0.931,Bartlett’s test of sphericityχ^(2)=4147.853(p<0.001),and four common factors were extracted,accounting for a cumulative variance contribution rate of 64.345%,indicating ideal structural validity.Conclusion:The results indicate that the assessment tool has good reliability and validity and can systematically evaluate postoperative wound healing,providing a scientific basis for clinical individualized nursing interventions.展开更多
The study aims to determine the validity and reliability of the Wechsler Preschool and Primary Scale of Intelligence–Third Edition(WPPSI-III)scores in a sample of kindergarten and lower primary pupils from Khartoum S...The study aims to determine the validity and reliability of the Wechsler Preschool and Primary Scale of Intelligence–Third Edition(WPPSI-III)scores in a sample of kindergarten and lower primary pupils from Khartoum State,Sudan.It also aims to examine whether test’s factor structure in this sample replicated that of the original WPPSI-III.The study sample consisted of 384 kindergarten and primary school children in Khartoum State(females=50%mean age=4.14,SD=1.37),selected using stratified random sampling across its seven localities:Khartoum,Jebel Awliya,Khartoum Bahri,East Nile,Omdurman,Ombada,Karari.For concurrent validation,the children additionally completed the Goodenough Draw-a-Man Test,and the Colored Progressive Matrices.WPPSI-III scores demonstrated high internal consistency across the subtest items.Confirmatory factor analysis indicators for total,verbal,and performance intelligence were all excellent.The scale also showed weak to strong score stability ranging from 0.25(weak)to 0.88(strong)based on the Spearman-Brown equation,0.25 to 0.75 based on the Guttman split-half method.The Cronbach’s alpha coefficient scores ranged from 0.54 to 0.93.The WPPSI-III and Goodenough Draw-a-Man Test scores concurrent validity scores were poor(0.05)to modest(0.31),and while those with the Colored Progressive Matrices test were poor(r=0.04–0.18).Thesefindings provide evidence to suggest that the WPPSI-III is appropriate for research use with kindergarten and lower primary school students in Khartoum State,Sudan.展开更多
The purpose of this study is to investigate the effectiveness of the“expiration manager”mini program in managing the validity of ward items.The program was used to manage frequently and infrequently used consumables...The purpose of this study is to investigate the effectiveness of the“expiration manager”mini program in managing the validity of ward items.The program was used to manage frequently and infrequently used consumables by setting up an automatic reminder function.The item failure rate and the time required for nurses to conduct counts over 6 months before and after implementation were compared,as well as evaluated system availability using the System Usability scale(SUS).Results showed that after implementing the mini program,both the item failure rate and non-recognition rate significantly decreased(P<0.05),while the inspection pass rate significantly increased(P<0.05),and the monthly inventory time was reduced(P<0.05).The SUS evaluation yielded a total score of 74.38±11.73,with learnability at 80.21±20.27 and availability at 72.92±11.18,all indicating good user acceptance.In conclusion,the“expiration manager”mini program can effectively improve the efficiency of item expiration management,reduce the risk of expiration,and save inspection time,thereby demonstrating high user acceptance and promising potential for wider adoption.展开更多
Background:Investigators from low-,middle-,and high-income countries representing 6 continents contributed to the development of the Global Adolescent and Child Physical Activity Questionnaire(GAC-PAQ).The GAC-PAQ is ...Background:Investigators from low-,middle-,and high-income countries representing 6 continents contributed to the development of the Global Adolescent and Child Physical Activity Questionnaire(GAC-PAQ).The GAC-PAQ is designed to assess physical activity(PA)across all key domains(i.e.,school,chores,work/volunteering,transport,free time,outdoor time).It aimed to address multiple gaps in global PA surveillance(e.g.,omission of important PA domains,insufficient cultural adaptation,underrepresentation of rural areas in questionnaire validation studies).The purpose of this study was to assess the content validity of the GAC-PAQ among PA experts,8-to 17-year-olds,and one of their parents/guardians,and to discuss changes made to the questionnaire based on participants'feedback.Methods:Sixty-two experts in PA measurement and/or surveillance from 24 countries completed an online survey that included both closed-and open-ended questions about the content validity of the GAC-PAQ.The proportion of experts who agreed or strongly agreed with the items was calculated.Child-parent/guardian dyads from 15 countries(n=250;10-40 per country)participated in a structured cognitive interview to assess the clarity of the questions and response options,and they were encouraged to provide suggestions to improve clarity and facilitate completion of the questionnaire.Participating countries are:Aotearoa New Zealand,Brazil,Canada,China,Colombia,Czech Republic,India,Malawi,Mexico,Nepal,Nigeria,Spain,Sweden,Thailand,and the United Arab Emirates.Interviews were conducted in 13 different languages and structured by PA domain.Generic images were included to help participants in answering questions about PA intensity.Results:Expert agreement with the items for each domain exceeded 75%,and their qualitative feedback was used to revise the questionnaire before cognitive interviews.In general,participants found the questionnaire to be comprehensive.Adolescents(12-17 years)found it easier than children(8-11 years)to answer the questions.Several children struggled to answer questions about the duration and intensity of activities and/or concepts related to travel modes,active trips,and organized activities.Many parents/guardians were unsure about the frequency,duration,and intensity of their children's or adolescents'PA at school and/or recommended using more culturally relevant and appropriate images.Some participants misunderstood the concept of activities that“make you stronger”(intended to assess resistance activities)and/or struggled to differentiate between work,volunteering,and chores.Conclusion:Participants'feedback was used to develop a revised,simplified,and culturally adapted GAC-PAQ,which will be pilot-tested in all15 countries in an App that will include country-specific images and narration in local languages.Further research is needed to assess the reliability and validity of the revised GAC-PAQ.展开更多
This paper dwells on the two important factors-reliability and validity in language testing: explain the concepts of reliability and validity; factors influencing reliability and validity; possible ways to achieve hig...This paper dwells on the two important factors-reliability and validity in language testing: explain the concepts of reliability and validity; factors influencing reliability and validity; possible ways to achieve high reliability and validity; comments on the modern language testing tendency on reliability and validity and authors' own ideas.展开更多
To make oral test accurately reflect the actual English spoken ability of candidates and play its role in guiding and promoting the improvement of English learners in the teaching, we must ensure that the design of sc...To make oral test accurately reflect the actual English spoken ability of candidates and play its role in guiding and promoting the improvement of English learners in the teaching, we must ensure that the design of scientific questions, the feasibility and validity of judgments to make an accurate and fair measurement of testers' language ability.展开更多
This paper examines reading comprehensions in 2005 MET. Statistic analysis shows that the MET in 2005 have certain validity,but there are some problems existing in these tests. The paper lists the problems and suggest...This paper examines reading comprehensions in 2005 MET. Statistic analysis shows that the MET in 2005 have certain validity,but there are some problems existing in these tests. The paper lists the problems and suggests some methods. The writer hopes that test-designers can pay much attention to them so as to improve the tests quality.展开更多
Intercultural Communication Competence (ICC), as one of the research fields of intercultural communication, has been given much importance from scholars all around the world. Intercultural sensitivity is one of the th...Intercultural Communication Competence (ICC), as one of the research fields of intercultural communication, has been given much importance from scholars all around the world. Intercultural sensitivity is one of the three dimensions in Dr Chen's ICC model. This research investigates the reliability and validity of Chen and Starosta's Intercultural Sensitivity Scale (ISS) (2000) against Chinese cultural background by using Chinese university students majoring in English as respondents.展开更多
Speaking is the main purpose and most important skill for second language learning.This paper reviews the important validity considerations in designing a test,such as face validity,content validity,on the basis of wh...Speaking is the main purpose and most important skill for second language learning.This paper reviews the important validity considerations in designing a test,such as face validity,content validity,on the basis of which the author analyzes the oral test paper for the postgraduate entrance interview in Shaanxi University of Technology,thus putting forward some methods to improve the test paper.展开更多
Language testing is an important link in language teaching,in this paper,the two important criteria of language test the reliability and validity has carried on the detailed elaboration,in order to a language teacher ...Language testing is an important link in language teaching,in this paper,the two important criteria of language test the reliability and validity has carried on the detailed elaboration,in order to a language teacher proposition and evaluation test more scientific.展开更多
Test plays an important role in our lives in that it can cause backwash towards our teaching and learning.Thus the pur pose of the present study is to examine the quality of an English test paper in a junior middle sc...Test plays an important role in our lives in that it can cause backwash towards our teaching and learning.Thus the pur pose of the present study is to examine the quality of an English test paper in a junior middle school of Urumqi according to the language test theories.Through the data collection and analyses,the results indicate that the test paper is a medium-level test and the reliability is suitable.From the correlation analysis,we can see that each part has a high correlation value to the total,indicat ing that all of them contribute to generate language proficiency.展开更多
In order to assess college students'overall language proficiency,a new format,banked cloze,is included in the revised CET-4.Based on the previous studies on banked cloze in China,this paper discusses on how to con...In order to assess college students'overall language proficiency,a new format,banked cloze,is included in the revised CET-4.Based on the previous studies on banked cloze in China,this paper discusses on how to construct highly valid CET-4 banked cloze in line with the revised CET Syllabus(2006).展开更多
Content validity is an important part of language testing.In this paper,the content validity of the CET-4 fast reading test is analyzed in terms of expected response and text input.The result of final research shows t...Content validity is an important part of language testing.In this paper,the content validity of the CET-4 fast reading test is analyzed in terms of expected response and text input.The result of final research shows that the content validity of the fast reading test is high with some limitations proposed.展开更多
When talked about the language testing, we need to focus on the reliability and validity,the paper explains the importance of reliability and validity and why should we focus on them when we make language testing pape...When talked about the language testing, we need to focus on the reliability and validity,the paper explains the importance of reliability and validity and why should we focus on them when we make language testing paper. Also list various factors that affect reliability and validity.展开更多
Objective The primary objective of this study was to examine the validity and reliability of a semi-quantitative food frequency questionnaire(FFQ) among Chinese children aged 12-17 years. Methods A semi-quantitative 7...Objective The primary objective of this study was to examine the validity and reliability of a semi-quantitative food frequency questionnaire(FFQ) among Chinese children aged 12-17 years. Methods A semi-quantitative 72-food item FFQ was developed for children aged 12-17 years. The reliability and validity of this FFQ were evaluated against 24-h dietary recalls(24 h DRs) to measure the consumption of foods and nutrients. We administered two FFQs and three DRs to children(N = 160) over a period of 1 month to evaluate the reliability and validity. Reliability was examined by quartile agreement and intraclass correlation coefficients(ICCs), and validity was examined by quartile agreement, Bland-Altman plots and correlation with DRs. Results For reliability, the ICCs between the two FFQs ranged from 0.21 to 0.76 for foods and nutrients, and the quartile agreement ranged from 70.0% to 95.0% in the same or adjacent quartiles. Spearman’s correlation coefficients of foods and nutrients between the second FFQ and the 24 h DRs ranged from-0.04 to 0.59. The Bland-Altman plots demonstrated good agreement across the range of intakes among nutrients. The quartile agreement ranged from 50.0% to 100.0%, with infrequent misclassification. Conclusion The FFQ assessment of dietary intakes demonstrated acceptable relative validity and high reproducibility for Chinese children aged 12-17 years.展开更多
文摘This study examines the reliability and validity of AI-generated scoring for continuation writing tasks.By comparing GPT-4 with eight experienced human raters across 21 student responses,it evaluates AI’s consistency,severity,and alignment with human scoring criteria.Results show that AI exhibits high self-consistency and adapts effectively to different scoring roles(e.g.,teacher vs.highstakes rater).However,AI scores were more lenient than human raters and demonstrated divergent evaluation focuses—prioritizing narrative coherence and emotional depth,while teachers emphasized linguistic accuracy and richness of detail.The findings suggest AI’s potential as a supplementary assessment tool,offering rapid,holistic feedback,but highlight the need for further calibration to align with educational standards.Implications include exploring hybrid evaluation models that leverage the strengths of both AI and human raters to achieve more equitable,efficient,and pedagogically meaningful writing assessments.
文摘This study investigates whether and how exemplars can facilitate student engagement in self-assessment tasks.An intact class of 32 undergraduates majoring in Chinese-English translation participated in the study.After some preliminary training,the students performed three translation self-assessment tasks,each involving comparison with authentic exemplars of varying quality and filling out a structured self-assessment report.Our analysis of multiple sources of data reveals the multi-dimensional nature of student engagement in self-assessment activities and the potential for using authentic exemplars of different qualities to enhance students’cognitive and behavioral engagement and mitigate negative emotions in the process.Pedagogical implications for implementing exemplars and self-assessment to promote student engagement and support student learning are discussed.
文摘BACKGROUND During the gradual decline of physical and social functioning associated with end-stage renal disease,patients might experience a premonition of impending death,resulting in a series of pre-mourning grief responses called preparatory grief.The preparatory grief in advanced cancer patients(PGAC)scale is the most widely used preparatory grief scale for patients on hemodialysis in China.AIM To verify the reliability and validity of the PGAC scale in patients on hemodialysis.METHODS In total,327 patients undergoing regular hemodialysis in the blood purification center of three grade-A tertiary hospitals in Guangdong and Guizhou provinces were selected by convenience sampling.The assessment was administered using the general information questionnaire and the Chinese version of PGAC.SPSS 25.0 and Amos 24.0 were used for item analysis,confirmatory factor analysis(CFA),convergent validity,and internal consistency reliability estimation.RESULTS In the modified Chinese version of PGAC,7 dimensions covering 27 total items were retained.CFA revealed a good fit of the factor model(chi-square degree of freedom=2.056,standardized root mean square residual=0.0479,root mean square error of approximation=0.0570,GFI=0.872,AGFI=0.841,IFI=0.931,CFI=0.930,TLI=0.919).The factor loadings of the items ranged 0.503-0.884.The composite reliability ranged 0.664-0.914,and the average variance extracted ranged 0.366-0.747.Cronbach’sαof the scale was 0.945,and Cronbach’sαfor various dimensions ranged 0.662-0.914.CONCLUSION The modified PGAC has good reliability and validity,and it can effectively measure preparatory grief in patients on hemodialysis.
基金funded by Science Research Project of Hebei Education Department(BJ2025238)Humanities and Social Science Research Project of Hebei Normal University(S24YX002)Humanities and Social Science Research Foundation of Hebei Normal University(S22B019).
文摘This study aimed to determine the reliability,validity and measurement invariance of scores from the Difficulties in Emotion Regulation Scale-8 in Chinese context.A total of 1114 Chinese adolescents were participants in three phases:N=424 for the initial DERS-8 measure completion;N=586 the DERS-8,General Anxiety Disorder Scale,Depression Scale and Emotion Regulation Scale completion,with an interval of one month.Then an additional 104 adolescents also completed DERS-8,General Anxiety Disorder Scale,Depression Scale and Emotion Regulation Scale.Both exploratory and confirmatory factor analyses confirmed the one-factor model of the scale,and the fitness indicators wereχ^(2)/df=4.05,RMSEA=0.07,CFI=0.98,and TLI=0.97.Each item of the DERS-8 had good discrimination.The internal consistency reliability coefficient,split-half reliability coefficient and test-retest reliability coefficient of the scale scores were 0.90,0.87 and 0.66,respectively.The findings suggest the Chinese version of the DERS-8 is a reliable measure of difficulty of emotion regulation in Chinese adolescents.
文摘Objective:This study aimed to translate the de Morton Mobility Index(DEMMI)into Thai and assess its measurement properties.Methods:The de Morton Mobility Index(DEMMI)was translated into Thai using a cross-cultural translation method.A cross-sectional study was conducted in four public hospitals in Thailand between January and March 2023.A total of 260 patients were recruited from outpatient clinics.Convergent and known-group validity were evaluated through hypothesis testing.Construct validity was examined using confirmatory factor analysis.Reliability was assessed using Cronbach’s a coefficient.We also employed the Rasch analysis to validate validity and person reliability.Results:Content validity was high(S-CVI=0.96,I-CVI range:0.80e1.00).Strong convergent validity was observed,with a significant correlation(r=0.761,P<0.001)between the Thai DEMMI and the Parker Mobility Scale(PMS).Known-group validity was evident,demonstrating differences in scores across various patient groups.A confirmatory factor analysis supported the hypothesized factor structure of the Thai DEMMI with good fit indices:χ^(2)(df=4)=5.101,P=0.2771;χ^(2)/df=1.275,RMSEA=0.033;CFI=0.998;TLI=0.995;SRMR=0.016.The Thai DEMMI exhibited high internal consistency(Cronbach’s a=0.88).Rasch analysis revealed good person reliability(0.91)and acceptable information-weighted fit means square statistic(0.73-1.06).However,most items showed good fit based on the outlier-sensitive fit means square statistics(Outfit MNSQ),one exhibited a high Outfit MNSQ value of 29.94,suggesting a potential misfit.Conclusion:This study demonstrated the acceptable validity and reliability of the Thai DEMMI.Further evaluation of its responsiveness to change is still recommended.
基金Sichuan Provincial Nursing Research Project of the Sichuan Nursing Association in 2023(Project No.:H23028)。
文摘Objective:This study aims to develop an assessment tool for postoperative wound healing in adult patients with benign anal canal and rectal diseases and to validate its reliability and validity.Methods:Based on Levine’s Conservation Model as the theoretical framework,an item pool was formed through literature review,and the initial draft of the scale was refined through two rounds of Delphi expert consultation.A total of 200 postoperative patients were selected for item analysis,internal consistency testing,content validity,and structural validity analysis.Results:The final tool comprises four dimensions:energy conservation,structural integrity,personal integrity,and social integrity,with a total of 24 items.It demonstrates good content validity(I-CVI 0.82-1.00,S-CVI/Ave 0.95,S-CVI/UA 0.87)and excellent internal consistency(Cronbach’sαfor the overall scale was 0.934).Exploratory factor analysis revealed a KMO value of 0.931,Bartlett’s test of sphericityχ^(2)=4147.853(p<0.001),and four common factors were extracted,accounting for a cumulative variance contribution rate of 64.345%,indicating ideal structural validity.Conclusion:The results indicate that the assessment tool has good reliability and validity and can systematically evaluate postoperative wound healing,providing a scientific basis for clinical individualized nursing interventions.
基金The authors extend their appreciation to the Ongoing Research Funding Program,number(ORF2025R705),King Saud University,Riyadh,Saudi Arabia,for funding this work.
文摘The study aims to determine the validity and reliability of the Wechsler Preschool and Primary Scale of Intelligence–Third Edition(WPPSI-III)scores in a sample of kindergarten and lower primary pupils from Khartoum State,Sudan.It also aims to examine whether test’s factor structure in this sample replicated that of the original WPPSI-III.The study sample consisted of 384 kindergarten and primary school children in Khartoum State(females=50%mean age=4.14,SD=1.37),selected using stratified random sampling across its seven localities:Khartoum,Jebel Awliya,Khartoum Bahri,East Nile,Omdurman,Ombada,Karari.For concurrent validation,the children additionally completed the Goodenough Draw-a-Man Test,and the Colored Progressive Matrices.WPPSI-III scores demonstrated high internal consistency across the subtest items.Confirmatory factor analysis indicators for total,verbal,and performance intelligence were all excellent.The scale also showed weak to strong score stability ranging from 0.25(weak)to 0.88(strong)based on the Spearman-Brown equation,0.25 to 0.75 based on the Guttman split-half method.The Cronbach’s alpha coefficient scores ranged from 0.54 to 0.93.The WPPSI-III and Goodenough Draw-a-Man Test scores concurrent validity scores were poor(0.05)to modest(0.31),and while those with the Colored Progressive Matrices test were poor(r=0.04–0.18).Thesefindings provide evidence to suggest that the WPPSI-III is appropriate for research use with kindergarten and lower primary school students in Khartoum State,Sudan.
基金The First Affiliated Hospital of Shaoyang University,China(Project No.:23FY1015)。
文摘The purpose of this study is to investigate the effectiveness of the“expiration manager”mini program in managing the validity of ward items.The program was used to manage frequently and infrequently used consumables by setting up an automatic reminder function.The item failure rate and the time required for nurses to conduct counts over 6 months before and after implementation were compared,as well as evaluated system availability using the System Usability scale(SUS).Results showed that after implementing the mini program,both the item failure rate and non-recognition rate significantly decreased(P<0.05),while the inspection pass rate significantly increased(P<0.05),and the monthly inventory time was reduced(P<0.05).The SUS evaluation yielded a total score of 74.38±11.73,with learnability at 80.21±20.27 and availability at 72.92±11.18,all indicating good user acceptance.In conclusion,the“expiration manager”mini program can effectively improve the efficiency of item expiration management,reduce the risk of expiration,and save inspection time,thereby demonstrating high user acceptance and promising potential for wider adoption.
基金supported by a Project Grant(Grant No.PJT183705)an Early Career Investigator Prize(Grant No.ECP 184184)from the Canadian Institutes of Health Research+7 种基金a Prentice Institute Research Affiliate Fund Grant from the Prentice Institute for Global Population and Economy(Grant No.G00004116)a Te Herenga Waka Victoria University of Wellington Division of Science Health Engineering Architecture and Design Innovation Faculty Strategic Research Grant(Grant No.FSRG-SHEADI-10724)The Thailand Physical Activity Knowledge Development Centre(TPAK)/Thai Health Promotion Foundation provided funding for the cognitive interviews and pilot study in Thailand(Grant No.66-P1-0473)The University Pablo de Olavide provided a scholarship for 2 undergraduate students working on the project(codes PPI2207 and PPI2308)In the Czech Republicthe study was supported by Palacky University IGA(Grant No.IGA_FTK_2023_017)supported by the Division of Intramural Research at the National Institute on Minority Health and Health Disparities of the National Institutes of Healthsupported by the Key Project of the National Philosophy and Social Science Foundation of China(23&ZD197)。
文摘Background:Investigators from low-,middle-,and high-income countries representing 6 continents contributed to the development of the Global Adolescent and Child Physical Activity Questionnaire(GAC-PAQ).The GAC-PAQ is designed to assess physical activity(PA)across all key domains(i.e.,school,chores,work/volunteering,transport,free time,outdoor time).It aimed to address multiple gaps in global PA surveillance(e.g.,omission of important PA domains,insufficient cultural adaptation,underrepresentation of rural areas in questionnaire validation studies).The purpose of this study was to assess the content validity of the GAC-PAQ among PA experts,8-to 17-year-olds,and one of their parents/guardians,and to discuss changes made to the questionnaire based on participants'feedback.Methods:Sixty-two experts in PA measurement and/or surveillance from 24 countries completed an online survey that included both closed-and open-ended questions about the content validity of the GAC-PAQ.The proportion of experts who agreed or strongly agreed with the items was calculated.Child-parent/guardian dyads from 15 countries(n=250;10-40 per country)participated in a structured cognitive interview to assess the clarity of the questions and response options,and they were encouraged to provide suggestions to improve clarity and facilitate completion of the questionnaire.Participating countries are:Aotearoa New Zealand,Brazil,Canada,China,Colombia,Czech Republic,India,Malawi,Mexico,Nepal,Nigeria,Spain,Sweden,Thailand,and the United Arab Emirates.Interviews were conducted in 13 different languages and structured by PA domain.Generic images were included to help participants in answering questions about PA intensity.Results:Expert agreement with the items for each domain exceeded 75%,and their qualitative feedback was used to revise the questionnaire before cognitive interviews.In general,participants found the questionnaire to be comprehensive.Adolescents(12-17 years)found it easier than children(8-11 years)to answer the questions.Several children struggled to answer questions about the duration and intensity of activities and/or concepts related to travel modes,active trips,and organized activities.Many parents/guardians were unsure about the frequency,duration,and intensity of their children's or adolescents'PA at school and/or recommended using more culturally relevant and appropriate images.Some participants misunderstood the concept of activities that“make you stronger”(intended to assess resistance activities)and/or struggled to differentiate between work,volunteering,and chores.Conclusion:Participants'feedback was used to develop a revised,simplified,and culturally adapted GAC-PAQ,which will be pilot-tested in all15 countries in an App that will include country-specific images and narration in local languages.Further research is needed to assess the reliability and validity of the revised GAC-PAQ.
文摘This paper dwells on the two important factors-reliability and validity in language testing: explain the concepts of reliability and validity; factors influencing reliability and validity; possible ways to achieve high reliability and validity; comments on the modern language testing tendency on reliability and validity and authors' own ideas.
文摘To make oral test accurately reflect the actual English spoken ability of candidates and play its role in guiding and promoting the improvement of English learners in the teaching, we must ensure that the design of scientific questions, the feasibility and validity of judgments to make an accurate and fair measurement of testers' language ability.
文摘This paper examines reading comprehensions in 2005 MET. Statistic analysis shows that the MET in 2005 have certain validity,but there are some problems existing in these tests. The paper lists the problems and suggests some methods. The writer hopes that test-designers can pay much attention to them so as to improve the tests quality.
文摘Intercultural Communication Competence (ICC), as one of the research fields of intercultural communication, has been given much importance from scholars all around the world. Intercultural sensitivity is one of the three dimensions in Dr Chen's ICC model. This research investigates the reliability and validity of Chen and Starosta's Intercultural Sensitivity Scale (ISS) (2000) against Chinese cultural background by using Chinese university students majoring in English as respondents.
文摘Speaking is the main purpose and most important skill for second language learning.This paper reviews the important validity considerations in designing a test,such as face validity,content validity,on the basis of which the author analyzes the oral test paper for the postgraduate entrance interview in Shaanxi University of Technology,thus putting forward some methods to improve the test paper.
文摘Language testing is an important link in language teaching,in this paper,the two important criteria of language test the reliability and validity has carried on the detailed elaboration,in order to a language teacher proposition and evaluation test more scientific.
文摘Test plays an important role in our lives in that it can cause backwash towards our teaching and learning.Thus the pur pose of the present study is to examine the quality of an English test paper in a junior middle school of Urumqi according to the language test theories.Through the data collection and analyses,the results indicate that the test paper is a medium-level test and the reliability is suitable.From the correlation analysis,we can see that each part has a high correlation value to the total,indicat ing that all of them contribute to generate language proficiency.
文摘In order to assess college students'overall language proficiency,a new format,banked cloze,is included in the revised CET-4.Based on the previous studies on banked cloze in China,this paper discusses on how to construct highly valid CET-4 banked cloze in line with the revised CET Syllabus(2006).
文摘Content validity is an important part of language testing.In this paper,the content validity of the CET-4 fast reading test is analyzed in terms of expected response and text input.The result of final research shows that the content validity of the fast reading test is high with some limitations proposed.
文摘When talked about the language testing, we need to focus on the reliability and validity,the paper explains the importance of reliability and validity and why should we focus on them when we make language testing paper. Also list various factors that affect reliability and validity.
基金provided by the Science&Technology Basic Resources Investigation Program of China [grant number:2017FY101101 and 2017FY101103]
文摘Objective The primary objective of this study was to examine the validity and reliability of a semi-quantitative food frequency questionnaire(FFQ) among Chinese children aged 12-17 years. Methods A semi-quantitative 72-food item FFQ was developed for children aged 12-17 years. The reliability and validity of this FFQ were evaluated against 24-h dietary recalls(24 h DRs) to measure the consumption of foods and nutrients. We administered two FFQs and three DRs to children(N = 160) over a period of 1 month to evaluate the reliability and validity. Reliability was examined by quartile agreement and intraclass correlation coefficients(ICCs), and validity was examined by quartile agreement, Bland-Altman plots and correlation with DRs. Results For reliability, the ICCs between the two FFQs ranged from 0.21 to 0.76 for foods and nutrients, and the quartile agreement ranged from 70.0% to 95.0% in the same or adjacent quartiles. Spearman’s correlation coefficients of foods and nutrients between the second FFQ and the 24 h DRs ranged from-0.04 to 0.59. The Bland-Altman plots demonstrated good agreement across the range of intakes among nutrients. The quartile agreement ranged from 50.0% to 100.0%, with infrequent misclassification. Conclusion The FFQ assessment of dietary intakes demonstrated acceptable relative validity and high reproducibility for Chinese children aged 12-17 years.