摘要
眼动追踪语料库是研究人类文本加工机制的重要资源。本研究以东盟国家越南、泰国和老挝的中文学习者为对象,构建了中文学习者眼动语料库,系统采集了学习者在自然阅读过程中的眼动数据。该语料库涵盖200名中文学习者和70名中文母语者阅读60篇真实文本及15篇HSK4~6级阅读理解文本的眼动记录,数据总量超过129万个词元。本文详细介绍了中文学习者眼动语料库的设计原则、实验流程,并对多项关键眼动指标进行了信效度检验、描述性统计与建模分析,结果验证了该语料库的可靠性与有效性。该语料库的建立有助于深化第二语言加工机制与双语理论的研究,并为国际中文教育的区域化与国别化发展提供数据支撑。
Eye-tracking corpora represent valuable resources for studying human text processing mechanisms.This study targets Chinese learners from ASEAN countries,namely Vietnam,Thailand and Laos,and builds the Chinese Learner Eye-tracking Corpus(CECO),which systematically collects eye-movement data during natural reading processes.The corpus incorporates eye-tracking records from 200 Chinese learners and 70 native Chinese speakers reading 60 authentic texts and 15HSK Level 4~6 reading comprehension texts,totaling over 1.29 million tokens.This paper elaborates on the design principles and experimental procedures of CECO,and conducts reliability analysis,descriptive statistics,and modeling analyses on several key eye-movement metrics,thereby verifying the reliability and validity of the corpus.The building of CECO facilitates in-depth investigation into second language processing mechanisms and bilingual theories,providing data support for regional and country-specific research in international Chinese education.
作者
鹿士义
劳耀琳
张秀婷
由文华
孙瑞
LU Shiyi;LAO Yaolin;ZHANG Xiuting;YOU Wenhua;SUN Rui
出处
《语言文字应用》
北大核心
2025年第3期76-88,共13页
Applied Linguistics
基金
广西壮族自治区级人文社科重点研究基地项目“面向东盟国家汉语学习者动态加工数据库建设”(XDWX 202302-04)
广西大学项目库项目“东盟汉语学习者认知加工数据库建设”(2024-2025)的资助。