Lexical simplification via single-word generation (cited by: 1)


Abstract: Lexical simplification (LS) aims to simplify a sentence by replacing complex words with simpler ones without changing the meaning of the sentence, which can make text easier to understand for non-native speakers and children. Traditional LS methods use linguistic databases (e.g., WordNet) [1] or word embedding models [2] to extract synonyms or highly similar words for the complex word, and then rank them by their appropriateness in context. Recently, BERT-based LS methods [3,4] entirely or partially mask the complex word in the original sentence, and then feed the sentence into the pretrained language model BERT [5] to obtain the highest-probability tokens for the masked position as substitute candidates. By making full use of the context of the complex word, these methods have made remarkable progress in generating substitutes and effectively alleviate the shortcomings of traditional methods.
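The mask-and-predict step described for BERT-based LS methods can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: `toy_predictor` is a hypothetical stand-in for a masked language model with made-up probabilities, `substitute_candidates` and `MASK` are names chosen for this sketch, and only the "entirely mask" variant is shown.

```python
from typing import Callable, Dict, List

MASK = "[MASK]"

# Hypothetical interface: takes a token list containing one MASK and returns
# a probability for each candidate token at the masked position.
Predictor = Callable[[List[str]], Dict[str, float]]


def toy_predictor(masked_tokens: List[str]) -> Dict[str, float]:
    """Made-up scores for illustration; a real system would query BERT."""
    return {"cat": 0.05, "sat": 0.40, "perched": 0.30, "rested": 0.20, "slept": 0.05}


def substitute_candidates(
    tokens: List[str], index: int, predict: Predictor, top_k: int = 3
) -> List[str]:
    """Mask the complex word and return the top-k predicted replacements."""
    complex_word = tokens[index]
    # Replace the complex word with the mask token, keeping the context intact.
    masked = tokens[:index] + [MASK] + tokens[index + 1:]
    scores = predict(masked)
    # Rank candidate tokens by predicted probability, highest first.
    ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
    # Drop the complex word itself, since it is not a simplification.
    kept = [word for word, _ in ranked if word.lower() != complex_word.lower()]
    return kept[:top_k]


sentence = "the cat perched on the mat".split()
print(substitute_candidates(sentence, 2, toy_predictor, top_k=2))
```

In practice the candidates would still need to be ranked for simplicity and meaning preservation before one is chosen; that step is outside this sketch.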

Authors: Jipeng QIANG, Yang LI, Yun LI, Yunhao YUAN, Yi ZHU

Affiliation: Department of Computer Science

Source: Frontiers of Computer Science (indexed in SCIE, EI, CSCD), 2023, Issue 6, pp. 163-165 (3 pages). Chinese journal title: 中国计算机科学前沿（英文版）

Funding: Supported by the National Natural Science Foundation of China (Grant Nos. 62076217 and 61906060) and the Blue Project of Yangzhou University.

Keywords: TOKEN, utilize, SPEAKERS

Classification code: TP391 [Automation and Computer Technology: Computer Application Technology]


