In China,Tibetan is usually divided into three major dialects:the Am-do,Khams and Lhasa dialects.The Am-do dialect evolved from ancient Tibetan and is a local variant of modern Tibetan.Although this dialect has its ow...In China,Tibetan is usually divided into three major dialects:the Am-do,Khams and Lhasa dialects.The Am-do dialect evolved from ancient Tibetan and is a local variant of modern Tibetan.Although this dialect has its own specific historical and social conditions and development,there have been different degrees of communication with other ethnic groups,but all the abovementioned dialects developed from the same language:Tibetan.This paper uses the particularity of Tibetan suffixes in pronunciation and proposes a lexicon for the Am-do language,which optimizes the problems existing in previous research.Audio data of the Am-do dialect are expanded by data augmentation technology combining noise and reverberation,and the morphological characteristics and characteristics of the Tibetan language are further considered.According to the particularity of Tibetan grammar,grammatical features are used to optimize grammatical relationships and are combined with a language model,and the Am-do dialect is scored and rescored.Experimental results show that compared with the baseline,our proposed new lexicon and data augmentation technology yields a relative increase of approximately 3%in character error rates(CERs)and a relative increase of 3%-19%in the recognition rate of acoustic models and language models.展开更多
基金This work was supported by the Regional Innovation Cooperation Project of Sichuan Province(Grant No.22QYCX0082).
文摘In China,Tibetan is usually divided into three major dialects:the Am-do,Khams and Lhasa dialects.The Am-do dialect evolved from ancient Tibetan and is a local variant of modern Tibetan.Although this dialect has its own specific historical and social conditions and development,there have been different degrees of communication with other ethnic groups,but all the abovementioned dialects developed from the same language:Tibetan.This paper uses the particularity of Tibetan suffixes in pronunciation and proposes a lexicon for the Am-do language,which optimizes the problems existing in previous research.Audio data of the Am-do dialect are expanded by data augmentation technology combining noise and reverberation,and the morphological characteristics and characteristics of the Tibetan language are further considered.According to the particularity of Tibetan grammar,grammatical features are used to optimize grammatical relationships and are combined with a language model,and the Am-do dialect is scored and rescored.Experimental results show that compared with the baseline,our proposed new lexicon and data augmentation technology yields a relative increase of approximately 3%in character error rates(CERs)and a relative increase of 3%-19%in the recognition rate of acoustic models and language models.