Due to the interaction between lexical tones and intonation in Chinese speech melody, the contour patterns of Chinese IPs (intonational phrases) have long been an intriguing yet complicated subject. Within the AM (...Due to the interaction between lexical tones and intonation in Chinese speech melody, the contour patterns of Chinese IPs (intonational phrases) have long been an intriguing yet complicated subject. Within the AM (Autosegmental-Metrical) framework, this paper makes an empirical study on the subject based on a small-scale database of 22 standard Chinese news stories. A five-degree D value system is proposed for data normalization so that various contour patterns of IPs can be identified and compared despite complications in pitch register and pitch range. The major findings are: Chinese IPs mainly comprise 2, 3, or 4 ips (intermediate phrases). And the contour patterns of Chinese IPs suggest that the statement intonation in Chinese is primarily indicated by the near-bottom Dmin value of the last ip. Meanwhile, when the primary indicator is not typical, the falling trend of the last part of the top-line can serve as a compensatory perceptual clue.展开更多
The strategy of modeling the control mechanism for generating F0 contour of speech signal is studied in this paper. Based on some dynamic characteristics of vocal cord strain, the complex laryngeal mechanism relative ...The strategy of modeling the control mechanism for generating F0 contour of speech signal is studied in this paper. Based on some dynamic characteristics of vocal cord strain, the complex laryngeal mechanism relative to local F0 regulation is simplified to be a feasible physical model. Furthermore, a model function is deduced as the control mechanism for the generation process of local rise-fall patterns, and two kinds of basic feature patterns result with so called rise-fall commands defined by model parameters. on the logarithmic scale of F0 versus time the local characteristics of an F0 contour are approximated by the sum of these patterns generated by appropriate commands. The experimenial results in analyzing and synthesizing the F0 contours of spoken Chinese utterances indicate that the observed F0 contours can be always approximated well by the model, and a good correlation exists between some model parameters and the transition duration of local F0 rising or falling. The model lays a foundation for Chinese F0 contour synthesis by rule.展开更多
文摘Due to the interaction between lexical tones and intonation in Chinese speech melody, the contour patterns of Chinese IPs (intonational phrases) have long been an intriguing yet complicated subject. Within the AM (Autosegmental-Metrical) framework, this paper makes an empirical study on the subject based on a small-scale database of 22 standard Chinese news stories. A five-degree D value system is proposed for data normalization so that various contour patterns of IPs can be identified and compared despite complications in pitch register and pitch range. The major findings are: Chinese IPs mainly comprise 2, 3, or 4 ips (intermediate phrases). And the contour patterns of Chinese IPs suggest that the statement intonation in Chinese is primarily indicated by the near-bottom Dmin value of the last ip. Meanwhile, when the primary indicator is not typical, the falling trend of the last part of the top-line can serve as a compensatory perceptual clue.
文摘The strategy of modeling the control mechanism for generating F0 contour of speech signal is studied in this paper. Based on some dynamic characteristics of vocal cord strain, the complex laryngeal mechanism relative to local F0 regulation is simplified to be a feasible physical model. Furthermore, a model function is deduced as the control mechanism for the generation process of local rise-fall patterns, and two kinds of basic feature patterns result with so called rise-fall commands defined by model parameters. on the logarithmic scale of F0 versus time the local characteristics of an F0 contour are approximated by the sum of these patterns generated by appropriate commands. The experimenial results in analyzing and synthesizing the F0 contours of spoken Chinese utterances indicate that the observed F0 contours can be always approximated well by the model, and a good correlation exists between some model parameters and the transition duration of local F0 rising or falling. The model lays a foundation for Chinese F0 contour synthesis by rule.