Comparison between random and daily speech database in the speech visualization

Nur Syafikah Binti Samsudin, Kazunori Mano

研究成果: Conference contribution

抄録

This paper presents a new technique using animated texts as the speech features' visualization medium for checking and detecting language learners' pronunciation. The proposed visualization tool will transform learners' speech features such as pitch, tempo or rhythm into animated texts form, and the mispronounce parts can be located by comparing them with the correct sample. In our previous experiments, Japanese language learners gave positive feedback on the animated texts designs as their speech visualization tool. By practicing this tool, learners found that the mispronounce parts in their speech can be detected and confirmed easily. However, in order to have more practical feedback and response from the learners, besides using the plain speech data set, which only express random speech contents, the daily conversation speech data set is proposed as the data sample. In this paper, the comparison between both database samples for determining the proposed visualization tool's approachability was observed. Evaluation experiments results showed that participants gave positive responses to the animated texts visualization tool and were able to understand speech features in the visualized texts form better by using the daily conversation as the speech sample.

本文言語English
ホスト出版物のタイトル2017 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2017
出版社Institute of Electrical and Electronics Engineers Inc.
ページ3135-3140
ページ数6
2017-January
ISBN(電子版)9781538616451
DOI
出版ステータスPublished - 2017 11月 27
イベント2017 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2017 - Banff, Canada
継続期間: 2017 10月 52017 10月 8

Other

Other2017 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2017
国/地域Canada
CityBanff
Period17/10/517/10/8

ASJC Scopus subject areas

  • 人工知能
  • コンピュータ サイエンスの応用
  • 人間とコンピュータの相互作用
  • 制御と最適化

フィンガープリント

「Comparison between random and daily speech database in the speech visualization」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル