Automatic Language Identification Based on Posterior Probability on Articulatory Classes

Takagi KAZUYUKI

戻る

会議発表プレゼンテーション

Automatic Language Identification Based on Posterior Probability on Articulatory Classes

Takagi KAZUYUKI

電子情報通信学会、日本音響学会

電子情報通信学会技術報告,第16回音声言語シンポジウム (東京工業大学すずかけ台キャンパス)

16/12/2014

抄録

Extraction of features from input speech that are effective in distinguishing the language is a key issue for language identification system. We use posterior probabilities on articulatory classes as features for language identification. Posterior probability on each articulatory class is calculated by GMMs. Each GMM is trained with MFCC data of speech segments labeled with the phonemes or acoustic events that correspond to the articulatory class. The posterior probability values of the articulatory classes are concatenated to form an articulatory-feature-class-posterior-probability (AFCPP) vector at each analysis frame. These vectors are then quantized to yield VQ code sequence, which is used as the training data for a n-gram language model. Language identification is performed by selecting the n-gram model that yields the highest likelihood for the AFCPP vector sequence of the input utterance. Language identification experiment between Japanese and English by the present method showed identification rate of 97.1%.

メトリック

1 レコードビュー

詳細

タイトル: Automatic Language Identification Based on Posterior Probability on Articulatory Classes
作成者 – 役職なし: Takagi KAZUYUKI
会議: 電子情報通信学会技術報告,第16回音声言語シンポジウム (東京工業大学すずかけ台キャンパス)
出版者: 電子情報通信学会、日本音響学会
ID: 991002554490107421
組織: The University of Electro-Communications
言語: 日本語
資料タイプ: 会議発表プレゼンテーション
リソースのサブタイプ: rm_presentations: Poster Presentation