Research Achievements
Journal paper - rm_published_papers: Scientific Journal
Development of a Capsule Type Leak Detection Device for Pipeline
Published 06/2018
Water, Land and Environmental Engineering, 86, 6, 31 - 36
Journal paper - rm_misc: Others
Automatic Language Identification Based on Posterior Probability on Articulatory Classes
Published 08/12/2014
IPSJ SIG Notes, 2014, 28, 1 - 5
Extracting features from input speech that effectively distinguish languages is a key issue for language identification systems. We use posterior probabilities on articulatory classes as features for language identification. The posterior probability of each articulatory class is calculated with GMMs. Each GMM is trained with MFCC data of speech segments labeled with the phonemes or acoustic events that correspond to the articulatory class. The posterior probability values of the articulatory classes are concatenated to form an articulatory-feature-class-posterior-probability (AFCPP) vector at each analysis frame. These vectors are then quantized to yield a VQ code sequence, which is used as the training data for an n-gram language model. Language identification is performed by selecting the n-gram model that yields the highest likelihood for the AFCPP vector sequence of the input utterance. A language identification experiment between Japanese and English with the present method showed an identification rate of 97.1%.
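The final selection step described in this abstract can be illustrated with a minimal sketch. It assumes the VQ code sequences are already available as lists of integers, and it uses a bigram model with add-one smoothing; the n-gram order, the smoothing, the function names, and the toy data are all assumptions made for illustration, not details taken from the paper.

```python
import math
from collections import Counter


def train_bigram(sequences, vocab_size):
    """Build an add-one-smoothed bigram model from VQ code sequences.

    Returns a function log_prob(prev, cur). The bigram order and the
    smoothing scheme are illustrative assumptions.
    """
    pair_counts = Counter()
    context_counts = Counter()
    for seq in sequences:
        for prev, cur in zip(seq, seq[1:]):
            pair_counts[(prev, cur)] += 1
            context_counts[prev] += 1

    def log_prob(prev, cur):
        numerator = pair_counts[(prev, cur)] + 1
        denominator = context_counts[prev] + vocab_size
        return math.log(numerator / denominator)

    return log_prob


def sequence_log_likelihood(log_prob, seq):
    """Sum the bigram log-probabilities over one code sequence."""
    return sum(log_prob(prev, cur) for prev, cur in zip(seq, seq[1:]))


def identify_language(models, seq):
    """Pick the language whose model gives the highest likelihood."""
    return max(models, key=lambda lang: sequence_log_likelihood(models[lang], seq))


# Toy VQ code sequences (hypothetical data, 3 codebook entries).
training = {
    "japanese": [[0, 1, 0, 1, 0, 1], [1, 0, 1, 0, 1, 0]],
    "english": [[0, 0, 2, 2, 0, 0], [2, 2, 0, 0, 2, 2]],
}
models = {lang: train_bigram(seqs, vocab_size=3) for lang, seqs in training.items()}
print(identify_language(models, [0, 1, 0, 1]))
```

In this sketch each language gets its own model, and an input utterance is scored against every model before the best-scoring language is returned.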
Journal paper - rm_misc: Others
Published 19/12/2011
IEICE technical report. Speech, 111, 365, 45 - 48
Language identification is the technique of identifying the language spoken by an unknown speaker. In this paper, phonotactic information was used as the feature for language identification. Obtaining phonotactic information requires extracting the phoneme sequence from speech data, and a template-based non-negative matrix factorization was applied for this purpose. The extracted phoneme sequence was then analyzed to yield n-gram models, which may reflect the order in which the phoneme-like categories of speech occur in the language. Language identification was carried out by a support vector machine with the n-gram as the feature vector. The identification performance changes with the number of spectrum templates and the order of the n-gram; the best performance of 98.6% was obtained when the number of spectrum templates was 13 and the order of the n-gram was 3.
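As a rough illustration of using an n-gram as a feature vector, the sketch below turns a sequence of template indices into a fixed-length vector of relative n-gram frequencies. The function name, the frequency normalization, and the toy sequence are assumptions; the abstract does not say how the vector was built, and the classifier itself is omitted.

```python
from collections import Counter
from itertools import product


def ngram_feature_vector(seq, num_templates, order=3):
    """Map a template-index sequence to relative n-gram frequencies.

    The vector has num_templates ** order entries, one per possible
    n-gram, in lexicographic order. Normalizing by the total count is
    an illustrative assumption.
    """
    counts = Counter(
        tuple(seq[i:i + order]) for i in range(len(seq) - order + 1)
    )
    total = sum(counts.values())
    return [
        counts[gram] / total if total else 0.0
        for gram in product(range(num_templates), repeat=order)
    ]


# Hypothetical sequence over 3 templates; with 13 templates and order 3
# the vector would have 13 ** 3 = 2197 entries.
vector = ngram_feature_vector([0, 1, 2, 0, 1, 2], num_templates=3, order=3)
print(len(vector), vector[5])
```

A vector of this kind could then be passed to any standard classifier, one vector per utterance.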
Journal paper - rm_published_papers: Scientific Journal
Referential reconstruction in complex frequency domain for word recognition under noisy environments
Published 09/2008
The Journal of the Acoustical Society of Japan, 64, 9, 533 - 544
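This paper describes a method for restoring the original speech from a single-channel signal in which noise is superimposed on speech, thereby improving automatic speech recognition performance. The authors have previously proposed a method that prepares a small speech database in advance, extracts from the database the frames that are similar to the input frame under some measure, and obtains the output by referring to the extracted frames. This paper further reports improvements to the similarity measure and the output method. The improvements consist of two points: retaining the phase information after the short-time Fourier transform as is, and applying a binary mask to it. To evaluate performance, word recognition experiments were conducted using instrumental music noise and environmental noise, and the word accuracy improved at low SNRs.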
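The idea of applying a binary mask while keeping the STFT phase can be sketched for a single frame as follows. The masking rule (keep a bin when a reference magnitude reaches a fraction of the noisy magnitude), the threshold value, and all names are assumptions for illustration; the abstract does not specify how the mask is computed.

```python
def apply_binary_mask(noisy_spectrum, reference_magnitude, threshold=0.5):
    """Apply a binary mask to one frame of complex STFT coefficients.

    A bin is kept unchanged, including its phase, when the reference
    magnitude is at least threshold * |noisy coefficient|; otherwise it
    is set to zero. This rule is a hypothetical stand-in.
    """
    masked = []
    for coeff, ref in zip(noisy_spectrum, reference_magnitude):
        keep = ref >= threshold * abs(coeff)
        masked.append(coeff if keep else 0j)
    return masked


# Hypothetical frame with three frequency bins.
noisy_frame = [1 + 1j, 0.2j, 3 + 0j]
reference = [1.4, 0.0, 1.0]
masked_frame = apply_binary_mask(noisy_frame, reference)
print(masked_frame)
```

Because kept bins are copied as complex values, their phase is exactly the phase of the noisy input.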
Journal paper - rm_published_papers: Scientific Journal
Published 06/2008
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E91D, 6, 1774 - 1782
Journal paper - rm_published_papers: International Conference Proceedings
Published 05/2006
Proceedings of International Conference of Speech Prosody 2006, PS5 - 20
Journal paper - rm_published_papers: Scientific Journal
Sentence compression using statistical information about dependency path length
Published 2006
TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 4188, 127 - 134
Journal paper - rm_published_papers: Scientific Journal
Japanese dependency structure analysis using information about multiple pauses and F-0
Published 01/2006
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E89D, 1, 298 - 304
Journal paper - rm_published_papers: Scientific Journal
Published 11/2004
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E87D, 11, 2453 - 2459
Journal paper - rm_published_papers: International Conference Proceedings
Published 10/2004
Proceedings of ICSLP2004 (8th International Conference on Spoken Language Processing), 3, 1749 - 1752