Nagoya Institute of Technology

Font size
Site search

Topics

Home > Topics > 2016/11/18(Fri) FRIIS Seminar "Statistical Voice Conversion and Its Application to Augmented Speech Production"

2016/11/18(Fri) FRIIS Seminar "Statistical Voice Conversion and Its Application to Augmented Speech Production"

Category:Event|Publishing : November 14, 2016


WHEN 2016/11/18 (Fri) 10:30-11:30
WHERE Bld.4 in Nagoya Institute of Technology
Seminar info

Tomoki Toda, Professor, Nagoya University

"Statistical Voice Conversion and Its Application to Augmented Speech Production"

→ Seminar flyer (pdf)

For Graduate students, faculty members, and anyone interested
Reservation

NOT required (free of charge)

*For inquiries, please contact Prof. Tokuda 052-735-5479.

   Voice conversion is a technique for modifying speech acoustics, converting non-/para- linguistic information to any form we want while preserving the linguistic content. One of the most popular approaches to voice conversion is based on statistical processing, which is capable of extracting complex conversion functions from a parallel speech data set consisting of utterance pairs of the source and the target voices. Although this technique was originally studied in the context of speaker conversion, which converts the voice of a certain speaker to sound like that of another speaker, it has great potential to achieve various applications beyond speaker conversion. This talk will briefly review the recent progress of statistical voice conversion techniques, and will highlight a technique to achieve a lower conversion delay. Finally this talk will show some potential applications of real-time statistical voice conversion to augment our speech production, making it possible to enhance human-to-human speech communication beyond several constraints.

 


To page top