TY - JOUR
T1 - An automatic broadcast system for a weather report radio program
AU - Segi, Hiroyuki
AU - Takou, Reiko
AU - Seiyama, Nobumasa
AU - Takagi, Tohru
AU - Uematsu, Yuko
AU - Saito, Hideo
AU - Ozawa, Shinji
PY - 2013
Y1 - 2013
N2 - Here we describe a speech-synthesis method using templates that can generate recording-sentence sets for speech databases and produce natural sounding synthesized speech. Applying this method to the Japan Broadcasting Corporation (NHK) weather report radio program reduced the size of the recording-sentence set required to just a fraction of that needed by a comparable method. After integrating the recording voice of the generated recording-sentence set into the speech database, speech was produced by a voice synthesizer using templates. In a paired-comparison test, 66% of the speech samples synthesized by our system using templates were preferred to those produced by a conventional voice synthesizer. In an evaluation test using a five-point mean opinion score (MOS) scale, the speech samples synthesized by our system scored 4.97, whereas the maximum score for commercially available voice synthesizers was 3.09. In addition, we developed an automatic broadcast system for the weather report program using the speech-synthesis method and speech-rate converter. The system was evaluated using real weather data for more than 1 year, and exhibited sufficient stability and synthesized speech quality for broadcast purposes.
AB - Here we describe a speech-synthesis method using templates that can generate recording-sentence sets for speech databases and produce natural sounding synthesized speech. Applying this method to the Japan Broadcasting Corporation (NHK) weather report radio program reduced the size of the recording-sentence set required to just a fraction of that needed by a comparable method. After integrating the recording voice of the generated recording-sentence set into the speech database, speech was produced by a voice synthesizer using templates. In a paired-comparison test, 66% of the speech samples synthesized by our system using templates were preferred to those produced by a conventional voice synthesizer. In an evaluation test using a five-point mean opinion score (MOS) scale, the speech samples synthesized by our system scored 4.97, whereas the maximum score for commercially available voice synthesizers was 3.09. In addition, we developed an automatic broadcast system for the weather report program using the speech-synthesis method and speech-rate converter. The system was evaluated using real weather data for more than 1 year, and exhibited sufficient stability and synthesized speech quality for broadcast purposes.
KW - Recording-sentence set
KW - speech-rate conversion
KW - templates
KW - voice synthesizer
UR - http://www.scopus.com/inward/record.url?scp=84883463897&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84883463897&partnerID=8YFLogxK
U2 - 10.1109/TBC.2013.2272406
DO - 10.1109/TBC.2013.2272406
M3 - Article
AN - SCOPUS:84883463897
SN - 0018-9316
VL - 59
SP - 548
EP - 555
JO - IEEE Transactions on Broadcasting
JF - IEEE Transactions on Broadcasting
IS - 3
M1 - 6572852
ER -