|
|
|
|
|
ABSTRACT:
Speech synthesis is a process of building machinery that can generate human-like speech from any text input to imitate human speakers. The objective of a text to speech system is to convert an arbitrary text into its corresponding spoken waveform. Text processing and speech generation are two main components of a text to speech system. To build a natural sounding speech synthesis system, it is essential that text processing component produce an appropriate sequence of phonemic units. Speech synthesizers in the current state of art are based on Data Driven Concatenative synthesis. Required sequence of basic units is synthesized by selecting appropriate unit instances from huge database consisting of multiple instances of units with varying prosodic properties. Performance of speech synthesis systems differ depending on the size of the speech units used in the process. A system can use phone, diphone or syllables as basic unit. High performance is achieved by the system which is domain specific and basic units are words or sentences in the database. The quality of a speech synthesizer is judged by its closeness to the natural human voice and understandability. This paper describes an approach to build a Telugu TTS system using concatenative synthesis method with syllable as a basic unit of concatenation.
|
|
|
|
STATISTICS
|
|
Click on # to view
|
|
Citations
|
|
0
|
|
References
|
|
0
|
|
Comments
|
|
0
|
|
Quality
|
|
0/0.00
|
|
Interest
|
|
0/0.00
|
|
View(er)s
|
|
1/120
|
|
|
|
|
|
|
| Prev |
Next |
|