Estrategias de generación y reducción de variantes de pronunciación en sistemas de reconocimiento automático de habla: consideraciones arquitecturales

  1. Macías Guarasa, Javier
  2. Montero, Juan Manuel
  3. Ferreiros López, Javier
  4. Córdoba, Ricardo de
  5. Romeral, José David
Journal:
Procesamiento del lenguaje natural

ISSN: 1135-5948

Publication date: 2003

Issue: 31

Pages: 91-98

Type: Article

Other publications in: Procesamiento del lenguaje natural

Abstract

In large-vocabulary speech recognition systems, it is crucial to accurately model the allophonic variations found in real-world tasks. In this paper we describe a study on the use of data-driven pronunciation variants, considering generation and reduction strategies as well as their impact on system performance. The described techniques are supported by experimental evaluation on two systems that differ radically with respect to their discrimination power (based on integrated and non-integrated architectures, designed to work as hypothesis and verification modules, respectively), so that their relative performance can be discussed as a function of the increase in dictionary size. The most relevant results show that, for the non-integrated architecture, the inclusion rate can be significantly improved even for very large increases in dictionary size (up to 250%). In contrast, increasing the number of pronunciation variants has a clearly negative effect when applied to the integrated system.