Speech technologies for under-resourced languages of the world.


2015. №2, 117-135

Alexey A. Karpov а, b, @, Vasilisa O. Verkhodanova a
a St. Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences (SPIIRAS), St. Petersburg, 199178, Russia;
b ITMO University, St. Petersburg, 197101, Russia;
@ karpov@iias.spb.su

Abstract:

Over the past decade, computer speech processing of under-resourced and minority languages has experienced a significant progress. In this paper, we present an analytical review of existing problems and approaches in the field of speech recognition for many spoken languages lacking speech and text resources, including languages of the Russian Federation. The definition and characteristics of under-resourced languages and challenges connected with their automatic processing are presented, as well as projects and investigations dealing with analysis and preservation of under-resourced languages of the world are described.