
Lexical and language modeling for Russian large vocabulary continuous speech recognition

vts_9981_15223.pdf (6.730Mb)
164 pages
Published
2016-03-14
Authors
Zablotskiy, Sergey
Dissertation


Faculties
Fakultät für Ingenieurwissenschaften und Informatik
Abstract
This thesis presents novel approaches for improving Russian large vocabulary continuous speech recognition. Russian has several peculiarities that pose serious challenges for speech recognition, and this work tackles the most severe of them. First, the phonetic transcription of a Russian word depends strongly on the position of its stressed vowels, yet there are no rules for locating them. Two different methods are therefore suggested to overcome this problem. Second, the non-trivial Russian grammar greatly complicates text normalization, which is essential for language modeling. During normalization, the majority of numerals and abbreviations must be declined according to the proper grammatical case, which is very challenging given the very loose word order of Russian sentences. Since, to the best of our knowledge, no solution with satisfactory functionality was available, we designed and implemented an advanced tool for Russian text normalization from scratch and made it publicly available. Third, Russian is a highly inflected language with a complex mechanism of word formation, so a very large lexicon is required to recognize fluent spoken utterances in any broad domain. The main part of our investigation is devoted to this problem, as it is the most challenging and extensive one. Hybrid sub-word lexical and language models were used, and several important sub-word modeling parameters were investigated: the unit type and the optimal number and size of units. Moreover, three algorithms for joining small elements were proposed and evaluated. One of the most important proposals of this thesis is the use of double-sided marking for sub-word units. The majority of the suggested approaches are theoretically applicable not only to Russian but also to all highly inflected synthetic languages, for example other Slavic languages.
Date created
2015
Subject headings
[GND]: Automatische Spracherkennung | Russisch
[LCSH]: Dictionaries | Russian language; Data processing | Speech perception
[Free subject headings]: Language modeling | Russian speech recognition | Text normalization
[DDC subject group]: DDC 004 / Data processing & computer science
License
Standard
https://oparu.uni-ulm.de/xmlui/license_v3

DOI & citation

Please use this identifier to cite or link to this item: http://dx.doi.org/10.18725/OPARU-3262

Zablotskiy, Sergey (2016): Lexical and language modeling for Russian large vocabulary continuous speech recognition. Open Access Repositorium der Universität Ulm und Technischen Hochschule Ulm. Dissertation. http://dx.doi.org/10.18725/OPARU-3262