Show simple item record

AuthorZablotskiy, Sergeydc.contributor.author
Date of accession2016-03-15T10:40:13Zdc.date.accessioned
Available in OPARU since2016-03-15T10:40:13Zdc.date.available
Year of creation2015dc.date.created
AbstractThis thesis outlines novel approaches for improving Russian large vocabulary continuous speech recognition. There are several peculiarities of Russian, which cause serious challenges for speech recognition process. The most severe problems are tackled in the scope of this work. First of all, phonetic transcriptions of Russian words depend strongly on the position of emphasized vowels. However, there are no rules for their localization. Therefore, two different methods were suggested to overcome this problem. Secondly, the non-trivial Russian grammar sophisticates tremendously the process of text normalization essential for language modeling. While being normalized, the majority of numerals and abbreviations should be declined according to a proper grammatical case. However, due to a very loose word order in Russian sentences it becomes a very challenging task. Since no solutions with satisfactory functionality were available to the best of our knowledge, we designed and implemented an advanced tool for Russian text normalization from scratch and made it publicly available. Thirdly, Russian is a highly inflected language with a complex mechanism of word formation. Therefore, an abundant lexicon is required in order to recognize fluent spoken utterances of any broad domain. The main part of our investigation is devoted to this problem as the most challenging and extensive one. Hybrid sub-word lexical and language models were utilized. Several important sub-word modeling parameters were under investigation: unit type, their optimal amount and size. Moreover, three algorithms for the joining of small elements were proposed and evaluated. One of the most important proposals of this thesis is the employment of double-sided marking for sub-word units. The majority of the suggested approaches are theoretically applicable not only for Russian, but also for all synthetic languages with highly inflected nature, for example, other Slavic languages.dc.description.abstract
Languageendc.language.iso
PublisherUniversität Ulmdc.publisher
LicenseStandarddc.rights
Link to license texthttps://oparu.uni-ulm.de/xmlui/license_v3dc.rights.uri
KeywordLanguage modelingdc.subject
KeywordRussian speech recognitiondc.subject
KeywordText normalizationdc.subject
Dewey Decimal GroupDDC 004 / Data processing & computer sciencedc.subject.ddc
LCSHDictionariesdc.subject.lcsh
LCSHRussian language; Data processingdc.subject.lcsh
LCSHSpeech perceptiondc.subject.lcsh
TitleLexical and language modeling for Russian large vocabulary continuous speech recognitiondc.title
Resource typeDissertationdc.type
DOIhttp://dx.doi.org/10.18725/OPARU-3262dc.identifier.doi
PPN1655704958dc.identifier.ppn
URNhttp://nbn-resolving.de/urn:nbn:de:bsz:289-vts-99811dc.identifier.urn
GNDAutomatische Spracherkennungdc.subject.gnd
GNDRussischdc.subject.gnd
FacultyFakultät für Ingenieurwissenschaften und Informatikuulm.affiliationGeneral
Date of activation2016-03-14T11:29:08Zuulm.freischaltungVTS
Peer reviewneinuulm.peerReview
Shelfmark print versionW: W-H 14.636uulm.shelfmark
DCMI TypeTextuulm.typeDCMI
VTS ID9981uulm.vtsID
CategoryPublikationenuulm.category
Bibliographyuulmuulm.bibliographie


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record