
Novel methods for text preprocessing and classification

vts_9647_14616.pdf (2.219Mb)
219 pages
Published
2015-08-18
Authors
Gasanova, Tatiana
Dissertation


Faculties
Fakultät für Ingenieurwissenschaften und Informatik
Abstract
Written text is a form of communication that represents language (speech) using signs and symbols. For a given language, text depends on the same structures as speech (vocabulary, grammar, and semantics) and on a structured system of signs and symbols (a formal alphabet). Written text has always been an instrument for exchanging information, recording history, spreading knowledge, maintaining financial accounts, and forming legal systems. With the development of computers and the Internet, the amount of textual information in digital form has grown dramatically. There is an increasing need to process this information automatically for a variety of text processing tasks, such as information retrieval, machine translation, question answering, topic categorization, topic segmentation, and sentiment analysis. Many important text processing tasks fall into the field of text classification. This thesis addresses the development and evaluation of novel text preprocessing methods that combine supervised and unsupervised learning models in order to reduce the dimensionality of the feature space and improve classification performance. Metaheuristic approaches for generating Support Vector Machines and Artificial Neural Networks and optimizing their parameters are modified, applied to text classification, and compared with other state-of-the-art methods using different text representations.
Created / Completed
2015
Keywords
[GND]: Automatische Klassifikation (automatic classification)
[LCSH]: Text processing (Computer science)
[Free keywords]: Text classification | Text preprocessing
[DDC subject group]: DDC 000 / Computer science, information & general works
License
Standard
https://oparu.uni-ulm.de/xmlui/license_v3


DOI & Citation Template

Please use this identifier for citations & links: http://dx.doi.org/10.18725/OPARU-3242

Gasanova, Tatiana (2015): Novel methods for text preprocessing and classification. Open Access Repositorium der Universität Ulm und Technischen Hochschule Ulm. Dissertation. http://dx.doi.org/10.18725/OPARU-3242