Novel methods for text preprocessing and classification

Download
vts_9647_14616.pdf (2.219Mb)
219 pages
Publication date
2015-08-18
DOI
10.18725/OPARU-3242

Dissertation

Authors
Gasanova, Tatiana
Faculties
Fakultät für Ingenieurwissenschaften und Informatik
License
Standard
https://oparu.uni-ulm.de/xmlui/license_v3
Abstract
Written text is a form of communication that represents language (speech) using signs and symbols. For a given language, text depends on the same structures as speech (vocabulary, grammar, and semantics) and on a structured system of signs and symbols (a formal alphabet). Written text has always been an instrument for exchanging information, recording history, spreading knowledge, maintaining financial accounts, and forming legal systems. With the development of computers and the Internet, the amount of textual information in digital form has grown dramatically. There is an increasing need to process this information automatically for a variety of text processing tasks, such as information retrieval, machine translation, question answering, topic categorization and topic segmentation, and sentiment analysis. Many important text processing tasks fall into the field of text classification. This thesis addresses the development and evaluation of novel text preprocessing methods that combine supervised and unsupervised learning models in order to reduce the dimensionality of the feature space and improve classification performance. Metaheuristic approaches for generating Support Vector Machines and Artificial Neural Networks and optimizing their parameters are modified, applied to text classification, and compared with other state-of-the-art methods using different text representations.
Created / Completed
2015
Controlled subject headings
Automatische Klassifikation [GND]
Text processing (Computer science) [LCSH]
Keywords
Text classification; Text preprocessing
DDC subject group
DDC 000 / Computer science, information & general works

Suggested citation

Gasanova, Tatiana (2015): Novel methods for text preprocessing and classification. Open Access Repositorium der Universität Ulm. Dissertation. http://dx.doi.org/10.18725/OPARU-3242
