IDENTIFYING MORPHOLOGICAL PROPERTIES OF RUSSIAN WORDS WITH THE ONTOLOGY-BASED TEXT ANALYSER

Ksenia Balysheva1*, Elena Kartashova2,
Konstantin Kondratiev3, Aleksey Mikheev4
1 Assit. Prof. Dr., Mari State University, Russia, qsuaka@mail.ru
2 Prof. Dr., Mari State University, Russia, elena.karta77@mail.ru
3 Telephone Systems Ltd, Russia, kk@digt.ru
4 Ph. D. Student, Mari State University, Russia, scurra.42@yandex.ru
*Corresponding Author

Abstract
This article presents the first stage of an ongoing effort of creating the application Ontology-Based Text Analyser (OTA) aimed at automatic identifying semantics and grammatical properties of widely used Russian words in connected texts. At present this application identifies only morphological properties of Russian words. In this application all morphological properties of content words and grammatical function words are revealed on the basis of a query to the Ontology of Russian Grammatical Forms (OntoRuGrammaForm) that we earlier set up. In OntoRuGrammaForm we used LexInfo which represents morphological properties of words in the ontological format as a scheme for data organising. To set up OntoRuGrammaForm the existing LexInfo ontology was extended with missing and refined grammatical categories. In OntoRuGrammaForm the linkage of semantics with morphological properties is implemented with OntoLex which makes it possible to link grammatical word forms with lemmas and lemmas with concepts in knowledge area ontologies. The automatic process of word morphology identification is illustrated with a connected text of the informative type taken from the open news online-portal. In this news text the system of morphological properties of words is identified with OntoRuGrammaForm. This application also displays lemmas and transcription of separate words in a connected text. The created application (OTA) can be used as an innovative methodical tool in teaching Russian to foreign students to develop skills of identifying morphological characteristics of words in news texts. At present this application is available on the Web in the open access and can be used for analysing morphological properties of widely used Russian words in a connected text.

Keywords: Automatic identifying, Morphological properties, Ontology, Ontology-based application, Text analyser, LexInfo, OntoLex



FULL TEXT PDF

CITATION: Abstracts & Proceedings of SOCIOINT 2017- 4th International Conference on Education, Social Sciences and Humanities, 10-12 July 2017- Dubai, UAE

ISBN: 978-605-82433-1-6