Issue |
ITM Web Conf.
Volume 10, 2017
2017 Seminar on Systems Analysis
|
|
---|---|---|
Article Number | 02001 | |
Number of page(s) | 4 | |
Section | Intelligent Systems | |
DOI | https://doi.org/10.1051/itmconf/20171002001 | |
Published online | 15 March 2017 |
Automated Determination of the Type of Genre and Stylistic Coloring of Russian Texts
1 Institute of Computational Technologies of SB RAS, Lavrentiev av., 6, 630090, Novosibirsk, Russia
2 Novosibirsk State University, Pirogova st., 1, 630090, Novosibirsk, Russia
* Corresponding author: bar@ict.nsc.ru
In this paper we propose the algorithm of automated definition of the genre type and semantic characteristics of poetic texts in Russian. We formulated the approaches to the construction of a joint (“two-dimensional”) classifier of genre types and stylistic colouring of poetic texts, based on the definition of interdependence of the type of genre and stylistic colouring of the text. On the basis of these approaches the principles of formation of the training samples for the algorithms for the definition of styles and genre types were analyzed. The computational experiments with a corpus of texts of the Lyceum lyrics of A.S.Pushkin were implemented, which showed good results in determining the stylistic colouring of poetic texts and sufficient results in determining the genres. The proposed algorithms can be used for automation of the complex analysis of Russian poetic texts, significantly facilitating the work of the expert in determining their styles and genres by providing appropriate recommendations.
© The Authors, published by EDP Sciences, 2017
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.