Development of two data bases with comments in Bulgarian language and application of supervised learning approaches on them for comparative sentiment analysis. A brief overview

Authors

  • Daniela Ivanova Petrova, Technical University of Varna
  • Violeta Bojikova, Technical University of Varna

DOI:

https://doi.org/10.29114/ajtuv.vol6.iss2.261

Keywords:

Automatic Sentiment Analysis, opinion mining, supervised learning approaches, Bulgarian language

Abstract

This paper gives an overview of the authors' work to date and summarizes the results of, and reflections on, sentiment analysis performed on user comments in Bulgarian from two different fields. The starting point of the work was the development of two databases of user reviews and their preprocessing, so that they can serve as a usable source of information for different types of analysis projects. The preprocessing work resulted in a revised data preprocessing algorithm tailored to the Bulgarian language. The second part of the project was carried out in two steps: sentiment analysis of each database using supervised learning approaches, followed by a comparative sentiment analysis of the two databases after their additional examination.

Published

2022-12-31

How to Cite

Petrova, D. I., & Bojikova, V. (2022). Development of two data bases with comments in Bulgarian language and application of supervised learning approaches on them for comparative sentiment analysis. A brief overview. ANNUAL JOURNAL OF TECHNICAL UNIVERSITY OF VARNA, BULGARIA, 6(2), 57–62. https://doi.org/10.29114/ajtuv.vol6.iss2.261

Section

INFORMATION TECHNOLOGIES, COMMUNICATION AND COMPUTER EQUIPMENT
