An automatic non-English sentiment lexicon builder using unannotated corpus

Kaity, Mohammed and Balakrishnan, Vimala (2019) An automatic non-English sentiment lexicon builder using unannotated corpus. The Journal of Supercomputing, 75 (4). pp. 2243-2268. ISSN 0920-8542, DOI https://doi.org/10.1007/s11227-019-02755-3.

Full text not available from this repository.
Official URL: https://doi.org/10.1007/s11227-019-02755-3

Abstract

Sentiment lexicons are widely available for English, whereas for many other languages such resources are scarce. Current sentiment analysis techniques focus mainly on English, and other languages are neglected for lack of resources. To overcome the challenges of building non-English lexicons, we propose a language-independent method that automatically builds non-English sentiment lexicons from currently available English lexicons and an unannotated corpus in the target language. The proposed method automatically recognizes and extracts new polarity words from the unannotated corpus, starting from initial seed lexicons developed by translating three reliable English lexicons. Experimental results on the test datasets confirmed that the developed non-English sentiment lexicon could significantly enhance the performance of non-English sentiment classification compared with other methods and lexicons. The developed Arabic lexicon outperformed other commonly used methods for developing non-English lexicons, with an F-measure of 0.74. The approach adopted in this study was shown to be language independent and can be applied to other languages as well. This paper also contributes to understanding the approaches to developing sentiment resources. © 2019, Springer Science+Business Media, LLC, part of Springer Nature.
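
The abstract describes expanding a translated seed lexicon with new polarity words drawn from an unannotated corpus, but it does not state the extraction rule. The sketch below is therefore only an illustration of the general idea, not the authors' method: it assumes a simple sentence-level co-occurrence heuristic, and the function name `expand_lexicon`, the parameters `min_count` and `threshold`, and the toy English tokens (standing in for target-language text) are all hypothetical.

```python
from collections import defaultdict


def expand_lexicon(seed, sentences, min_count=2, threshold=0.5):
    """Infer polarity for non-seed words from the seed words they co-occur with.

    seed      -- dict mapping word -> +1 (positive) or -1 (negative)
    sentences -- iterable of token lists from an unannotated corpus
    Assumption: sentence-level co-occurrence is a stand-in heuristic here;
    the paper's actual extraction rule is not given in the abstract.
    """
    totals = defaultdict(float)  # sum of seed polarities seen alongside each word
    counts = defaultdict(int)    # number of seed co-occurrences per word

    for tokens in sentences:
        seed_polarities = [seed[t] for t in tokens if t in seed]
        if not seed_polarities:
            continue  # a sentence without seed words carries no signal
        for word in set(tokens):
            if word in seed:
                continue
            totals[word] += sum(seed_polarities)
            counts[word] += len(seed_polarities)

    new_entries = {}
    for word, n in counts.items():
        score = totals[word] / n  # mean seed polarity, in [-1, 1]
        if n >= min_count and abs(score) >= threshold:
            new_entries[word] = 1 if score > 0 else -1
    return new_entries


# Toy data: a two-word seed lexicon and four tokenized sentences.
seed = {"good": 1, "bad": -1}
sentences = [
    ["good", "tasty", "food"],
    ["tasty", "good", "meal"],
    ["bad", "bland", "food"],
    ["bland", "bad", "soup"],
]
demo = expand_lexicon(seed, sentences)
```

In this toy run, "tasty" and "bland" are added because each co-occurs twice with seed words of a consistent polarity, while "food" is rejected for mixed evidence and "meal" and "soup" for too few co-occurrences.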

Item Type: Article
Funders: UNSPECIFIED
Uncontrolled Keywords: Building resources; Natural language processing; Sentiment analysis; Sentiment lexicon; Text analysis
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Computer Science & Information Technology
Depositing User: Ms. Juhaida Abd Rahim
Date Deposited: 06 Apr 2020 15:27
Last Modified: 06 Apr 2020 15:27
URI: http://eprints.um.edu.my/id/eprint/24153
