Multi-label sentiment analysis on 100 languages with dynamic weighting for label imbalance

buir.contributor.authorYılmaz, Selim Fırat
buir.contributor.authorKaynak, Ergün Batuhan
buir.contributor.authorKoç, Aykut
buir.contributor.authorDibeklioğlu, Hamdi
buir.contributor.authorKozat, Süleyman Serdar
buir.contributor.orcidYılmaz, Selim Fırat|0000-0002-0486-7731
buir.contributor.orcidKaynak, Ergün Batuhan|0000-0002-3249-3343
buir.contributor.orcidKoç, Aykut|0000-0002-6348-2663
buir.contributor.orcidDibeklioğlu, Hamdi|0000-0003-0851-7808
buir.contributor.orcidKozat, Süleyman Serdar|0000-0002-6488-3848
dc.citation.epage343en_US
dc.citation.issueNumber1
dc.citation.spage331
dc.citation.volumeNumber34
dc.contributor.authorYılmaz, Selim Fırat
dc.contributor.authorKaynak, Ergün Batuhan
dc.contributor.authorKoç, Aykut
dc.contributor.authorDibeklioğlu, Hamdi
dc.contributor.authorKozat, Süleyman Serdar
dc.date.accessioned2024-03-19T05:20:23Z
dc.date.available2024-03-19T05:20:23Z
dc.date.issued2023-01-01
dc.departmentDepartment of Electrical and Electronics Engineering
dc.departmentDepartment of Computer Engineering
dc.departmentNational Magnetic Resonance Research Center (UMRAM)
dc.description.abstractWe investigate cross-lingual sentiment analysis, which has attracted significant attention due to its applications in various areas including market research, politics, and social sciences. In particular, we introduce a sentiment analysis framework in a multi-label setting, as it obeys Plutchik’s wheel of emotions. We introduce a novel dynamic weighting method that balances the contribution of each class during training, unlike previous static weighting methods that assign fixed weights based on class frequency. Moreover, we adapt the focal loss, which favors harder instances, from the single-label object recognition literature to our multi-label setting. Furthermore, we derive a method to choose optimal class-specific thresholds that maximize the macro-F1 score in linear time complexity. Through an extensive set of experiments, we show that our method obtains state-of-the-art performance in seven of nine metrics across three different languages using a single model, compared with common baselines and the best-performing methods in the SemEval competition. We publicly share the code for our model, which can perform sentiment analysis in 100 languages, to facilitate further research.
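The abstract summarizes three components: a multi-label focal loss, dynamic class weighting, and class-specific decision thresholds that maximize macro-F1. The sketch below is a minimal NumPy illustration of those ideas only, not the paper's implementation: the dynamic weight update rule (mean per-class error, renormalized) and the sort-and-scan threshold search are hypothetical stand-ins chosen for illustration, whereas the paper derives its own weighting scheme and a linear-time threshold procedure.

# Minimal sketch, assuming: (1) the multi-label focal loss is the per-label
# binary focal loss scaled by per-class weights; (2) "dynamic" weights are
# refreshed during training from current per-class errors (hypothetical rule);
# (3) each class threshold is chosen independently, which suffices for
# macro-F1 because each class's F1 depends only on its own threshold.
import numpy as np

def multilabel_focal_loss(probs, targets, class_weights, gamma=2.0):
    """Mean binary focal loss over an (N, C) batch of label probabilities."""
    eps = 1e-8
    p_t = np.where(targets == 1, probs, 1.0 - probs)   # probability of the true outcome
    focal = -class_weights * (1.0 - p_t) ** gamma * np.log(p_t + eps)
    return focal.mean()

def dynamic_class_weights(probs, targets):
    """Hypothetical dynamic weighting: weight each class by its current mean
    error, renormalized so the weights average to 1. Unlike static
    frequency-based weights, these change as training progresses."""
    error = np.abs(targets - probs).mean(axis=0)        # (C,) per-class error
    return error / (error.mean() + 1e-8)

def per_class_thresholds(probs, targets):
    """Pick each class's F1-maximizing threshold with an O(N log N)
    sort-and-scan sweep, a simple stand-in for the linear-time procedure
    derived in the paper."""
    n, c = probs.shape
    thresholds = np.zeros(c)
    for k in range(c):
        order = np.argsort(-probs[:, k])                # descending scores
        sorted_p = probs[order, k]
        sorted_y = targets[order, k]
        tp = np.cumsum(sorted_y)                        # positives among the top i+1
        fp = np.cumsum(1.0 - sorted_y)
        fn = sorted_y.sum() - tp
        f1 = 2 * tp / (2 * tp + fp + fn + 1e-8)
        thresholds[k] = sorted_p[np.argmax(f1)]         # predict 1 when prob >= threshold
    return thresholds

# Toy usage on random scores for 4 emotion labels.
rng = np.random.default_rng(0)
probs = rng.uniform(size=(32, 4))
targets = (rng.uniform(size=(32, 4)) < 0.3).astype(float)
weights = dynamic_class_weights(probs, targets)
print("loss:", multilabel_focal_loss(probs, targets, weights))
print("thresholds:", per_class_thresholds(probs, targets))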
dc.description.provenanceMade available in DSpace on 2024-03-19T05:20:23Z (GMT). No. of bitstreams: 1 Multi-label_sentiment_analysis_on_100_languages_with_dynamic_weighting_for_label_imbalance.pdf: 1518693 bytes, checksum: 183cfd89fab38fd144a502cd9390a229 (MD5) Previous issue date: 2023-01-01en
dc.identifier.doi10.1109/TNNLS.2021.3094304en_US
dc.identifier.eissn2162-2388en_US
dc.identifier.issn2162-237Xen_US
dc.identifier.urihttps://hdl.handle.net/11693/114920en_US
dc.language.isoEnglishen_US
dc.publisherInstitute of Electrical and Electronics Engineers Inc.en_US
dc.relation.isversionofhttps://dx.doi.org/10.1109/TNNLS.2021.3094304
dc.source.titleIEEE Transactions on Neural Networks and Learning Systems
dc.subjectCross-lingual
dc.subjectLabel imbalance
dc.subjectMacro-f1 maximization
dc.subjectMulti-label
dc.subjectNatural language processing (NLP)
dc.subjectSentiment analysis
dc.subjectSocial media
dc.titleMulti-label sentiment analysis on 100 languages with dynamic weighting for label imbalance
dc.typeArticle

Files

Original bundle

Name:
Multi-label_sentiment_analysis_on_100_languages_with_dynamic_weighting_for_label_imbalance.pdf
Size:
1.45 MB
Format:
Adobe Portable Document Format

License bundle

Name:
license.txt
Size:
2.01 KB
Format:
Item-specific license agreed to upon submission
Description: