Multi-label sentiment analysis on 100 languages with dynamic weighting for label imbalance

buir.contributor.authorYılmaz, Selim Fırat
buir.contributor.authorKaynak, Ergün Batuhan
buir.contributor.authorKoç, Aykut
buir.contributor.authorDibeklioğlu, Hamdi
buir.contributor.authorKozat, Süleyman Serdar
buir.contributor.orcidYılmaz, Selim Fırat|0000-0002-0486-7731
buir.contributor.orcidKaynak, Ergün Batuhan|0000-0002-3249-3343
buir.contributor.orcidKoç, Aykut|0000-0002-6348-2663
buir.contributor.orcidDibeklioğlu, Hamdi|0000-0003-0851-7808
buir.contributor.orcidKozat, Süleyman Serdar|0000-0002-6488-3848
dc.citation.epage343en_US
dc.citation.issueNumber1
dc.citation.spage331 en_US
dc.citation.volumeNumber34
dc.contributor.authorYılmaz, Selim Fırat
dc.contributor.authorKaynak, Ergün Batuhan
dc.contributor.authorKoç, Aykut
dc.contributor.authorDibeklioğlu, Hamdi
dc.contributor.authorKozat, Süleyman Serdar
dc.date.accessioned2022-03-04T08:53:25Z
dc.date.available2022-03-04T08:53:25Z
dc.date.issued2021-07-19
dc.departmentDepartment of Computer Engineeringen_US
dc.departmentDepartment of Electrical and Electronics Engineeringen_US
dc.departmentNational Magnetic Resonance Research Center (UMRAM)en_US
dc.description.abstractWe investigate cross-lingual sentiment analysis, which has attracted significant attention due to its applications in various areas including market research, politics, and social sciences. In particular, we introduce a sentiment analysis framework in multi-label setting as it obeys Plutchik's wheel of emotions. We introduce a novel dynamic weighting method that balances the contribution from each class during training, unlike previous static weighting methods that assign non-changing weights based on their class frequency. Moreover, we adapt the focal loss that favors harder instances from single-label object recognition literature to our multi-label setting. Furthermore, we derive a method to choose optimal class-specific thresholds that maximize the macro-f1 score in linear time complexity. Through an extensive set of experiments, we show that our method obtains the state-of-the-art performance in seven of nine metrics in three different languages using a single model compared with the common baselines and the best performing methods in the SemEval competition. We publicly share our code for our model, which can perform sentiment analysis in 100 languages, to facilitate further research.en_US
dc.description.provenanceSubmitted by Dilan Ayverdi (dilan.ayverdi@bilkent.edu.tr) on 2022-03-04T08:53:24Z No. of bitstreams: 1 Multi-label_sentiment_analysis_on_100_languages_with_dynamic_weighting_for_label_imbalance.pdf: 1714127 bytes, checksum: a19fa2dd30febece0e044a2d58228885 (MD5)en
dc.description.provenanceMade available in DSpace on 2022-03-04T08:53:25Z (GMT). No. of bitstreams: 1 Multi-label_sentiment_analysis_on_100_languages_with_dynamic_weighting_for_label_imbalance.pdf: 1714127 bytes, checksum: a19fa2dd30febece0e044a2d58228885 (MD5) Previous issue date: 2021-07-19en
dc.identifier.doi10.1109/TNNLS.2021.3094304en_US
dc.identifier.eissn2162-2388en_US
dc.identifier.issn2162-237Xen_US
dc.identifier.urihttp://hdl.handle.net/11693/77683en_US
dc.language.isoEnglishen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.relation.isversionofhttps://doi.org/10.1109/TNNLS.2021.3094304en_US
dc.source.titleIEEE Transactions on Neural Networks and Learning Systemsen_US
dc.subjectCross-lingualen_US
dc.subjectLabel imbalanceen_US
dc.subjectMacro-f1 maximizationen_US
dc.subjectMulti-labelen_US
dc.subjectNatural language processing (NLP)en_US
dc.subjectSentiment analysisen_US
dc.subjectSocial mediaen_US
dc.titleMulti-label sentiment analysis on 100 languages with dynamic weighting for label imbalanceen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Multi-label_sentiment_analysis_on_100_languages_with_dynamic_weighting_for_label_imbalance.pdf
Size:
1.44 MB
Format:
Adobe Portable Document Format