Automatic construction of sememe knowledge bases from machine readable dictionaries

buir.contributor.authorBattal, Ömer Musa
buir.contributor.authorKoç, Aykut
buir.contributor.orcidKoç, Aykut|0000-0002-6348-2663
dc.citation.epage1035
dc.citation.spage1023
dc.citation.volumeNumber32
dc.contributor.authorBattal, Ömer Musa
dc.contributor.authorKoç, Aykut
dc.date.accessioned2025-02-28T12:02:52Z
dc.date.available2025-02-28T12:02:52Z
dc.date.issued2023-12-28
dc.departmentDepartment of Electrical and Electronics Engineering
dc.departmentNational Magnetic Resonance Research Center (UMRAM)
dc.description.abstractSememes are the minimum semantic units of natural languages. Words annotated with sememes are organized into Sememe Knowledge Bases (SKBs). SKBs are successfully applied to various high-level language processing tasks as external knowledge bases. However, existing SKBs are manually or semi-manually constructed by linguistic experts over long periods, inhibiting their widespread utilization, updating, and expansion. To automatically construct an SKB from Machine-Readable Dictionaries (MRDs), which are readily available, we propose MRD2SKB as an automatic SKB generation approach. Well-established MRDs exist, and their construction is much simpler than SKBs. Therefore, the proposed MRD2SKB allows for fast, flexible, and extendable generation of SKBs. Building upon matrix factorization and topic modeling, we proposed several variants of MRD2SKB and constructed SKBs fully automatically. Both quantitative and qualitative results of extensive experiments are presented to demonstrate that the performances of the proposed automatically created SKBs are on par with manually and semi-manually prepared SKBs.
dc.identifier.doi10.1109/TASLP.2023.3347927
dc.identifier.eissn2329-9304
dc.identifier.issn2329-9290
dc.identifier.urihttps://hdl.handle.net/11693/117014
dc.language.isoEnglish
dc.publisherInstitute of Electrical and Electronics Engineers
dc.relation.isversionofhttps://dx.doi.org/10.1109/TASLP.2023.3347927
dc.rightsCC BY 4.0 Deed (Attribution 4.0 International)
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.source.titleIEEE-ACM Transactions on Audio, Speech, and Language Processing
dc.subjectSememes
dc.subjectMachine readable dictionary
dc.subjectSememe knowledge bases
dc.subjectSKB
dc.subjectMachine learning
dc.titleAutomatic construction of sememe knowledge bases from machine readable dictionaries
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Automatic_Construction_of_Sememe_Knowledge_Bases_From_Machine_Readable_Dictionaries.pdf
Size:
1.76 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: