VISPool: enhancing transformer encoders with vector visibility graph neural networks
buir.contributor.author | Alikaşifoğlu, Tuna | |
buir.contributor.author | Aras, Arda Can | |
buir.contributor.author | Koç, Aykut | |
buir.contributor.orcid | Alikaşifoğlu, Tuna|0000-0001-8030-8088 | |
buir.contributor.orcid | Koç, Aykut|0000-0002-6348-2663 | |
dc.citation.epage | 2556 | |
dc.citation.spage | 2547 | |
dc.contributor.author | Alikaşifoğlu, Tuna | |
dc.contributor.author | Aras, Arda Can | |
dc.contributor.author | Koç, Aykut | |
dc.coverage.spatial | Hybrid, Bangkok, Thailand | |
dc.date.accessioned | 2025-02-28T07:01:28Z | |
dc.date.available | 2025-02-28T07:01:28Z | |
dc.date.issued | 2024-08-16 | |
dc.department | Department of Electrical and Electronics Engineering | |
dc.department | National Magnetic Resonance Research Center (UMRAM) | |
dc.description | Conference Name: Findings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 | |
dc.description | Date of Conference: 11 August 2024 - 16 August 2024 | |
dc.description.abstract | The emergence of transformers has revolutionized natural language processing (NLP), as evidenced across various NLP tasks. While graph neural networks (GNNs) show recent promise in NLP, they are not standalone replacements for transformers; rather, recent research explores combining transformers and GNNs. Existing GNN-based approaches rely on static graph construction methods that require extensive text processing, and most do not scale with increasing document and word counts. We address these limitations by proposing a novel dynamic graph construction method for text documents based on vector visibility graphs (VVGs) generated from transformer output. We then introduce the visibility pooler (VISPool), a scalable model architecture that seamlessly integrates VVG convolutional networks into transformer pipelines. We evaluate the proposed model on the General Language Understanding Evaluation (GLUE) benchmark datasets. VISPool outperforms the baselines with fewer trainable parameters, demonstrating the viability of the visibility-based graph construction method for enhancing transformers with GNNs. © 2024 Association for Computational Linguistics. | |
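The abstract's core construction, the vector visibility graph, extends natural visibility graphs from scalar time series to sequences of vectors: each transformer output vector becomes a node, and visibility between two positions is tested on the scalar projections of the intermediate vectors onto the reference vector. The sketch below illustrates that criterion under the projection-based VVG definition from the time-series literature; the function name, NumPy usage, and directed edge list are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def vector_visibility_edges(X: np.ndarray) -> list[tuple[int, int]]:
    """Sketch of a vector visibility graph (VVG) over a sequence of vectors X.

    Projection-based criterion: node b is visible from node a if, for every
    intermediate node c, the scalar projection of X_c onto X_a stays below the
    straight line joining the projections of X_a and X_b. Projections are
    taken onto X_a, so visibility is asymmetric in general; we record
    directed edges (a, b) with a < b.
    """
    n = len(X)
    edges = []
    for a in range(n - 1):
        norm_a = np.linalg.norm(X[a])
        if norm_a == 0:
            continue  # skip degenerate zero vectors
        # Scalar projections of every vector onto X_a; proj[a] == ||X_a||.
        proj = X @ X[a] / norm_a
        for b in range(a + 1, n):
            # Natural-visibility test applied to the projection values.
            visible = all(
                proj[c] < proj[b] + (proj[a] - proj[b]) * (b - c) / (b - a)
                for c in range(a + 1, b)
            )
            if visible:
                edges.append((a, b))
    return edges

# Example: token embeddings from a transformer encoder (random stand-ins here).
tokens = np.random.default_rng(0).normal(size=(8, 16))  # 8 tokens, 16-dim
print(vector_visibility_edges(tokens))
```

Because the graph is rebuilt from the encoder's output vectors on every forward pass, the construction is dynamic and avoids the static, preprocessing-heavy graph pipelines the abstract criticizes.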
dc.identifier.doi | 10.18653/v1/2024.findings-acl.149 | |
dc.identifier.isbn | 979-889176099-8 | |
dc.identifier.issn | 0736-587X | |
dc.identifier.uri | https://hdl.handle.net/11693/116965 | |
dc.language.iso | English | |
dc.publisher | Association for Computational Linguistics | |
dc.relation.isversionof | https://dx.doi.org/10.18653/v1/2024.findings-acl.149 | |
dc.source.title | Association for Computational Linguistics. Annual Meeting. Conference Proceedings | |
dc.subject | HUMANITIES and RELIGION::Languages and linguistics::Linguistic subjects::Computational linguistics | |
dc.subject | Convolutional neural networks | |
dc.subject | Modeling languages | |
dc.subject | Natural language processing systems | |
dc.subject | Word processing | |
dc.title | VISPool: enhancing transformer encoders with vector visibility graph neural networks | |
dc.type | Conference Paper | |
Files
Original bundle
- Name: VISPool_Enhancing_Transformer_Encoders_with_Vector_Visibility_Graph_Neural_Networks.pdf
- Size: 614.27 KB
- Format: Adobe Portable Document Format