A novel neural ensemble architecture for on-the-fly classification of evolving text streams

buir.contributor.authorGhahramanian, Pouya
buir.contributor.authorBakhshi, Sepehr
buir.contributor.authorFazlı, Can
buir.contributor.orcidGhahramanian, Pouya|0000-0003-3479-8842
buir.contributor.orcidBakhshi, Sepehr|0000-0003-2292-6130
buir.contributor.orcidCan, Fazlı|0000-0003-0016-4278
dc.citation.issueNumber4
dc.citation.volumeNumber18
dc.contributor.authorGhahramanian, Pouya
dc.contributor.authorBakhshi, Sepehr
dc.contributor.authorBonab, Hamed
dc.contributor.authorCan, Fazlı
dc.date.accessioned2025-02-23T17:30:47Z
dc.date.available2025-02-23T17:30:47Z
dc.date.issued2024
dc.departmentDepartment of Computer Engineering
dc.description.abstractWe study on-the-fly classification of evolving text streams in which the relation between the input data and target labels changes over time-i.e., "concept drift." These variations decrease the model's performance, as predictions become less accurate over time and they necessitate a more adaptable system. While most studies focus on concept drift detection and handling with ensemble approaches, the application of neural models in this area is relatively less studied. We introduce Adaptive Neural Ensemble Network (AdaNEN), a novel ensemble-based neural approach, capable of handling concept drift in data streams. With our novel architecture, we address some of the problems neural models face when exploited for online adaptive learning environments. Most current studies address concept drift detection and handling in numerical streams, and the evolving text stream classification remains relatively unexplored. We hypothesize that the lack of public and large-scale experimental data could be one reason. To this end, we propose a method based on an existing approach for generating evolving text streams by introducing various types of concept drifts to real-world text datasets. We provide an extensive evaluation of our proposed approach using 12 state-of-the-art baselines and 13 datasets. We first evaluate concept drift handling capability of AdaNEN and the baseline models on evolving numerical streams; this aims to demonstrate the concept drift handling capabilities of our method on a general spectrum and motivate its use in evolving text streams. The models are then evaluated in evolving text stream classification. Our experimental results show that AdaNEN consistently outperforms the existing approaches in terms of predictive performance with conservative efficiency.
dc.identifier.doi10.1145/3639054
dc.identifier.eissn1556-472X
dc.identifier.issn1556-4681
dc.identifier.urihttps://hdl.handle.net/11693/116696
dc.language.isoEnglish
dc.publisherAssociation for Computing Machinery (ACM)
dc.relation.isversionofhttps://dx.doi.org/10.1145/3639054
dc.rightsCC BY 4.0 (Attribution 4.0 International Deed)
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.source.titleACM Transactions On Knowledge Discovery From Data (TKDD)
dc.subjectData stream mining
dc.subjectConcept drift
dc.subjectText stream classification
dc.subjectEnsemble methods
dc.subjectNeural networks
dc.titleA novel neural ensemble architecture for on-the-fly classification of evolving text streams
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
A_Novel_Neural_Ensemble_Architecture_for_On-the-fly.pdf
Size:
1.29 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: