Prioritized binary transformation method for efficient multi-label classification of data streams with many labels

buir.contributor.authorYıldırım, Onur
buir.contributor.authorBakhshi, Sepehr
buir.contributor.authorCan, Fazlı
buir.contributor.orcidYıldırım, Onur|0009-0009-9274-8908
dc.citation.epage4222
dc.citation.spage4218
dc.contributor.authorYıldırım, Onur
dc.contributor.authorBakhshi, Sepehr
dc.contributor.authorCan, Fazlı
dc.coverage.spatialUnited States
dc.date.accessioned2025-02-28T11:04:00Z
dc.date.available2025-02-28T11:04:00Z
dc.date.issued2024-10-21
dc.departmentDepartment of Music
dc.descriptionConference Name: 33rd ACM International Conference on Information and Knowledge Management,
dc.descriptionDate of Conference: 21 October 2024
dc.description.abstractReal-time data processing systems generate huge amounts of data that need to be classified. The volume, variety, velocity, and veracity (uncertainty) of this data necessitate new approaches and the adaptation of existing classification methods. Moreover, the arriving data can belong to more than one class at the same time. As the number of labels grows larger, a significant portion of the multi-label data stream classification methods become computationally inefficient. We propose a novel online approach: the Prioritized Binary Transformation (PBT) method, which can classify data with large numbers of labels by ordering the labels using Principal Component Analysis (PCA) within a fixed-size window. This order is then used to transform the label vectors for classification. We perform an empirical analysis on 12 datasets and compare PBT to four prominent baselines using four evaluation metrics. PBT achieves the best average ranking in three of the four evaluation metrics. Moreover, we investigate efficiency under average execution time per data item and memory consumption where PBT achieves second and first average rankings, respectively. © 2024 Owner/Author.
dc.identifier.doi10.1145/3627673.3679980
dc.identifier.isbn9798400704369
dc.identifier.issn21550751
dc.identifier.urihttps://hdl.handle.net/11693/116992
dc.language.isoEnglish
dc.publisherAssociation for Computing Machinery
dc.relation.isversionofhttps://dx.doi.org/10.1145/3627673.3679980
dc.source.titleInternational Conference on Information and Knowledge Management, Proceedings
dc.subjectData stream
dc.subjectMulti-label classification
dc.subjectProblem transformation
dc.titlePrioritized binary transformation method for efficient multi-label classification of data streams with many labels
dc.typeConference Paper

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Prioritized_binary_transformation_method_for_efficient_multi-label_classification_of_data_streams_with_many_labels.pdf
Size:
1.08 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: