Prioritized binary transformation method for efficient multi-label classification of data streams with many labels
buir.contributor.author | Yıldırım, Onur | |
buir.contributor.author | Bakhshi, Sepehr | |
buir.contributor.author | Can, Fazlı | |
buir.contributor.orcid | Yıldırım, Onur|0009-0009-9274-8908 | |
dc.citation.epage | 4222 | |
dc.citation.spage | 4218 | |
dc.contributor.author | Yıldırım, Onur | |
dc.contributor.author | Bakhshi, Sepehr | |
dc.contributor.author | Can, Fazlı | |
dc.coverage.spatial | United States | |
dc.date.accessioned | 2025-02-28T11:04:00Z | |
dc.date.available | 2025-02-28T11:04:00Z | |
dc.date.issued | 2024-10-21 | |
dc.department | Department of Music | |
dc.description | Conference Name: 33rd ACM International Conference on Information and Knowledge Management, | |
dc.description | Date of Conference: 21 October 2024 | |
dc.description.abstract | Real-time data processing systems generate huge amounts of data that need to be classified. The volume, variety, velocity, and veracity (uncertainty) of this data necessitate new approaches and the adaptation of existing classification methods. Moreover, the arriving data can belong to more than one class at the same time. As the number of labels grows larger, a significant portion of the multi-label data stream classification methods become computationally inefficient. We propose a novel online approach: the Prioritized Binary Transformation (PBT) method, which can classify data with large numbers of labels by ordering the labels using Principal Component Analysis (PCA) within a fixed-size window. This order is then used to transform the label vectors for classification. We perform an empirical analysis on 12 datasets and compare PBT to four prominent baselines using four evaluation metrics. PBT achieves the best average ranking in three of the four evaluation metrics. Moreover, we investigate efficiency under average execution time per data item and memory consumption where PBT achieves second and first average rankings, respectively. © 2024 Owner/Author. | |
dc.identifier.doi | 10.1145/3627673.3679980 | |
dc.identifier.isbn | 9798400704369 | |
dc.identifier.issn | 21550751 | |
dc.identifier.uri | https://hdl.handle.net/11693/116992 | |
dc.language.iso | English | |
dc.publisher | Association for Computing Machinery | |
dc.relation.isversionof | https://dx.doi.org/10.1145/3627673.3679980 | |
dc.source.title | International Conference on Information and Knowledge Management, Proceedings | |
dc.subject | Data stream | |
dc.subject | Multi-label classification | |
dc.subject | Problem transformation | |
dc.title | Prioritized binary transformation method for efficient multi-label classification of data streams with many labels | |
dc.type | Conference Paper |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Prioritized_binary_transformation_method_for_efficient_multi-label_classification_of_data_streams_with_many_labels.pdf
- Size:
- 1.08 MB
- Format:
- Adobe Portable Document Format
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description: