A new CNN-LSTM architecture for activity recognition employing wearable motion sensor data: enabling diverse feature extraction

buir.contributor.authorKoşar, Enes
buir.contributor.authorBarshan, Billur
buir.contributor.orcidKoşar, Enes|0000-0002-8433-8177
buir.contributor.orcidBarshan, Billur|0000-0001-6783-6572
dc.citation.epage19en_US
dc.citation.spage1
dc.citation.volumeNumber124
dc.contributor.authorKoşar, Enes
dc.contributor.authorBarshan, Billur
dc.date.accessioned2024-03-18T07:45:23Z
dc.date.available2024-03-18T07:45:23Z
dc.date.issued2023-06-28
dc.departmentDepartment of Electrical and Electronics Engineering
dc.description.abstractExtracting representative features to recognize human activities through the use of wearables is an area of on-going research. While hand-crafted features and machine learning (ML) techniques have been sufficiently well investigated in the past, the use of deep learning (DL) techniques is the current trend. Specifically, Convolutional Neural Networks (CNNs), Long Short Term Memory Networks (LSTMs), and hybrid models have been investigated. We propose a novel hybrid network architecture to recognize human activities through the use of wearable motion sensors and DL techniques. The LSTM and the 2D CNN branches of the model that run in parallel receive the raw signals and their spectrograms, respectively. We concatenate the features extracted at each branch and use them for activity recognition. We compare the classification performance of the proposed network with three single and three hybrid commonly used network architectures: 1D CNN, 2D CNN, LSTM, standard 1D CNN-LSTM, 1D CNN-LSTM proposed by Ordóñez and Roggen, and an alternative 1D CNN-LSTM model. We tune the hyper-parameters of six of the models using Bayesian optimization and test the models on two publicly available datasets. The comparison between the seven networks is based on four performance metrics and complexity measures. Because of the stochastic nature of DL algorithms, we provide the average values and standard deviations of the performance metrics over ten repetitions of each experiment. The proposed 2D CNN-LSTM architecture achieves the highest average accuracies of 95.66% and 92.95% on the two datasets, which are, respectively, 2.45% and 3.18% above those of the 2D CNN model that ranks the second. This improvement is a consequence of the proposed model enabling the extraction of a broader range of complementary features that comprehensively represent human activities. We evaluate the complexities of the networks in terms of the total number of parameters, model size, training/testing time, and the number of floating point operations (FLOPs). We also compare the results of the proposed network with those of recent related work that use the same datasets.
dc.description.provenanceMade available in DSpace on 2024-03-18T07:45:23Z (GMT). No. of bitstreams: 1 A_new_CNN_LSTM_architecture_for_activity_recognition_employing_wearable_motion_sensor_data_enabling_diverse_feature_extraction.pdf: 5512738 bytes, checksum: ca042b3a9305ca0bf7447f7d21a6f2d6 (MD5) Previous issue date: 2023-06-28en
dc.embargo.release2025-06-28
dc.identifier.doi10.1016/j.engappai.2023.106529
dc.identifier.eissn1873-6769
dc.identifier.issn0952-1976
dc.identifier.urihttps://hdl.handle.net/11693/114859
dc.language.isoen
dc.publisherElsevier
dc.relation.isversionofhttps://doi.org/10.1016/j.engappai.2023.106529
dc.rightsCC BY-NC-ND (Attribution-NonCommercial-NoDerivs 4.0 International)
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.source.titleEngineering Applications of Artificial Intelligence
dc.subjectHuman activity recognition (HAR)
dc.subjectWearables
dc.subjectWearable sensors
dc.subjectMotion sensors
dc.subjectDeep learning
dc.subjectHybrid network models
dc.subjectCNN
dc.subjectLSTM
dc.subjectCNN-LSTM
dc.subjectFeature extraction
dc.subjectModel complexity
dc.subjectUCI HAR dataset
dc.subjectDaily and Sports Activities (DSA) dataset
dc.titleA new CNN-LSTM architecture for activity recognition employing wearable motion sensor data: enabling diverse feature extraction
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
A_new_CNN_LSTM_architecture_for_activity_recognition_employing_wearable_motion_sensor_data_enabling_diverse_feature_extraction.pdf
Size:
5.26 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.01 KB
Format:
Item-specific license agreed upon to submission
Description: