Over the air federated edge learning with hierarchical clustering

buir.contributor.authorKazemi, Mohammad
buir.contributor.authorDuman, Tolga Mete
buir.contributor.orcidKazemi, Mohammad|0000-0001-5177-1874
buir.contributor.orcidDuman, Tolga Mete|0000-0002-5187-8660
dc.citation.epage17871
dc.citation.issueNumber12
dc.citation.spage17856
dc.citation.volumeNumber23
dc.contributor.authorAygün, Ozan
dc.contributor.authorKazemi, Mohammad
dc.contributor.authorGündüz, Deniz
dc.contributor.authorDuman, Tolga Mete
dc.date.accessioned2025-02-27T08:06:18Z
dc.date.available2025-02-27T08:06:18Z
dc.date.issued2024-12
dc.departmentDepartment of Electrical and Electronics Engineering
dc.description.abstractWe examine federated learning (FL) with over-the-air (OTA) aggregation, where mobile users (MUs) aim to reach a consensus on a global model with the help of a parameter server (PS) that aggregates the local gradients. In OTA FL, MUs train their models using local data at every training round and transmit their gradients simultaneously using the same frequency band in an uncoded fashion. Based on the received signal of the superposed gradients, the PS performs a global model update. While the OTA FL has a significantly decreased communication cost, it is susceptible to adverse channel effects and noise. Employing multiple antennas at the receiver side can reduce these effects, yet the path-loss is still a limiting factor for users located far away from the PS. To ameliorate this issue, in this paper, we propose a wireless-based hierarchical FL scheme that uses intermediate servers (ISs) to form clusters in the areas where the MUs are more densely located. Our scheme utilizes OTA cluster aggregations for the communication of the MUs with their corresponding IS, and OTA global aggregations from the ISs to the PS. We present a convergence analysis for the proposed algorithm, and show through numerical evaluations of the derived analytical expressions and experimental results that utilizing ISs results in a faster convergence and a better performance than the OTA FL alone while using less transmit power. We also validate the results on the performance using different numbers of cluster iterations with different datasets and data distributions. We conclude that the best choice of cluster aggregations depends on the data distribution among the MUs and the clusters.
dc.identifier.doi10.1109/TWC.2024.3457591
dc.identifier.eissn1558-2248
dc.identifier.issn1536-1276
dc.identifier.urihttps://hdl.handle.net/11693/116906
dc.language.isoEnglish
dc.publisherIEEE
dc.relation.isversionofhttps://dx.doi.org/10.1109/TWC.2024.3457591
dc.rightsCC BY-NC-ND 4.0 DEED (Attribution-NonCommercial-NoDerivatives 4.0 International)
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.source.titleIEEE Transactions on Wireless Communications
dc.subjectMachine learning
dc.subjectOver-the-air communications
dc.subjectFederated learning
dc.subjectWireless communications
dc.subjectOver-the-air aggregation
dc.subjectHierarchical clustering
dc.titleOver the air federated edge learning with hierarchical clustering
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Over-the-Air_Federated_Edge_Learning_With_Hierarchical_Clustering.pdf
Size:
7.06 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: