On federated learning over wireless channels with over-the-air aggregation

buir.advisorDuman, Tolga Mete
dc.contributor.authorAygün, Ozan
dc.date.accessioned2022-08-16T11:27:18Z
dc.date.available2022-08-16T11:27:18Z
dc.date.copyright2022-07
dc.date.issued2022-07
dc.date.submitted2022-08-11
dc.departmentDepartment of Electrical and Electronics Engineeringen_US
dc.descriptionCataloged from PDF version of article.en_US
dc.descriptionThesis (Master's): Bilkent University, Department of Electrical and Electronics Engineering, İhsan Doğramacı Bilkent University, 2022.en_US
dc.descriptionIncludes bibliographical references (leaves 79-86).en_US
dc.description.abstractA decentralized machine learning (ML) approach called federated learning (FL) has recently been at the center of attention since it secures edge users’ data and decreases communication costs. In FL, a parameter server (PS), which keeps track of the global model orchestrates local training and global model aggregation across a set of mobile users (MUs). While there exist studies on FL over wireless channels, its performance on practical wireless communication scenarios has not been investigated very well. With this motivation, this thesis considers wireless FL schemes that use realistic channel models, and analyze the impact of different wireless channel effects. In the first part of the thesis, we study hierarchical federated learning (HFL) where intermediate servers (ISs) are utilized to make the server-side closer to the MUs. Clustering approach is used where MUs are assigned to ISs to perform multiple cluster aggregations before the global aggregation. We first analyze the performance of a partially wireless approach where the MUs send their gradients through a channel with path-loss and fading using over-the-air (OTA) aggregation. We assume that there is no inter-cluster interference and the gradients from the ISs to the PS are sent error-free. We show through numerical and experimental analysis that our proposed algorithm offers a faster convergence and lower power consumption compared to the standard FL with OTA aggregation. As an extension, we also examine a fully-wireless HFL setup where both the MUs and ISs send their gradients through OTA aggregation, taking into account the effect of inter-cluster interference. Our numerical and experimental results reveal that utilizing ISs results in a faster convergence and a better performance than the OTA FL without any IS while using less transmit power. It is also shown that the best choice of cluster aggregations depends on the data distribution among the MUs and the clusters. In the second part of the thesis, we study FL with energy harvesting MUs with stochastic energy arrivals. In every global iteration, the MUs with enough energy in their batteries perform local SGD iterations, and transmit their gradients using OTA aggregation. Before sending the gradients to the PS, the gradients are scaled with respect to the idle time and data cardinality of each MU, through a cooldown multiplier, to amplify the importance of the MUs that send less frequent local updates. We provide a convergence analysis of the proposed setup, and validate our results with numerical and neural network simulations under different energy arrival profiles. The results show that the OTA FL with energy harvesting devices performs slightly worse than the OTA FL without any energy restrictions, and that utilizing the excess energy for more local SGD iterations gives a better convergence rate than simply increasing the transmit power.en_US
dc.description.degreeM.S.en_US
dc.description.statementofresponsibilityby Ozan Aygünen_US
dc.format.extentxiv, 101 leaves : illustrations, charts ; 30 cm.en_US
dc.identifier.itemidB161147
dc.identifier.urihttp://hdl.handle.net/11693/110450
dc.language.isoEnglishen_US
dc.publisherBilkent Universityen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectDistributed machine learningen_US
dc.subjectFederated learningen_US
dc.subjectWireless channelsen_US
dc.subjectPath-lossen_US
dc.subjectFading channelsen_US
dc.subjectEnergy harvesting communicationsen_US
dc.titleOn federated learning over wireless channels with over-the-air aggregationen_US
dc.title.alternativeKablosuz kanallar üzerinden havada birleştirme ile federe öğrenmeen_US
dc.typeThesisen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
B161147.pdf
Size:
2.44 MB
Format:
Adobe Portable Document Format
Description:
Full printable version
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.69 KB
Format:
Item-specific license agreed upon to submission
Description: