Drowsiness Detection Using Federated Learning: Lessons Learnt from Dealing with Non-IID Data
Chapter
Published version
Date
2024Metadata
Show full item recordCollections
- Publikasjoner fra CRIStin - SINTEF AS [5864]
- SINTEF Digital [2536]
Original version
PETRA '24: Proceedings of the 17th International Conference on PErvasive Technologies Related to Assistive Environments. 2024, 285-292. 10.1145/3652037.3652074Abstract
The privacy of personal data is paramount in the realm of assisted living and digital healthcare. Federated Learning (FL), with its decentralised model training approach, has emerged as a compelling solution to reconcile the need for personalised models with the requirement to protect sensitive personal information. By allowing model training to occur locally on user devices without centralising raw data, FL is intended to strike a balance between personalisation and privacy. While the potential benefits of FL in assisted living and digital healthcare are substantial, practical implementation poses significant challenges. One of them is the non-Independently and Identically Distributed (non-IID) nature of personal data. Unlike centralised datasets, non-IID data exhibits inherent variability across different individuals, as well as their surrounding contexts. Unfortunately, many research approaches in this domain often overlook the nuances of non-IID data, potentially leading to models that lack robust generalisation across diverse healthcare scenarios. To highlight the importance of this challenge, in this paper, we report on our hands-on experience of building a FL system for drowsiness detection using non-IID data. We compare this federated setup with a traditional, centralised approach to model training by identifying and discussing the associated challenges from multiple perspectives, as well as possible solutions and recommendations for further research.