Evaluation of Distributed Statistical Learning

محفوظ في:
التفاصيل البيبلوغرافية
الحاوية / القاعدة:PQDT - Global (2025)
المؤلف الرئيسي: Ávila, André Ismael Ferraz
منشور في:
ProQuest Dissertations & Theses
الموضوعات:
الوصول للمادة أونلاين:Citation/Abstract
Full Text - PDF
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!

MARC

LEADER 00000nab a2200000uu 4500
001 3287857473
003 UK-CbPIL
020 |a 9798270220884 
035 |a 3287857473 
045 2 |b d20250101  |b d20251231 
084 |a 189128  |2 nlm 
100 1 |a Ávila, André Ismael Ferraz 
245 1 |a Evaluation of Distributed Statistical Learning 
260 |b ProQuest Dissertations & Theses  |c 2025 
513 |a Dissertation/Thesis 
520 3 |a Natural Language Processing models have gained significant attention due to the development of large-scale models such as OpenAI’s GPT. These models rely on extensive and diverse datasets, which presents data-sharing challenges such as privacy and ownership.Federated Learning addresses those challenges by allowing multiple actors to collaboratively train a shared model without exchanging the raw data. Not sharing the raw data enables the use of data that, in normal conditions, could not be shared due to privacy, ethical or legal concerns. This decentralised approach minimises the central storage requirements while also lessening data privacy risks.Statistical learning algorithms are commonly used in Federated Learning. However, they must be adapted into distributed statistical learning algorithms in order to handle decentralised data. These distributed algorithms are being developed and, therefore, must obtain empirical results to assess their theoretical foundations. Due to the distributed nature of the algorithms, performing an empirical evaluation is a complex task, as the environments these algorithms operate in, and consequently, the adversities they encounter are difficult to replicate physically and consistently. This dissertation aims to support the development, improvement and analysis of distributed statistical learning algorithms by introducing an evaluation framework implemented as a discrete event simulator. The existent discrete-event simulators are compared and analysed with the evaluation of the target algorithms in mind. Then, a simulator is designed and purpose-built to be extensible, configurable and observable. The developed simulator is validated by comparing its functioning to that of an already established simulator, and its metrics visualisation capabilities are demonstrated. Furthermore, the simulator is used to evaluate a distributed statistical learning algorithm. Based on the evaluation results, a solution is proposed to address the algorithm’s identified functional shortcomings. The proposed solution is also evaluated using the designed simulator, and its results are compared to those of the original implementation.  
653 |a User interface 
653 |a Simulation 
653 |a Artificial intelligence 
653 |a Privacy 
653 |a General Data Protection Regulation 
653 |a Peers 
653 |a Reproducibility 
653 |a Computer science 
773 0 |t PQDT - Global  |g (2025) 
786 0 |d ProQuest  |t ProQuest Dissertations & Theses Global 
856 4 1 |3 Citation/Abstract  |u https://www.proquest.com/docview/3287857473/abstract/embedded/IZYTEZ3DIR4FRXA2?source=fedsrch 
856 4 0 |3 Full Text - PDF  |u https://www.proquest.com/docview/3287857473/fulltextPDF/embedded/IZYTEZ3DIR4FRXA2?source=fedsrch