Describir: Federated reinforcement learning for scheduling-offloading policies in multi-cluster NOMA systems