Describir: An Architecture for Voice-Based Authentication and Authorization with Deepfake Detection