Describe: Bridging Fault Tolerance, Time Synchronization, and Performance Understanding Across Scalable Architectures and Applications