Describir: Graph Unfolding and Sampling for Transitory Video Summarization via Gershgorin Disc Alignment