Describir: Benchmarking the coding strategies of non-coding mutations on sequence-based downstream tasks with machine learning