Describir: Training Data Curation for Language Models With Weak Supervision