Describir: Identification and correction of systematic error in high-throughput sequence data