Describir: Automatic In-Place Text Detection and Translation in Video Games