Describe: Autoregressive Video Generation without Vector Quantization