##article.return## Beyond Next-Token Prediction: A Performance Characterization of Diffusion versus Autoregressive Language Models Download Download PDF