Sequence Modeling
Developing generative models for discrete sequences—including diffusion language models, masked discrete diffusion, and one-step discrete generation—for language modeling and biological sequence design.
We develop generative models for discrete sequences, with applications spanning language modeling and biological sequence design. Traditional autoregressive models generate tokens sequentially, but diffusion and flow-based approaches offer the promise of parallel, non-sequential generation and bidirectional editing.
Our work introduces coupling models for one-step discrete generation, learning direct couplings between discrete sequences and Gaussian latents to generate samples in a single step. We also investigate how to adapt pretrained autoregressive language models to diffusion language models via representation alignment, preserving the semantic structure learned during pretraining while enabling non-sequential generation.
Key Directions
- One-Step Discrete Generation: Coupling models that learn direct couplings between discrete sequences and Gaussian latents
- Diffusion Language Models: Adapting pretrained AR models to diffusion LMs via representation alignment, and planner-aware path learning for training
- Masked Diffusion: Path planning and steering for masked discrete diffusion models via denoising posterior prediction
- Evaluation: Understanding the limitations of generative perplexity and advocating for distributional metrics in text evaluation
People
Selected Publications
-
Hacking Generative Perplexity: Why Unconditional Text Evaluation Needs Distributional Metrics
Antonio Franca, Alexander Tong
In ICML 2026 Workshop on Structured Probabilistic Inference & Generative Modeling (SPIGM)
July 2026
-
Coupling Models for One-Step Discrete Generation
Fred Zhangzhi Peng, Avishek Joey Bose, Anru R. Zhang, Alexander Tong
Preprint
May 2026
-
Don't Retrain, Align: Adapting Autoregressive LMs to Diffusion LMs via Representation Alignment
Fred Zhangzhi Peng, Alexis Fox, Anru R. Zhang, Alexander Tong
Preprint
May 2026
-
Planner Aware Path Learning in Diffusion Language Models Training
Fred Zhangzhi Peng, Zachary Bezemek, Jarrid Rector-Brooks, Shuibai Zhang, Anru R. Zhang, Michael Bronstein, Avishek Joey Bose, Alexander Tong
In ICLR 2026 (Oral)
May 2026
-
Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction
Jarrid Rector-Brooks, Mohsin Hasan, Zhangzhi Peng, Zachary Quinn, Chenghao Liu, Sarthak Mittal, Nouha Dziri, Michael Bronstein, Yoshua Bengio, Pranam Chatterjee, Alexander Tong, Avishek Joey Bose
In ICLR 2025
May 2025
-
Path Planning for Masked Diffusion Model Sampling
Fred Zhangzhi Peng, Zachary Bezemek, Sawan Patel, Jarrid Rector-Brooks, Sherwood Yao, Alexander Tong, Pranam Chatterjee
arXiv preprint
February 2025