Weimin Bai, PhD Candidate @ Peking University

Greetings! I am a PhD Candidate majoring in Computer Science and Physics at Peking University, supervised by Dr. He Sun and Dr. Wenzheng Chen. I specialize in generative models and inverse rendering. I developed EMDiffusion for data-efficient diffusion training, LatentDEM for blind inversion, VLM3D and Dive3D for 3D-aware reward modeling, and InstantViR for real-time video reconstruction. I have held internships and collaborations with leading industrial institutions such as Microsoft Research Asia, RedNote Inc., Kuaishou Inc., and Shanghai AI Laboratory.

I obtained my Master's degree in Computer Science from Southeast University in 2023, where I was advised by Prof. Hui Xue. Prior to that, I earned a Bachelor's degree in Computer Science from China Agricultural University in 2020.

I am actively seeking internship for 2026 and full-time positions starting in Summer 2027. I am open to collaborations and discussions—please feel free to reach out anytime.

Vision

Build practical intelligence, which necessitates (1) real-time inference and (2) infinite context memory, thus ultimately empowering applications like agents to liberate human society from repetitive labor and enjoy a peaceful life.

News

  • 2025: EM Generalist: A physics-driven diffusion foundation model for electron microscopy. H Sun, E Ye, C Lu, Z Jiang, W Bai, S Ren, C Wang, J Zhang, R Shi, L Ma, ... (2025)
  • 2025: Microscopic image restoration and uncertainty quantification using physics-informed generative models. W Bai, S Ren, E Ye, W Tang, H Sun. Computational Optical Imaging and Artificial Intelligence in Biomedical …, 2025
  • 2025: InstantViR: Real-Time Video Inverse Problem Solver with Distilled Diffusion Prior. W Bai, S Xu, Y Ren, J Hao, M Sun, W Chen, H Sun. arXiv preprint arXiv:2511.14208, 2025
  • 2025: Masked Auto-Regressive Variational Acceleration: Fast Inference Makes Practical Reinforcement Learning. Y Gu, W Bai, Y Wang, W Luo, H Sun. arXiv preprint arXiv:2511.15190, 2025
  • 2025: Let Language Constrain Geometry: Vision-Language Models as Semantic and Spatial Critics for 3D Generation. W Bai, Y Li, W Luo, Z Lai, Y Wang, W Chen, H Sun. arXiv preprint arXiv:2511.14271, 2025
  • August 2024: Talk at University of Chicago IMSISolving Inverse Problems with Inaccurate Priors or Physical Model.
  • 2025: Dive3D: Diverse Distillation-based Text-to-3D Generation via Score Implicit Matching. W Bai, Y Li, W Chen, W Luo, H Sun. Transactions on Machine Learning Research, 2025
  • 2025: Vision-Language Models as Differentiable Semantic and Spatial Rewards for Text-to-3D Generation. W Bai, Y Li, W Luo, W Chen, H Sun. arXiv preprint arXiv:2509.15772, 2025
  • 2025: Blind inversion using latent diffusion priors. W Bai, S Chen, W Chen, H Sun. SIGGRAPH Asia 2025 Poster (IEEE Transactions on Image Processing), 2025
  • 2025: Uni-Instruct: One-step Diffusion Model through Unified Diffusion Divergence Instruction. Y Wang, W Bai, C Zhang, D Zhang, W Luo, H Sun. Advances in Neural Information Processing Systems, 2025
  • 2025: Learning diffusion model from noisy measurement using principled expectation-maximization method. W Bai, W Tang, E Ye, S Chen, W Chen, H Sun. ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and …, 2025
  • 2024: An expectation-maximization algorithm for training clean diffusion models from corrupted observations. W Bai, Y Wang, W Chen, H Sun. Advances in Neural Information Processing Systems, 2024
  • 2024: Integrating amortized inference with diffusion models for learning clean distribution from corrupted images. Y Wang, W Bai, W Luo, W Chen, H Sun. arXiv preprint arXiv:2407.11162, 2024
  • 2023: A temporal kernel for time series forecasting. W Bai, H Xue. Knowledge-Based Systems, 113623, 2023
  • 2021: Learning Deeper Non-Monotonic Networks by Softly Transferring Solution Space. ZF Wu, H Xue, W Bai. IJCAI, 3200-3206, 2021

Selected Publications (full list see Google Scholar)

VLM3D ICML 2026
Let Language Constrain Geometry: Vision-Language Models as Semantic and Spatial Critics for 3D Generation

Weimin Bai, Yubo Li, Weijian Luo, Zeqiang Lai, Yequan Wang, Wenzheng Chen, He Sun

ICML 2026

InstantViR CVPR 2026
InstantViR: Real-Time Video Inverse Problem Solver with Distilled Diffusion Prior

Weimin Bai, Suzhe Xu, Yiwei Ren, Jinhua Hao, Ming Sun, Wenzheng Chen, He Sun

CVPR 2026 Highlight

A real-time video super-resolution, inpainting, and deblurring framework utilizing amortized autoregressive distillation for fast inference.

MARVAL CVPR 2026
Masked Auto-Regressive Variational Acceleration: Fast Inference Makes Practical Reinforcement Learning

Yuxuan Gu, Weimin Bai, Yifei Wang, Weijian Luo, He Sun

CVPR 2026

The first reinforcement learning framework for masked auto-regressive models.

Dive3D TMLR 2025
Dive3D: Diverse Distillation-based Text-to-3D Generation via Score Implicit Matching

Weimin Bai, Yubo Li, Weijian Luo, Wenzheng Chen, He Sun

Transactions on Machine Learning Research (TMLR), 2025

LatentDEM IEEE TIP 2025
Blind Inversion using Latent Diffusion Priors

Weimin Bai, Siyi Chen, Wenzheng Chen, He Sun

IEEE Transactions on Image Processing, 2025 (SIGGRAPH Asia 2025 Poster)

EMDiffusion NeurIPS 2025
Uni-Instruct: One-step Diffusion Model through Unified Diffusion Divergence Instruction

Yifei Wang, Weimin Bai, Colin Zhang, Debing Zhang, Weijian Luo, He Sun

NeurIPS, 2025

Dive3D SPIE Photonics West 2025
Microscopic Image Restoration and Uncertainty Quantification using Physics-informed Generative Models

Weimin Bai, Yubo Li, Wenzheng Chen, Weijian Luo, He Sun

SPIE Photonics West (Long Oral Presentation), 2025

EMDiffusion NeurIPS 2024
An Expectation-Maximization Algorithm for Training Clean Diffusion Models from Corrupted Observations

Weimin Bai, Yifei Wang, Wenzheng Chen, He Sun

NeurIPS, 2024

EMDiffusion KBS 2023
A Temporal Kernel for Time Series Forecasting

Weimin Bai, Hui Xue

Knowledge-Based Systems, 2023


Mentors with whom I have worked on projects:

Friends with whom I have worked with:

  • Yifei Wang, Undergraduate student at Peking University, and current PhD student at Rice University (Houston).
  • Siyi Chen, Undergraduate student at Peking University, and current PhD student at University of Michigan (Ann Arbor).
  • Weiheng Tang, Undergraduate student at Peking University, and current PhD student at Purdue University.
  • Yuxuan Gu, Undergraduate student at Beihang University, M.S. student at Peking University.
  • Yubo Li, Undergraduate student at Tsinghua University, M.S. student at Peking University.
  • Suzhe Xu, Undergraduate student at Huaqiao University, current internship at Kuaishou Inc.

Academic Service

Journal Reviewer:
IEEE Transactions on Image Processing (TIP),
IEEE Transactions on Neural Networks and Learning Systems (TNNLS),
Knowledge-Based Systems (KBS),

Conference Reviewer:
NeurIPS 2025,
ICLR 2026/2025,
ICASSP 2024/2025/2026,
IJCAI 2023.