Minh-Quan Le

portrait_v1.JPG

Computer Vision Lab

Stony Brook, NY, USA 11794

I am currently a third-year Ph.D. student in Computer Science at Stony Brook University, NY, USA, advised by Prof. Dimitris Samaras.

Before joining SBU, I obtained my Bachelor’s Degree in Computer Science - Honors Program at University of Science, Vietnam National University - HCMC, under the supervision of Prof. Minh-Triet Tran, Prof. Tam Nguyen, and Dr. Trung-Nghia Le.

My research interests lie in Computer Vision and Machine Learning with focus on post-training methods in visual generative models and vision-language models.

news

Sep 08, 2025 I join Google as a Student Researcher.
Mar 25, 2025 I’m joining Computer Science Laboratory (LIX) of École Polytechnique, Paris as a visiting student.
Jan 22, 2025 Our paper Hummingbird done during my internship at Microsoft has been accepted to ICLR 2025!
Oct 28, 2024 1 paper CamoFA has been accepted to WACV 2025!
Jul 01, 2024 1 paper ∞-Brush has been accepted to ECCV 2024!
May 28, 2024 I start my research internship at Microsoft, ROAR.
Feb 26, 2024 1 paper has been accepted to CVPR 2024!
Dec 08, 2023 My first A* paper MaskDiff has been accepted to AAAI 2024 (Oral).
Aug 28, 2023 I start my Ph.D. at Department of Computer Science, Stony Brook University.

selected publications

  1. Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment
    Minh-Quan Le*, Gaurav Mittal*, Tianjian Meng, A S M Iftekhar, and 4 more authors
    In The Thirteenth International Conference on Learning Representations, 2025
  2. ∞-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions
    Minh-Quan Le*, Alexandros Graikos*, Srikar Yellapragada, Rajarsi Gupta, and 2 more authors
    In European Conference on Computer Vision, 2024
  3. MaskDiff: Modeling Mask Distribution with Diffusion Probabilistic Model for Few-Shot Instance Segmentation
    Minh-Quan Le, Tam V Nguyen, Trung-Nghia Le, Thanh-Toan Do, and 2 more authors
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2024
  4. Learned representation-guided diffusion models for large-image generation
    Alexandros Graikos*, Srikar Yellapragada*Minh-Quan Le, Saarthak Kapse, and 3 more authors
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

selected preprints

  1. What about gravity in video generation? Post-Training Newton’s Laws with Verifiable Rewards
    Minh-Quan Le, Yuanzhi Zhu, Vicky Kalogeiton, and Dimitris Samaras
    2025