Minh-Quan Le

portrait_v1.JPG

Computer Vision Lab

Stony Brook, NY, USA 11794

I am currently a third-year Ph.D. student in Computer Science at Stony Brook University, NY, USA, advised by Prof. Dimitris Samaras.

Before joining SBU, I obtained my Bachelor’s Degree in Computer Science - Honors Program at University of Science, Vietnam National University - HCMC, under the supervision of Prof. Minh-Triet Tran, Prof. Tam Nguyen, and Dr. Trung-Nghia Le.

My research interests lie in Computer Vision and Machine Learning with focus on post-training methods in visual generative models and vision-language models.

news

Sep 08, 2025 I join Google as a Student Researcher.
Mar 25, 2025 I’m joining Computer Science Laboratory (LIX) of École Polytechnique, Paris, France as a visiting student with DATAIA Fellowship.
Jan 27, 2025 I continue my research internship at Microsoft in Spring and Summer 2025.
Jan 22, 2025 Our paper Hummingbird done during my internship at Microsoft has been accepted to ICLR 2025!
Oct 28, 2024 1 paper CamoFA has been accepted to WACV 2025!
Aug 20, 2024 I continue my research internship at Microsoft in Fall 2024.
Jul 01, 2024 1 paper ∞-Brush has been accepted to ECCV 2024!
May 28, 2024 I start my research internship at Microsoft, ROAR.
Feb 26, 2024 1 paper has been accepted to CVPR 2024!
Dec 08, 2023 My first A* paper MaskDiff has been accepted to AAAI 2024 (Oral).
Aug 28, 2023 I start my Ph.D. at Department of Computer Science, Stony Brook University.

selected publications

  1. Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment
    Minh-Quan Le*, Gaurav Mittal*, Tianjian Meng, A S M Iftekhar, and 4 more authors
    In The Thirteenth International Conference on Learning Representations, 2025
  2. ∞-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions
    Minh-Quan Le*, Alexandros Graikos*, Srikar Yellapragada, Rajarsi Gupta, and 2 more authors
    In European Conference on Computer Vision, 2024
  3. MaskDiff: Modeling Mask Distribution with Diffusion Probabilistic Model for Few-Shot Instance Segmentation
    Minh-Quan Le, Tam V Nguyen, Trung-Nghia Le, Thanh-Toan Do, and 2 more authors
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2024
  4. Learned representation-guided diffusion models for large-image generation
    Alexandros Graikos*, Srikar Yellapragada*Minh-Quan Le, Saarthak Kapse, and 3 more authors
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

selected preprints

  1. What about gravity in video generation? Post-Training Newton’s Laws with Verifiable Rewards
    Minh-Quan Le, Yuanzhi Zhu, Vicky Kalogeiton, and Dimitris Samaras
    2025