publications

2025

  1. What about gravity in video generation? Post-Training Newton’s Laws with Verifiable Rewards
    Minh-Quan Le, Yuanzhi Zhu, Vicky Kalogeiton, and Dimitris Samaras
    2025
  2. Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment
    Minh-Quan Le*, Gaurav Mittal*, Tianjian Meng, A S M Iftekhar, and 4 more authors
    In The Thirteenth International Conference on Learning Representations, 2025
  3. CamoFA: A Learnable Fourier-based Augmentation for Camouflage Segmentation
    Minh-Quan Le*, Minh-Triet Tran*, Trung-Nghia Le, Tam V Nguyen, and 1 more author
    In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025
  4. gunnel.jpg
    GUNNEL: Guided Mixup Augmentation and Multi-Model Fusion for Aquatic Animal Segmentation
    Minh-Quan Le*, Trung-Nghia Le*, Tam V Nguyen, Isao Echizen, and 1 more author
    Neural Computing and Applications, 2025

2024

  1. ∞-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions
    Minh-Quan Le*, Alexandros Graikos*, Srikar Yellapragada, Rajarsi Gupta, and 2 more authors
    In European Conference on Computer Vision, 2024
  2. MaskDiff: Modeling Mask Distribution with Diffusion Probabilistic Model for Few-Shot Instance Segmentation
    Minh-Quan Le, Tam V Nguyen, Trung-Nghia Le, Thanh-Toan Do, and 2 more authors
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2024
  3. Learned representation-guided diffusion models for large-image generation
    Alexandros Graikos*, Srikar Yellapragada*Minh-Quan Le, Saarthak Kapse, and 3 more authors
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

  1. CG
    cg23sketch.jpg
    SketchANIMAR: Sketch-based 3D Animal Fine-Grained Retrieval
    Trung-Nghia Le, Tam V Nguyen, Minh-Quan Le, Trong-Thuan Nguyen, and 7 more authors
    Computers & Graphics, 2023
  2. CG
    cg23text.jpg
    TextANIMAR: Text-based 3D Animal Fine-Grained Retrieval
    Trung-Nghia Le, Tam V Nguyen, Minh-Quan Le, Trong-Thuan Nguyen, and 7 more authors
    Computers & Graphics, 2023
  3. flformer.jpg
    FL-Former: Flood Level Estimation with Vision Transformer for Images from Cameras in Urban Areas
    Quoc-Cuong Le, Minh-Quan Le, Mai-Khiem Tran, Ngoc-Quyen Le, and 1 more author
    In International Conference on Multimedia Modeling, 2023

2022

  1. ismar22.jpg
    Data-Driven City Traffic Planning Simulation
    Tam V Nguyen, Thanh Ngoc-Dat Tran, Viet-Tham Huynh, Bao Truong, and 5 more authors
    In 2022 IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct), 2022
  2. CG
    cg22.jpg
    SHREC 2022 Track on Online Detection of Heterogeneous Gestures
    Marco Emporio, Ariel Caputo, Andrea Giachetti, Marco Cristani, and 7 more authors
    Computers & Graphics, 2022
  3. vbs22.jpg
    V-FIRST: A Flexible Interactive Retrieval System for Video at VBS 2022
    Minh-Triet Tran, Nhat Hoang-Xuan, Hoang-Phuc Trang-Trung, Thanh-Cong Le, and 5 more authors
    In International Conference on Multimedia Modeling, 2022

2021

  1. tip21.jpg
    Camouflaged Instance Segmentation In-the-Wild: Dataset, Method, and Benchmark Suite
    Trung-Nghia Le, Yubo Cao, Tan-Cong Nguyen, Minh-Quan Le, and 4 more authors
    IEEE Transactions on Image Processing, 2021
  2. CG
    cg21.jpg
    SHREC 2021: Skeleton-based Hand Gesture Recognition in the Wild
    Ariel Caputo, Andrea Giachetti, Simone Soso, Deborah Pintani, and 7 more authors
    Computers & Graphics, 2021
  3. Interactive Video Object Mask Annotation
    Trung-Nghia Le, Tam V Nguyen, Quoc-Cuong Tran, Lam Nguyen, and 3 more authors
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2021

2020

  1. CG
    cg20.jpg
    SHREC 2020: Retrieval of Digital Surfaces with Similar Geometric Reliefs
    Elia Moscoso Thompson, Silvia Biasotti, Andrea Giachetti, Claudio Tortorici, and 7 more authors
    Computers & Graphics, 2020
  2. cvprw20vos.jpg
    Multi-Referenced Guided Instance Segmentation Framework for Semi-supervised Video Instance Segmentation
    Minh-Triet Tran, T Hoang, Tam V Nguyen, Trung-Nghia Le, and 5 more authors
    In CVPR Workshops, 2020
  3. cvprw20itask.jpg
    iTASK - Intelligent Traffic Analysis Software Kit
    Minh-Triet Tran, Tam V Nguyen, Trung-Hieu Hoang, Trung-Nghia Le, and 7 more authors
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2020