publications

2025

  1. Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment
    Minh-Quan Le*, Gaurav Mittal*, Tianjian Meng, and 5 more authors
    In The Thirteenth International Conference on Learning Representations, 2025
  2. CamoFA: A Learnable Fourier-based Augmentation for Camouflage Segmentation
    Minh-Quan Le*, Minh-Triet Tran*, Trung-Nghia Le, and 2 more authors
    In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

2024

  1. ∞-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions
    Minh-Quan Le*, Alexandros Graikos*, Srikar Yellapragada, and 3 more authors
    In European Conference on Computer Vision, 2024
  2. MaskDiff: Modeling Mask Distribution with Diffusion Probabilistic Model for Few-Shot Instance Segmentation
    Minh-Quan Le, Tam V Nguyen, Trung-Nghia Le, and 3 more authors
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2024
  3. Learned representation-guided diffusion models for large-image generation
    Alexandros Graikos*, Srikar Yellapragada*Minh-Quan Le, and 4 more authors
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

  1. CG
    cg23sketch.png
    SketchANIMAR: Sketch-based 3D Animal Fine-Grained Retrieval
    Trung-Nghia Le, Tam V Nguyen, Minh-Quan Le, and 8 more authors
    Computers & Graphics, 2023
  2. CG
    cg23text.png
    TextANIMAR: Text-based 3D Animal Fine-Grained Retrieval
    Trung-Nghia Le, Tam V Nguyen, Minh-Quan Le, and 8 more authors
    Computers & Graphics, 2023
  3. flformer.png
    FL-Former: Flood Level Estimation with Vision Transformer for Images from Cameras in Urban Areas
    Quoc-Cuong Le, Minh-Quan Le, Mai-Khiem Tran, and 2 more authors
    In International Conference on Multimedia Modeling, 2023

2022

  1. ismar22.png
    Data-Driven City Traffic Planning Simulation
    Tam V Nguyen, Thanh Ngoc-Dat Tran, Viet-Tham Huynh, and 6 more authors
    In 2022 IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct), 2022
  2. CG
    cg22.png
    SHREC 2022 Track on Online Detection of Heterogeneous Gestures
    Marco Emporio, Ariel Caputo, Andrea Giachetti, and 8 more authors
    Computers & Graphics, 2022
  3. vbs22.png
    V-FIRST: A Flexible Interactive Retrieval System for Video at VBS 2022
    Minh-Triet Tran, Nhat Hoang-Xuan, Hoang-Phuc Trang-Trung, and 6 more authors
    In International Conference on Multimedia Modeling, 2022

2021

  1. tip21.png
    Camouflaged Instance Segmentation In-the-Wild: Dataset, Method, and Benchmark Suite
    Trung-Nghia Le, Yubo Cao, Tan-Cong Nguyen, and 5 more authors
    IEEE Transactions on Image Processing, 2021
  2. CG
    cg21.png
    SHREC 2021: Skeleton-based Hand Gesture Recognition in the Wild
    Ariel Caputo, Andrea Giachetti, Simone Soso, and 8 more authors
    Computers & Graphics, 2021
  3. Interactive Video Object Mask Annotation
    Trung-Nghia Le, Tam V Nguyen, Quoc-Cuong Tran, and 4 more authors
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2021
  4. gunnel.png
    GUNNEL: Guided Mixup Augmentation and Multi-Model Fusion for Aquatic Animal Segmentation
    Minh-Quan Le*, Trung-Nghia Le*, Tam V Nguyen, and 2 more authors
    arXiv preprint arXiv:2112.06193, 2021

2020

  1. CG
    cg20.jpg
    SHREC 2020: Retrieval of Digital Surfaces with Similar Geometric Reliefs
    Elia Moscoso Thompson, Silvia Biasotti, Andrea Giachetti, and 8 more authors
    Computers & Graphics, 2020
  2. cvprw20vos.png
    Multi-Referenced Guided Instance Segmentation Framework for Semi-supervised Video Instance Segmentation
    Minh-Triet Tran, T Hoang, Tam V Nguyen, and 6 more authors
    In CVPR Workshops, 2020
  3. cvprw20itask.png
    iTASK - Intelligent Traffic Analysis Software Kit
    Minh-Triet Tran, Tam V Nguyen, Trung-Hieu Hoang, and 8 more authors
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2020