Dr. Kun Li
Ph.D.
Scene Understanding Group, ITC, University Of Twente
Location: Enschede, Netherlands
Email: k.li@utwente.nl || Google Scholar || ResearchGate || ORCID

About Me

I received my Ph.D. degree from the EOS Department of ITC Faculty, University of Twente, supervised by Prof. George Vosselman and Prof. Michael Ying Yang (University Of Bath, UK). Prior to this, I obtained my master (2018-2021, supervised by Prof. Xiangyun Hu) and bachelor (2014-2018, supervised by Prof. Zhenzhong Chen) degrees at Wuhan University. My research interests lie in Interactive Vision-Language Learning and Image Processing via deep learning-based techniques.


From April to August in 2025, I visited the Audio–Visual Computing Research Group (led by Prof. Sami Brandt) as a guest researcher at the IT University of Copenhagen in Denmark, working on multimodal learning and audio2video generation tasks partially funded by XTREME.

Research Interests

Vision and language learning, Large language models, Multimodal learning, Image segmentation, Object detection, Medical image analysis, Remote sensing, Satellite imagery interpretation, Change detection.

News

  • 2025.08: Sucessfully defended my Ph.D. thesis, young doctor! (University of Twente, Netherlands).
  • 2025.05: One paper about referring image segmentation accepted by ISPRS P&RS.
  • 2025.04: One paper about explainable VQA accepted by CVPRW2025 in Nashville, USA.
  • 2025.04: Visit Copenhagen as a guest researcher at the IT University of Copenhagen in Denmark.
  • 2024.06: One paper about aerial image VQA benchmark accepted by ISPRS P&RS.
  • 2024.03: One co-author paper about multimodal change detection accepted by INFFUS.
  • 2023.10: One paper about interactive image segmentation accepted by ICCVW2023 in Paris, France.
  • 2022.03: Pass the public Ph.D. Qualifier with committee: Prof. George Vosselman, Dr. Michael Ying Yang, Dr. Sylvain Lobry (Université de Paris, France).
  • 2021.09: Begin my new Ph.D. journey at the University of Twente in the Netherlands.
  • 2021.07: Funded by China Scholarship Council (CSC) for 4 years.

Educations

  • 2021.09-2025.08: Ph.D. in Faculty of Geo-Information Science and Earth Observation, University Of Twente, Netherlands.
  • 2018.09-2021.06: M.Sc. in School of Remote Sensing and Information Engineering, Wuhan University, China.
  • 2014.09-2018.06: B.Sc. in School of Remote Sensing and Information Engineering, Wuhan University, China.

Selected Peer-reviewed Publications

  • Scale-wise Bidirectional Alignment Network for Referring Remote Sensing Image Segmentation. [PDF]
    Kun Li, George Vosselman, Michael Ying Yang.
    ISPRS Journal of Photogrammetry and Remote Sensing (ISPRS P&RS), 2025.

  • Multimodal Rationales for Explainable Visual Question Answering. [PDF]
    Kun Li, George Vosselman, Michael Ying Yang.
    Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2025.

  • HRVQA: A Visual Question Answering Benchmark for High-Resolution Aerial Images. [PDF]
    Kun Li, George Vosselman, Michael Ying Yang.
    ISPRS Journal of Photogrammetry and Remote Sensing (ISPRS P&RS), 2024.

  • Transformer-based Multimodal Change Detection with Multitask Consistency Constraints. [PDF]
    Biyuan Liu, Huaixin Chen, Kun Li, Michael Ying Yang.
    Information Fusion (INFFUS), 2024.

  • Interactive Image Segmentation with Cross-Modality Vision Transformers. [PDF]
    Kun Li, George Vosselman, Michael Ying Yang.
    Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2023.

  • A Deep Interactive Framework for Building Extraction in Remotely Sensed Images Via a Coarse-to-Fine Strategy. [PDF]
    Kun Li, Xiangyun Hu.
    IEEE International Geoscience and Remote Sensing Symposium (IGARSS), 2021.

  • Attention-Guided Multi-Scale Segmentation Neural Network for Interactive Extraction of Region Objects from High-Resolution Satellite Imagery. [PDF]
    Kun Li, Xiangyun Hu, Huiwei Jiang, Zhen Shu, Mi Zhang.
    Remote Sensing (RS), 2020.

Preprints

  • Learning from Exemplars for Interactive Image Segmentation. [PDF]
    Kun Li, Hao Cheng, George Vosselman, Michael Ying Yang.
    arXiv, 2024 (Under review).

Presentations

  • Lab presentation on Multimodal Image Segmentation at the Division of Forest and Forest Resources, NIBIO. [Link]
    Ås, Norway, 2025.07.

  • Poster presentation on CVPR 2025 Workshop on Multimodal Learning and Applications. [Link]
    Nashville, USA, 2025.06.

  • Lab presentation on Multimodal learning at Audio–Visual Computing Group, ITU. [Link]
    Copenhagen, Denmark, 2025.05.

  • Poster presentation on ICCV 2023 Workshop on New Ideas in Vision Transformers. [Link]
    Paris, France, 2023.10.

  • Spotlight on the 13th Lisbon Machine Learning Summer School (LxMLS 2023). [Link]
    Lisbon, Portugal, 2023.07.

  • Oral presentation on Netherlands Center for Geodesy and Geo-Informatics (NCG) Symposium. [Link]
    Enschede, Netherlands, 2023.07.

  • Spotlight on Meeting on Development And Sharing of Open Geodata. [Link]
    Enschede, Netherlands, 2023.01.

Professonal Activities

  • Top Reviewers for NeurIPS (2023, 2024).
  • Reviewer for conferences: CVPR, ICCV, ECCV, NeurIPS, ICLR, ICML, AAAI, ICME, ACM MM.
  • Reviewer for journals: TPAMI, IJCV, ISPRS P&RS, TGRS, GRSL.
  • IEEE/CVF student member.
  • Affiliated researcher at the Pioneer Centre for AI, Denmark.
  • Supervisor for Master Thesis ({Akshay Chaprana, 2023}).
  • Teaching Assistant for University of Twente courses ({2D and 3D Scene Analysis, 2021}, {Image Analysis, 2021, 2022}, {AI for Autonomous Robots, 2023}).

Last Updated on June, 2025

Published with GitHub Pages