Biography
I am a Staff Research Scientist at IBM Research, Abu Dhabi, UAE, where I work on advancing the frontiers of computer vision and deep learning. My research centers on visual–spatial and temporal perception, with an emphasis on explainable AI. My work focuses on developing innovative solutions spanning remote sensing, medical imaging, visual language models, and person search, emphasizing real-world deployment and practical impact.
Before joining IBM Research in October 2023, I was a Postdoctoral Researcher at Mohamed bin Zayed University of Artificial Intelligence (MBZUAI), where I worked with Prof. Fahad Khan and Dr. Hisham Cholakkal at the Intelligent Visual Analytics Lab. During this time, I contributed to cutting-edge research in vision transformers, multi-modal learning, and efficient deep learning architectures.
I received my Ph.D. in Computer Science and Engineering from Kyungpook National University, Daegu, Republic of Korea in 2021 under the supervision of Prof. Soon Ki Jung, where my thesis on deep learning for visual tracking received the Outstanding Research CSE Thesis Award. Prior to that, I completed my Master's degree at Sejong University, Seoul, South Korea in 2016 under the guidance of Prof. Sung Wook Baik, and my Bachelor's degree in Computer and Information Science from Pakistan Institute of Engineering and Applied Sciences (PIEAS) in 2011.
Recent Publications
GeoVLM-R1: Reinforcement Fine-Tuning for Improved Remote Sensing Reasoning
Mustansar Fiaz*, H. Debary, P. Fraccaro, D. Paudel, L. V. Gool, F.S. Khan, S. Khan
Under Review
CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections
M. F. Imam, R. F. Marew, J. Hassan, Mustansar Fiaz, A. F. Aji, H. Cholakkal
BMVC 2025
HyRet-Change: A hybrid retentive network for remote sensing change detection
Mustansar Fiaz, M. Noman, H. Debary, K. Ali, H. Cholakkal
IEEE IGARSS 2025 (Oral)
EarthDial: Turning Multi-sensory Earth Observations to Interactive Dialogues
S. Soni*, A. Dudhane*, H. Debary*, Mustansar Fiaz*, M. A. Munir, M. S. Danisho, P. Fraccaro, C. Watson, L. J. Klein, S. Khan, F.S. Khan
CVPR 2025
COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes
M. Ali, M. Javaid, M. Noman, Mustansar Fiaz, S. Khan
WACV 2025 (Oral)
ChangeBind: A Hybrid Change Encoder for Remote Sensing Change Detection
Mustansar Fiaz, M. Noman, H. Cholakkal
IEEE IGARSS 2024 (Oral)
Guided-attention and gated-aggregation network for medical image segmentation
Mustansar Fiaz, M. Noman, H. Cholakkal, R.M. Anwer, J. Hanna, F.S. Khan
Pattern Recognition, 2024
ELGC-Net: Efficient Local–Global Context Aggregation for Remote Sensing Change Detection
M. Noman, Mustansar Fiaz, H. Cholakkal, S. Khan, F.S. Khan
IEEE TGRS, 2024
Remote Sensing Change Detection With Transformers Trained from Scratch
M. Noman, Mustansar Fiaz, H. Cholakkal, S. Narayan, R.M. Anwer, S. Khan, F.S. Khan
IEEE TGRS, 2024
SA2-Net: Scale-aware Attention Network for Microscopic Image Segmentation
Mustansar Fiaz, M. Heidari, R.M. Anwer, H. Cholakkal
BMVC 2023 (Oral)
SAT: Scale-Augmented Transformer for Person Search
Mustansar Fiaz, H. Cholakkal, R.M. Anwer, F. Khan
WACV 2023
PS-ARM: An End-to-End Attention-aware Relation Mixer Network for Person Search
Mustansar Fiaz, H. Cholakkal, S. Narayan, R.M. Anwer, F. Khan
ACCV 2022