Sourajit Saha

PhD Student, Computer Science at University of Maryland, Baltimore County

Email: ssaha2@umbc.edu   |   Location: ITE 338, UMBC, Baltimore, MD 21250

Interactive Visual Search | Multimodal Retrieval | Multimodal Reasoning

Video Understanding | Computer Vision | Vision and Language

CV

Looking for a research internship (Winter 2025-26, Summer 2026)   |   Upcoming Travel: NeurIPS 2025 – San Diego, CA (Dec 2–7)

Bio

I am a Computer Science PhD student at the University of Maryland, Baltimore County (UMBC), advised by Tejas Gokhale in the UMBC Cognitive Vision Group. I work on interactive multimodal retrieval and search, multimodal reasoning, and video understanding. My research spans the following areas:

Interactive Multimodal Retrieval and Search: enhancing few-shot and zero-shot video search and retrieval, developing scene-graph-based chain-of-thought retrieval frameworks, designing dialogue-driven interactive retrieval systems, and robust ranking evaluation.
Video Understanding: information-theory-guided video question answering (VQA), progressive video captioning, semantic frame selection, and video editing.
Visual Reasoning: understanding spatial relationships and transformations, studying counterfactual visual reasoning, and exploring techniques for image and model editing.

News
Publications

Most recent publications on Google Scholar.

Side Effects of Erasing Concepts from Diffusion Models. Shaswati Saha, Sourajit Saha, Manas Gaur, Tejas Gokhale

EMNLP 2025 paper code


Improving Shift Invariance in Convolutional Neural Networks with Translation Invariant Polyphase Sampling. Sourajit Saha, Tejas Gokhale

WACV 2025 paper video poster code


RFC-Net: Learning High Resolution Global Features for Medical Image Segmentation on a Computational Budget (Student Abstract). Sourajit Saha, Shaswati Saha, Md Osman Gani, Tim Oates, David Chapman

AAAI 2023 paper code


Mitigating Domain Shift in AI-Based TB Screening With Unsupervised Domain Adaptation. Nishanjan Ravin, Sourajit Saha, Alan Schweitzer, Ameena Elahi, Farouk Dako, Daniel Mollura, David Chapman

IEEE Access paper code


Pairwise Meta Learning Pipeline: Classifying COVID-19 abnormalities on chest radio-graphs. Sourajit Saha, Yaacov Yesha, Yelena Yesha, Aryya Gangopadhyay, David Chapman, Michael Morris, Babak Saboury, Phuong Nguyen

SPIE Medical Imaging Conference 2022 Paper


A comprehensive set of novel residual blocks for deep learning architectures for diagnosis of retinal diseases from optical coherence tomography images. Sharif Amit Kamran, Sourajit Saha, Ali Shihab Sabbir, Alireza Tavakkoli

Springer Book Series, 2020 paper code


Optic-Net: A Novel Convolutional Neural Network for Diagnosis of Retinal Diseases from Optical Tomography Images. Sharif Amit Kamran, Sourajit Saha, Ali Shihab Sabbir, Alireza Tavakkoli

ICMLA 2019 paper code

Academic Service
Collaborators
Acknowledgement

Website theme inspirations: Aniruddha Saha, Martin Saveski, Aditi Partap.

Last updated 12/01/2025