Sourajit Saha
PhD Student, Computer Science at University of Maryland, Baltimore County
Email: ssaha2@umbc.edu Location: ITE 338, UMBC, Baltimore, MD 21250
Computer Vision | Interactive Video Retrieval | Visual Reasoning | Vision and Language
CV
Looking for Research Internship (Winter 2025-2026, Summer 2026)
♦ Interactive Video Retrieval, Search, and Understanding: Advancing interactive video retrieval via VLMs, scene-graph reasoning, VQA-based finetuning, and dialogue-driven systems for improved semantic understanding.
♦ Visual Reasoning: Investigating spatial reasoning, counterfactual visual inference, and editing techniques to enhance model interpretability, adaptability, and causal understanding.
♦ Reliable Vision Systems: Evaluating vision models by detecting hallucinations and measuring generative quality in T2I and T2V outputs for fidelity and alignment.
Bio
I am a Computer Science PhD student, working under the guidance of
Tejas Gokhale
in the UMBC Cognitive Vision Group
at University of Maryland, Baltimore County (UMBC).
I work on interactive video retrieval/search, visual reasoning, and improving/assessing reliability for vision systems.
My research in interactive video retrieval and search spans four key areas:
News
- Jul 2025: Serving in program committee for AAAI 2026
- Jul 2025: Recieved Lambda Research Grant Award at CVPR 2025
- Mar 2025: Reviewing manuscripts for ICCV 2025 (Track: Vision, language, and reasoning)
- Jan 2025: One paper accepted for Oral at WACV 2025
- Dec 2024: Received UMBC GSA Travel Grant
- Oct 2024: One paper accepted at WACV 2025 in Tucson, Arizona (Preprint)
- Mar 2024: Got accepted into SCALE 2024 at JHU to work on event-based visual content retrieval on summer'24
- Feb 2024: Guest lecture on CMSC 491/691: Computer Vision at UMBC
- Feb 2024: Reviewing manuscripts for IJCAI 2024, ACM Transactions on Computing for Healthcare
- Nov 2023: Guest lecture on CMSC 678: (Graduate) Machine Learning at UMBC
- Oct 2023: Reviewing manuscripts for AAAI 2024 (student abstract)
- Sep 2023: Joined Cognitive Vision Group at UMBC, lead by Tejas Gokhale
- Aug 2023: One paper accepted at ICCV 2023 in OODCV, Paris, France
Publications
Most recent publications on Google Scholar.
Improving Shift Invariance in Convolutional Neural Networks with Translation Invariant Polyphase Sampling. Sourajit Saha, Tejas Gokhale
WACV 2025, ICCV 2023 (OODCV workshop) paper video poster code
RFC-Net: Learning High Resolution Global Features for Medical Image Segmentation on a Computational Budget (Student Abstract). Sourajit Saha, Shaswati Saha, Md Osman Gani, Tim Oates, David Chapman
Mitigating Domain Shift in AI-Based TB Screening With Unsupervised Domain Adaptation. Nishanjan Ravin, Sourajit Saha, Alan Schweitzer, Ameena Elahi, Farouk Dako, Daniel Mollura, David Chapman
Pairwise Meta Learning Pipeline: Classifying COVID-19 abnormalities on chest radio-graphs. Sourajit Saha, Yaacov Yesha, Yelena Yesha, Aryya Gangopadhyay, David Chapman, Michael Morris, Babak Saboury, Phuong Nguyen
SPIE Medical Imaging Conference 2022 Paper
A comprehensive set of novel residual blocks for deep learning architectures for diagnosis of retinal diseases from optical coherence tomography images. Sharif Amit Kamran, Sourajit Saha, Ali Shihab Sabbir, Alireza Tavakkoli
Academic Service
- Conferences: AAAI 2026, ICCV 2025, AAAI 2024 (student abstract), IJCAI 2024
- Journals: APSIPA Transactions on Signal and Information Processing (Cambridge University Press), ACM Transactions on Computing for Healthcare Computers and Electronics in Agriculture
Program Committee / Reviewer
- CVF, AAAI, ACL
Membership
- UMBC (TA): CMSC 678: Machine Learning Fall 2024
- UMBC (TA): CMSC 691: Computer Vision Spring 2024
- UMBC (TA): CMSC 678: Machine Learning Fall 2023
- UMBC (TA): CMSC 471: Introduction to Artificial Intelligence Spring 2023
- UMBC (TA): CMSC 313: Assembly Language and Computer Organization Fall 2021
- UMBC (TA): CMSC 341: Data Structures Spring 2021
Teaching
Collaborators
- Tejas Gokhale
- Agneet Chatterjee
- Manas Gaur
- Shaswati Saha
Current Collaborators
- Tim Oates
- David Chapman
- Sharif Amit Kamran
- Ali Shihab Sabbir
Previous Collaborators
Acknowledgement
Website theme inspirations: Aniruddha Saha, Martin Saveski, Aditi Partap.