PhD Researcher · IIT Kanpur

Arvapalli
Sai Susmitha

AI Researcher working on Computer Vision, Medical Imaging, and Trustworthy AI — with a growing focus on LLMs, Multimodal AI, and translational applications.

Arvapalli Sai Susmitha
00

News

Researcher at the intersection of AI & medicine.

I am a PhD candidate at IIT Kanpur, advised by Prof. Vinay P. Namboodiri at the University of Bath. My research centres on Computer Vision applied to Medical Images — spanning medical image retrieval, segmentation, and uncertainty estimation, with a strong emphasis on building trustworthy and interpretable AI systems.

More recently I have expanded into Large Language Models, Multimodal AI, and AI Agents. I am passionate about translational AI — bridging cutting-edge research with real-world clinical and societal impact. I am actively seeking research positions as an AI Researcher, Research Scientist, or Research Engineer.

Medical Imaging Uncertainty Estimation Image Retrieval Vision Transformers VLMs LLMs Multimodal AI Bayesian Deep Learning NLP · Indian Languages Contrastive Learning
  • 2018–now
    Indian Institute of Technology Kanpur
    Integrated Masters & PhD, Computer Science
    CPI 8.9
  • 2014–2018
    IIIT Guwahati
    B.Tech, Computer Science & Engineering
    CPI 8.57
  • 2012–2014
    Sri Gayatri Academy, Hyderabad
    Intermediate (State Board)
    96.1%
  • 2012
    Sri Chaitanya Techno School, Hyderabad
    10th (State Board)
    Grade 9.8
02

Research Projects

Jan 2026 – present
Multimodal Medical VQA for Indian Languages
Guide: Prof. Arnab Bhattacharya, IITK

Curating multilingual Medical VQA datasets for Indian languages. Exploring VLMs to build a foundation for an Indian-language-capable medical assistant system.

VLMsMedical AIMultimodal AI
Jan 2026 – present
Grammar Error Correction for Indian Languages
Guide: Prof. Arnab Bhattacharya, IITK

Developing GEC for Indian languages using foundation models. Building curated datasets for fine-tuning and benchmarking, extending toward intelligent language agents.

NLPLLMsIndian Languages
Apr 2025 – present
Uncertainty Estimation in Medical Image Retrieval
Guide: Prof. Vinay Namboodiri, University of Bath

Exploring Bayesian metric learning and other techniques to quantify retrieval confidence and improve interpretability and diagnostic safety in clinical workflows.

UncertaintyBayesian MLCBMIR
Jul 2024 – Mar 2025
Segmentation-Guided Medical Image Retrieval
Guide: Prof. Vinay Namboodiri, University of Bath

Created a dynamic context-switching network using Swin Transformers that selects optimal spatial context for each image to maximise retrieval performance.

Swin TransformerSegmentationCBMIR
Jul 2023 – Jul 2024
Vision Transformers for Medical Image Retrieval
Guide: Prof. Vinay Namboodiri, University of Bath

Analysed ViT architectures and contrastive learning for medical retrieval, demonstrating superior performance over CNN baselines with quantitative XAI evaluation.

ViTContrastive LearningXAI
Apr 2019 – Jul 2023
Bayesian Ensembles for Medical Image Segmentation
Guide: Prof. Vinay Namboodiri, University of Bath

Developed nine Bayesian ensemble methods for segmentation. Strong correlation between uncertainty and misclassification, enhancing clinical interpretability and trust.

Bayesian EnsemblesSegmentationUncertainty
03

Publications

04

Awards & Recognition

Jul 2025
Doctoral Consortium Best Project / Methodology
Medical Image Understanding and Analysis Conference (MIUA)
Jul 2020 – Jun 2024
TCS Research Fellowship
Tata Consultancy Services — 4-year doctoral fellowship
Oct 2023
1st Position — Generative AI Hackathon 2
Google Developer Student Clubs, IIT Kanpur
Jul 2019
Top 20 Performer & Cash Prize
CVIT Summer School, IIIT Hyderabad
Apr 2026
Invited Talk
BITS Pilani Hyderabad Campus — Novel Insights on Medical Image Retrieval
Ongoing
Reviewer
MICCAI 2026 · MIUA · MIDL · JEI (SPIE) · ICECET
06

Teaching & Service

Teaching Assistantship

  • CS771: Introduction to Machine Learning (×2)2019, 2022
  • CS673: Machine Translation2020
  • ESC101: Introduction to Programming — Head Tutor2020–2022
  • MBA975: AI, ML & Deep Learning (E-Masters)2024
  • EE957: Computer Vision & Image Processing (E-Masters)2025

Leadership & Outreach

  • Web Development Secretary, GH1 Hall — redesigned the hall website after 8+ years2025–2026
  • Head Tutor, ESC101 — mentored large undergraduate cohorts2020–2022
  • HEC Member, GH1 Hall, IIT Kanpur2019–2021

Outside research, I love to dance — good music has a way of taking over, and I've performed at cultural events at IIT Kanpur and during my undergraduate years. I genuinely enjoy conversations and people: some of my favourite moments have come from unexpected discussions across disciplines. I also love to travel — from Japan and Malaysia to Switzerland, Italy, Germany, and the UK — and research has been a great excuse to add Paris, Leeds, and soon Taipei to that list.

07

Get in Touch

I'm open to research collaborations, academic discussions, and opportunities in AI, Computer Vision, Medical AI, LLMs, Multimodal AI, and translational research. Feel free to reach out!

KD 315, Dept. of Computer Science
IIT Kanpur, India 208016