Homepage - Avigyan Bhattacharya

Avigyan Bhattacharya

MS Robotics student at CMU

Hey there! I'm currently a second-year Master's (by Research) student in Robotics at Carnegie Mellon University, associated with two wonderful groups, AirLab (Dr. Wenshan Wang and Dr. Sebastian Scherer) and TBDLab (Dr. Aaron Steinfeld). My primary research interest lies in Computer Vision and Multimodal LLMs, particularly in solving challenges related to scene understanding and robot navigation.

Before joining CMU, I earned my Bachelor's in Computer Science from Jadavpur University in India, where I worked on various core vision problems such as object detection, image segmentation and super-resolution. I also spent a year at IIT Bombay in the CMInDS department advised by Dr. Biplab Banerjee, working on how to handle out-of-distribution data in different types of vision applications using VLMs and generative models.

Curriculum Vitae

avigyanb(at)andrew.cmu.edu Google Scholar GitHub LinkedIn

Education

Carnegie Mellon University

Robotics Institute
M.S. Robotics Student

Aug. 2024 - Present
Jadavpur University

B.E. in Computer Science and Engineering

Aug. 2019 - May. 2023

Experience

Robotics Institute, CMU

Graduate Research Assistant at AirLab and TBDLab

Nov. 2024 - Present
IIT Bombay

Research Fellow in AI

Jul. 2023 - Jul. 2024
TCS Research and Innovation Labs

Research Intern in Robotics

May 2022 - Aug. 2022

News

2025

Our latest work from AirLab, RayFronts has been accepted to IROS!

Jun 16

2024

Moved to Pittsburgh to start my Master's in Robotics at CMU

Aug 01

2023

C-SAW wins the Best Paper Award at ACM ICVGIP!

Dec 17

Joined IIT Bombay as a Research Fellow in the CMInDS department

Jul 05

Graduated from Jadavpur University with a Bachelor's in CSE

May 31

Selected Publications (view all )

RayFronts: Open-Set Semantic Ray Frontiers for Online Scene Understanding and Exploration

Omar Alama, Avigyan Bhattacharya, Haoyang He, Seungchan Kim, Yuheng Qiu, Wenshan Wang, Cherie Ho, Nikhil Keetha, Sebastian Scherer

IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2025 Oral

RayFronts offers a unified representation for open-set semantic mapping that bridges the gap between in-range and beyond-range perception in robotic systems. By encoding semantics into both voxels and rays, it enables efficient, fine-grained mapping and planning, even beyond sensor limits, while running in real-time. Our evaluations show significant gains in both segmentation accuracy and exploration efficiency, making RayFronts well-suited for open-world robot navigation.

[Paper] [Code] [Video] [Website]

RayFronts: Open-Set Semantic Ray Frontiers for Online Scene Understanding and Exploration

Omar Alama, Avigyan Bhattacharya, Haoyang He, Seungchan Kim, Yuheng Qiu, Wenshan Wang, Cherie Ho, Nikhil Keetha, Sebastian Scherer

IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2025 Oral

[Paper] [Code] [Video] [Website]

Enhancing the Domain Robustness of Self-Supervised Pre-training with Synthetic Images

Mohamad Hassan N C, Avigyan Bhattacharya, Victor G. Turrisi da Costa, Biplab Banerjee, Elisa Ricci

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2024

We explore how synthetic data can enhance domain adaptability in self-supervised learning by leveraging InstructPix2Pix to generate semantically consistent, multi-domain image variants. By incorporating these synthetic samples into the training process, we significantly improve generalization across diverse benchmarks like DomainNet, PACS, and Office-Home. Our results highlight the promise of diffusion-based augmentation for robust SSL pre-training.

[Paper]

Enhancing the Domain Robustness of Self-Supervised Pre-training with Synthetic Images

Mohamad Hassan N C, Avigyan Bhattacharya, Victor G. Turrisi da Costa, Biplab Banerjee, Elisa Ricci

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2024

[Paper]

C-SAW: Self-Supervised Prompt Learning for Image Generalization in Remote Sensing

Avigyan Bhattacharya, Mainak Singha, Ankit Jha, Biplab Banerjee

ACM Indian Conference on Vision, Graphics and Image Processing (ICVGIP) 2023 Best Paper Award

We introduce a method designed to enhance CLIP’s performance on domain and class generalization tasks in optical remote sensing. By combining a self-supervised loss with a novel prompt learning strategy that incorporates both domain and content-specific features, C-SAW significantly improves zero-shot generalization without fine-tuning the CLIP backbone. Our experiments show consistent gains across multiple remote sensing benchmarks.

[Paper] [Code] [Extended Journal Version @ CVIU]

C-SAW: Self-Supervised Prompt Learning for Image Generalization in Remote Sensing

Avigyan Bhattacharya, Mainak Singha, Ankit Jha, Biplab Banerjee

ACM Indian Conference on Vision, Graphics and Image Processing (ICVGIP) 2023 Best Paper Award

[Paper] [Code] [Extended Journal Version @ CVIU]

Ontology guided Context Understanding for Robotic Task Execution

Dibyarup Dutta, Avigyan Bhattacharya, Snehasis Banerjee

IEEE International Conference on Robotics and Automation (ICRA) - Workshop 2023

Understanding whether a robot can perform a given task in a specific environment requires more than just perception—it calls for reasoning. In this work, we combine an extended IEEE CORA ontology with semantic web technologies and perception to build scene and knowledge graphs that support task feasibility analysis. The system operates in realistic simulation environments and is designed for smooth transition to physical-world deployment.

[Paper] [Patent]

Ontology guided Context Understanding for Robotic Task Execution

Dibyarup Dutta, Avigyan Bhattacharya, Snehasis Banerjee

IEEE International Conference on Robotics and Automation (ICRA) - Workshop 2023

[Paper] [Patent]

Generating Synthetic Computed Tomography (CT) Images to Improve the Performance of Machine Learning Model for Pediatric Abdominal Anomaly Detection

Samayan Bhattacharya*, Avigyan Bhattacharya*, Sk Shahnawaz (* equal contribution)

IEEE/CVF International Conference on Computer Vision (ICCV) - BioImage Computing Workshop 2023

To address the risks and accessibility issues of CT scans in diagnosing pediatric abdominal pain, we explore generating synthetic CT images from orthogonal X-rays using machine learning. This approach enhances anomaly detection such as cysts and appendicitis, while reducing radiation exposure and dependence on high-end equipment. Our method yields a 9.75% performance boost over X-rays alone, showing strong potential for safer, low-resource diagnostics.

[Paper]

Generating Synthetic Computed Tomography (CT) Images to Improve the Performance of Machine Learning Model for Pediatric Abdominal Anomaly Detection

Samayan Bhattacharya*, Avigyan Bhattacharya*, Sk Shahnawaz (* equal contribution)

IEEE/CVF International Conference on Computer Vision (ICCV) - BioImage Computing Workshop 2023

[Paper]

Warning

Action required

Education

Experience

News

Selected Publications (view all )

RayFronts: Open-Set Semantic Ray Frontiers for Online Scene Understanding and Exploration

RayFronts: Open-Set Semantic Ray Frontiers for Online Scene Understanding and Exploration

Enhancing the Domain Robustness of Self-Supervised Pre-training with Synthetic Images

Enhancing the Domain Robustness of Self-Supervised Pre-training with Synthetic Images

C-SAW: Self-Supervised Prompt Learning for Image Generalization in Remote Sensing

C-SAW: Self-Supervised Prompt Learning for Image Generalization in Remote Sensing

Ontology guided Context Understanding for Robotic Task Execution

Ontology guided Context Understanding for Robotic Task Execution

Generating Synthetic Computed Tomography (CT) Images to Improve the Performance of Machine Learning Model for Pediatric Abdominal Anomaly Detection

Generating Synthetic Computed Tomography (CT) Images to Improve the Performance of Machine Learning Model for Pediatric Abdominal Anomaly Detection

All publications