Arnesh Banerjee

I like to train models, build agents, and tinker with multi-agent systems for autonomy and safety.

I am an undergraduate researcher (B.Tech CSE, Data Science) at Heritage Institute of Technology, Kolkata. I am interested in multi-agent reinforcement learning, computer vision, AI safety, LLMs, and applied ML. Past projects include applied ML for cancer prognosis, thermographic image segmentation for cancer detection using hybrid CNN-Transformer architectures, and a modified Safe RLHF pipeline for safety benchmarking and safer alignment of large language models. I have also worked on identifying failure modes of LLMs in mathematical reasoning.

My ongoing work focuses on RL environments and simulation for autonomous systems, specifically a MARL drone simulator for defence applications. I am currently interning at IIT Kharagpur, working on India's first genomic language model (IgLM).

I am interested in joining a PhD program after my bachelor's degree.

Reach me at arnesh.banerjee.ds27@heritageit.edu.in

* * *

Research Experience

IIT Kharagpur — May 2026 – present, on-site [offer, KRITI portal]

Advisor: Dr. Sourangshu Bhattacharya

Jadavpur University — Nov 2025 – May 2026, remote

Advisor: Prof. Debotosh Bhattacharjee

New Jersey Institute of Technology — Jun – Nov 2025, on-site / virtual [certificate]

Advisor: Dr. Arnob Ghosh

Heritage Institute of Technology — Oct 2024 – Mar 2025, on-site [AGC 2026 certificate]

Advisor: Ms. Arpita Talukdar

* * *

Publications

Recursive and Wrapper-Based Feature Selection for Breast Cancer Diagnosis and Prognosis [oral, certificate]
Ayushi Bhattacharjee, Arnesh Banerjee, Arpita Talukdar.
4th Analytics Global Conference (AGC 2026), March 2026.

* * *

Preprints

Certifiable Safe RLHF: Semantic Grounding and Fixed Penalty Constraint Optimization for Safer LLM Alignment [under review, COLM 2026]
Kartik Pandit, Sourav Ganguly, Arnesh Banerjee, Shaahin Angizi, Arnob Ghosh. 2025.
arXiv:2510.03520

An Intelligent Weakly Supervised Framework for Breast Thermography Segmentation Using Hybrid CNN–Transformer Networks [in prep]
Arnesh Banerjee, Debotosh Bhattacharjee.
In preparation for Expert Systems with Applications.

* * *

Ongoing Research

Co-evolutionary Multi-Agent RL for Autonomous Drones
Arnesh Banerjee.
With the AI for Defence Lab, ULiège, Belgium.

Understanding the Limitations of LLMs in Mathematical Reasoning
Arnesh Banerjee, Ayushi Bhattacharjee, Subhajit Datta.
Advisor: Prof. Subhajit Datta. B.Tech coursework.

Analyzing Historical Revisionism in LLMs in the Context of Indian History
Kartik Pandit, Sourav Ganguly, Arnesh Banerjee, Ayushi Bhattacharjee, Avirup Chakraborty, Arnob Ghosh.
Advisor: Dr. Arnob Ghosh.

* * *

Blogs

Coming soon. I plan to write about RL environments, MARL, interpretability, and notes from papers I find interesting.

* * *

Achievements

* * *

Last updated June 2026 · arnesh.banerjee.ds27@heritageit.edu.in