Soumyadip

Profile Picture

Robotics Software Engineer working on humanoids at Jio Reality Labs . My interests lie at the intersection of Robot Learning, Computer Vision (2D/3D) and Deep Learning.

Before that, I was a Research Engineer at Infosys CAI × IIIT Delhi on the ALIVE autonomous vehicle project, under Prof. Saket Anand and Prof. Sanjit Kaul. My work touched the full stack: ADAS development, HD-Map generation, end-to-end traffic light following, and a closed-loop Virtual Testbed for Autonomous Vehicle evaluation, for more checkout my article. Following that, I was a Senior Computer Vision Engineer at OpenCV University, with a focus on 3D vision, neural rendering, and robotics perception pipelines.

Born and raised in Dhupguri, a small town in West Bengal, I'm a first-generation college graduate with a B.Tech in Electrical Engineering from IEM, Kolkata (2022). Research experience includes landscape segmentation on satellite data at IIT KGP (Dr. Debashish Chakravarty) and prior learning for GANs at UNSW (Dr. Tanmoy Dam). Kaggle Competition Expert 1x 🥈 1x 🥉 Medals specializing in Object Detection and Segmentation.

💡 Open to Research & Engineering Opportunities in: Reinforcement Learning Robot Learning VLA and World Model - Steerability Generative Models VLM & LLM Computer Vision (2D/3D) Deep Learning

Skills

Languages Python C++ Bash SQL ML / DL & Edge/Cloud PyTorch TensorFlow LeRobot TensorRT NVIDIA GR00T N1.6 Imitation Learning DAgger NVIDIA Riva AWS EC2 AWS Rekognition Robotics & Computer Vision ROS2 PCL Eigen Ceres-Solver Behaviour Trees Lanelet2 Isaac Sim Carla Rerun NeRF Studio gsplat COLMAP hloc OpenCV Domains Robot Learning Reinforcement Learning Computer Vision (2D/3D) SLAM Gaussian Splatting

Publications

A Novel Approach for Urban Unsupervised Segmentation Classification in SAR Polarimetry

IEEE IGARSS 2021

Proposed an innovative unsupervised segmentation approach for Synthetic Aperture Radar (SAR) imagery, leveraging polarimetric features to improve urban area classification accuracy without the need for large annotated datasets.
COVID-DeepNet: Deep Convolutional Neural Network Architecture Designed for Early Prognosis of COVID-19 Using Post-anterior View of Chest X-Rays

Springer Book Chapter 2022

Developed a custom CNN architecture optimized for analyzing chest X-rays to aid in the early detection and prognosis of COVID-19, addressing critical needs during the pandemic.
Analysis of Depth Sensing and Lane Detection Algorithms for Advanced Driver Assistance Systems

Springer Book Chapter 2023

Conducted a comprehensive analysis of various depth-sensing and lane-detection methodologies, evaluating their robustness and real-time performance within Advanced Driver Assistance Systems (ADAS).

Skills

Publications

Blogs & Writing