CV

Click me for the pdf version of the CV!

Education

  • 2023.09 - 2025.12

    Boston, massachusetts

    Master of Science
    Northeastern University
    Artificial Intelligence
    CGPA : 3.76
    • Algorithms, Foundations of AI, Unsupervised ML, Pattern Recognition and Computer Vision, Machine Learning, Mobile Robotics, Verifiable ML, AI for HCI

Work

  • 2025.01 - 2025.04
    Computer Vision Intern
    Norfolk Southern
    Developed an agentic RAG solution and engineered GPU-accelerated, high-throughput computer vision and Big Data pipelines.
    • Built a semi-automatic labeler that proposes label priors, which was shown to 3x human semantic segmentation labeling speed.
    • Integrated an automated data-augmentation prompter that generated polylines to guide, accelerate, and automate data curation, seamlessly importable into the CVAT platform for training defect detection models.
    • Developed a PyTorch-based zero-shot segmentation model that leverages depth maps from a foundational depth estimator toidentify parts of railway cars by their relative depth in the scene
    • Prototyped a few-shot segmentation pipeline using VRP-SAM to accurately segment novel classes from as few as 20–50 exemplars, compared to conventional models that require 5,000–10,000 labeled images for fine-tuning.
  • 2024.05 - 2024.08
    Generative AI Intern
    Inflohealth
    Built a language model pipeline that processes millions of radiology reports to curate datasets for downstream healthcare analytics.
    • Implemented a multimodal RAG workflow on the Qdrant vector database for large-scale PDF-based question answering.
    • Developed a large-scale PDF retrieval pipeline using the ColPali model, improving top-5 recall by 8% over OCR solutions.
    • Built a multi-agent system using LangGraph that divides the task into sub-tasks, leverages a document retriever, a metadata crawler, and feedback loops to maximize cited content in the generated answer.
    • Implemented a system prompt refinement module using Mutual Information Maximization on LLM-generated candidates which demonstrated an average reduction of 6.2 steps for task completion, and a 14% increase in task completion rate.
    • Built a multi stage reranking based RAG workflow with LlamaIndex for retrieval based on medical rules to maximize.
    • Migrated the data loader pipeline to NVIDIA DALI and Ray, boosting throughput of large-scale model evaluation by 2.1 times.
  • 2023.03 - 2023.07
    Artificial Intelligence Intern
    Sentient.io
    Customized open-source computer vision models with application-specific optimizations for AI microservices.
    • Implemented a video action-recognition pipeline, quantizing the model with TensorRT to reduce latency by 12%.
    • Reduced video object detection misclassification rate by 7% using a Kalman filter motion model.
    • Built a general-purpose auto-labeler using SAM and GroundingDINO, which was applied to three datasets (15,000+ images).
  • 2021.04 - 2022.09
    Artificial Intelligence Engineer Intern
    Juhomi
    Co-developed an AI-powered, microservice-based retail analytics platform from the ground up; it was contracted by four MNCs.
    • Managed a crowdsourced data annotation job for object detection of 252 product classes across 4750 images in Amazon Mechanical Turk, and developed a YOLOv5-based auto-labeller in Amazon SageMaker to extend the dataset to 21,000 images.
    • Integrated image super-resolution and optical character recognition (OCR) improving mAP by 0.2 over naive object detection.
    • Demonstrated the combined detection pipeline’s ability to perform zero-shot object detection of previously unseen classes.
    • Upgraded the API architecture to support asynchronous communication using FastAPI with Uvicorn, enabling an API throughput increase of 341% (i.e., from 12 requests per second to 53 requests per second).
    • Developed continuous training and monitoring workflows using Apache Airflow, Data Version Control, and MLflow, enabling class-specific model updates through model-guided data selection (active learning).

Projects

  • 2025.07.02
    HuskyGuide: Database Interface Agent (sponsored)
    • Spearheaded the development of a multi-agent system leveraging Northeastern University’s knowledge base for policy navigation.
    • Developed a SQL agent that generates complex queries through multi-hop reasoning using execution plans and database contents.
    • Illustrated the algorithm’s behavior and efficacy through comprehensive visualizations
  • 2024.10.02
    ProdSeek: Semantic Product Catalog Search
    • Developed a recommender that takes product selections from images and suggests similar products from a product image corpus
    • Built an adaptive vector search using YOLOv3; embeds live product selections and performs real-time semantic retrieval.
    • Conducted ablation studies demonstrating higher product specificity and qualitative semantic capacity than ResNet embeddings.
  • 2024.10.02
    RobAnn: Exploring Neural Network Robustness
    • Developed an algorithm to quantify deep neural networks’ resistance to adversarial attacks and noisy perturbations.
    • Performed feature-wise robustness analysis to expose adversarially susceptible neurons by targeted weight perturbation.
    • Implemented it with LangChain and packaged it as a FastAPI app that supports integration into concurrent multi-agent systems.

Publications

  • 2023.08.01
    AR-enabled textbooks
    IEEE
    • Published in 2023 4th International Conference on Electronics and Sustainable Communication Systems (ICESC)
    • Built an Augmented Reality mobile application that scans QR codes embedded in textbooks to render animated 3D models.
    • Secured TNSCST funding to deploy the AR pipeline in 9th-grade textbooks for the Tamil Nadu state curriculum.
  • 2023.07.21
    Transformers for Browse Node Classification with Class Imbalance
    IEEE
    • Published in the 2023 International Conference on Computational Intelligence and Sustainable Engineering Solutions (CISES)
    • Fine-tuned the DeBERTa transformer model for e-commerce classification on 10M+ records across 250 browse node categories
    • Applied Focal Loss to mitigate severe class imbalance by down-weighting the gradient updates of dominant classes over time.
    • Achieved an increase of 2% in validation accuracy with faster convergence compared to vanilla DeBERTa and other BERT variants.
  • 2023.06.15
    Safety surveillance using Explainable Object Detection
    Springer
    • Published in Smart Trends in Computing and Communications (SmartCom 2023) as a part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 650))
    • Built an automatic object detection explainer that visualizes the top three salient activation regions influencing predictions.
    • Implemented a novel model-agnostic pipeline that automatically finds salient layers to bypass manual exploration.
    • Demonstrated the pipeline on an artificially biased dataset simulating effects of irresponsible data collection practices.
    • Augmented the object detection pipeline with Sobel features to improve generalizability and reduce bias.

Skills

Programming Languages
Python
C++
C
SQL
Frameworks & Libraries
PyTorch
TensorFlow
HuggingFace
LangChain
Ray
NVIDIA Rapids
NVIDIA DALI
FastAPI
Tools & Platforms
Git
Apache Spark
Airflow
MLflow
Docker
Uvicorn
Databricks
DataRobot
Databases & Vector Stores
PostgreSQL
Qdrant
Cloud Providers & Services
GCP: Cloud Run
GCP: Kubernetes Engine
AWS: SageMaker
AWS: Lambda
AWS: EC2
AWS: App Runner