CV | Kumar Selvakumaran

Basics

Name	Kumar Selvakumaran
Email	kumar.selvak.27@gmail.com
Phone	(857) 396-6078
Url	https://alshedivat.github.io/al-folio/

Education

2023.09 - 2025.12

Boston, massachusetts
Master of Science

Northeastern University

Artificial Intelligence

CGPA : 3.76
- Algorithms, Foundations of AI, Unsupervised ML, Pattern Recognition and Computer Vision, Machine Learning, Mobile Robotics, Verifiable ML, AI for HCI

Work

2025.01 - 2025.04
Computer Vision Intern

Norfolk Southern

Developed an agentic RAG solution and engineered GPU-accelerated, high-throughput computer vision and Big Data pipelines.
- Built a semi-automatic labeler that proposes label priors, which was shown to 3x human semantic segmentation labeling speed.
- Integrated an automated data-augmentation prompter that generated polylines to guide, accelerate, and automate data curation, seamlessly importable into the CVAT platform for training defect detection models.
- Developed a PyTorch-based zero-shot segmentation model that leverages depth maps from a foundational depth estimator toidentify parts of railway cars by their relative depth in the scene
- Prototyped a few-shot segmentation pipeline using VRP-SAM to accurately segment novel classes from as few as 20–50 exemplars, compared to conventional models that require 5,000–10,000 labeled images for fine-tuning.
2024.05 - 2024.08
Generative AI Intern

Inflohealth

Built a language model pipeline that processes millions of radiology reports to curate datasets for downstream healthcare analytics.
- Implemented a multimodal RAG workflow on the Qdrant vector database for large-scale PDF-based question answering.
- Developed a large-scale PDF retrieval pipeline using the ColPali model, improving top-5 recall by 8% over OCR solutions.
- Built a multi-agent system using LangGraph that divides the task into sub-tasks, leverages a document retriever, a metadata crawler, and feedback loops to maximize cited content in the generated answer.
- Implemented a system prompt refinement module using Mutual Information Maximization on LLM-generated candidates which demonstrated an average reduction of 6.2 steps for task completion, and a 14% increase in task completion rate.
- Built a multi stage reranking based RAG workflow with LlamaIndex for retrieval based on medical rules to maximize.
- Migrated the data loader pipeline to NVIDIA DALI and Ray, boosting throughput of large-scale model evaluation by 2.1 times.
2023.03 - 2023.07
Artificial Intelligence Intern

Sentient.io

Customized open-source computer vision models with application-specific optimizations for AI microservices.
- Implemented a video action-recognition pipeline, quantizing the model with TensorRT to reduce latency by 12%.
- Reduced video object detection misclassification rate by 7% using a Kalman filter motion model.
- Built a general-purpose auto-labeler using SAM and GroundingDINO, which was applied to three datasets (15,000+ images).
2021.04 - 2022.09
Artificial Intelligence Engineer Intern

Juhomi

Co-developed an AI-powered, microservice-based retail analytics platform from the ground up; it was contracted by four MNCs.
- Managed a crowdsourced data annotation job for object detection of 252 product classes across 4750 images in Amazon Mechanical Turk, and developed a YOLOv5-based auto-labeller in Amazon SageMaker to extend the dataset to 21,000 images.
- Integrated image super-resolution and optical character recognition (OCR) improving mAP by 0.2 over naive object detection.
- Demonstrated the combined detection pipeline’s ability to perform zero-shot object detection of previously unseen classes.
- Upgraded the API architecture to support asynchronous communication using FastAPI with Uvicorn, enabling an API throughput increase of 341% (i.e., from 12 requests per second to 53 requests per second).
- Developed continuous training and monitoring workflows using Apache Airflow, Data Version Control, and MLflow, enabling class-specific model updates through model-guided data selection (active learning).

Projects

2025.07.02
HuskyGuide: Database Interface Agent (sponsored)
- Spearheaded the development of a multi-agent system leveraging Northeastern University’s knowledge base for policy navigation.
- Developed a SQL agent that generates complex queries through multi-hop reasoning using execution plans and database contents.
- Illustrated the algorithm’s behavior and efficacy through comprehensive visualizations
2024.10.02
ProdSeek: Semantic Product Catalog Search
- Developed a recommender that takes product selections from images and suggests similar products from a product image corpus
- Built an adaptive vector search using YOLOv3; embeds live product selections and performs real-time semantic retrieval.
- Conducted ablation studies demonstrating higher product specificity and qualitative semantic capacity than ResNet embeddings.
2024.10.02
RobAnn: Exploring Neural Network Robustness
- Developed an algorithm to quantify deep neural networks’ resistance to adversarial attacks and noisy perturbations.
- Performed feature-wise robustness analysis to expose adversarially susceptible neurons by targeted weight perturbation.
- Implemented it with LangChain and packaged it as a FastAPI app that supports integration into concurrent multi-agent systems.

Publications

2023.08.01
AR-enabled textbooks

IEEE
- Published in 2023 4th International Conference on Electronics and Sustainable Communication Systems (ICESC)
- Built an Augmented Reality mobile application that scans QR codes embedded in textbooks to render animated 3D models.
- Secured TNSCST funding to deploy the AR pipeline in 9th-grade textbooks for the Tamil Nadu state curriculum.
2023.07.21
Transformers for Browse Node Classification with Class Imbalance

IEEE
- Published in the 2023 International Conference on Computational Intelligence and Sustainable Engineering Solutions (CISES)
- Fine-tuned the DeBERTa transformer model for e-commerce classification on 10M+ records across 250 browse node categories
- Applied Focal Loss to mitigate severe class imbalance by down-weighting the gradient updates of dominant classes over time.
- Achieved an increase of 2% in validation accuracy with faster convergence compared to vanilla DeBERTa and other BERT variants.
2023.06.15
Safety surveillance using Explainable Object Detection

Springer
- Published in Smart Trends in Computing and Communications (SmartCom 2023) as a part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 650))
- Built an automatic object detection explainer that visualizes the top three salient activation regions influencing predictions.
- Implemented a novel model-agnostic pipeline that automatically finds salient layers to bypass manual exploration.
- Demonstrated the pipeline on an artificially biased dataset simulating effects of irresponsible data collection practices.
- Augmented the object detection pipeline with Sobel features to improve generalizability and reduce bias.

Skills

	Programming Languages
	Python
	C++
	C
	SQL

	Frameworks & Libraries
	PyTorch
	TensorFlow
	HuggingFace
	LangChain
	Ray
	NVIDIA Rapids
	NVIDIA DALI
	FastAPI

	Tools & Platforms
	Git
	Apache Spark
	Airflow
	MLflow
	Docker
	Uvicorn
	Databricks
	DataRobot

	Databases & Vector Stores
	PostgreSQL
	Qdrant

	Cloud Providers & Services
	GCP: Cloud Run
	GCP: Kubernetes Engine
	AWS: SageMaker
	AWS: Lambda
	AWS: EC2
	AWS: App Runner

Basics

Education

Northeastern University

Artificial Intelligence

CGPA : 3.76

Work

Norfolk Southern

Developed an agentic RAG solution and engineered GPU-accelerated, high-throughput computer vision and Big Data pipelines.

Inflohealth

Built a language model pipeline that processes millions of radiology reports to curate datasets for downstream healthcare analytics.

Sentient.io

Customized open-source computer vision models with application-specific optimizations for AI microservices.

Juhomi

Co-developed an AI-powered, microservice-based retail analytics platform from the ground up; it was contracted by four MNCs.

Projects

Publications

IEEE

IEEE

Springer

Skills