Sreekant Baheti Sreekant13

Sreekant Baheti

M.S. Computer Science · Los Angeles, CA

Building AI systems that ship: agentic LLM pipelines, healthcare ML, and the engineering rigor in between.

About Me

I'm a recent M.S. Computer Science graduate from USC (May 2026) with a background in software and machine learning engineering. I've worked across banks, fintech, and AI startups, and I like problems that sit at the intersection of well-designed systems and real machine learning, where the work has to be correct, fast, and shippable.

Right now I'm focused on agentic LLM systems, applied NLP, and ML for healthcare. I care a lot about the production side: containerized inference, CI/CD, fine-tuning that actually moves a metric, and pipelines that don't fall over at 1,200 requests an hour.

Outside of code, I think about sustainability, interdisciplinary AI, and the bigger questions about why any of this matters.

Technical Skills

Languages

Machine Learning & AI

Data & Streaming

Cloud & DevOps

Featured Projects

Orq: AI Incident Response Operator

A production-style incident response orchestrator built on a 10-node Temporal DAG, with Claude-powered root cause analysis driving the diagnostic path. Tested across 230+ synthetic incidents, with MTTR dropping from 47 minutes to 18 minutes. TypeScript · Next.js · Temporal · Claude API · Railway

relevant-priors-api

A FastAPI service for radiology prior relevance prediction, deployed on Render. TF-IDF with n-gram features feeding a scikit-learn classifier, hitting 98.27% accuracy on the public smoke-test set. Designed to slot into a healthcare ML workflow as a low-latency relevance scorer. Python · FastAPI · scikit-learn · Render

LLaMA 3.2 Vision LoRA Fine-Tuning

Fine-tuned a 4-bit quantized LLaMA 3.2 Vision model on the MultiUI/GUI dataset using Unsloth, exploring efficient multimodal adaptation under tight memory budgets. PyTorch · Unsloth · LoRA · LLaMA 3.2

EcoMate-AI

A Streamlit app that estimates personal carbon footprints by extracting activity data from natural-language text and images using a multimodal GenAI pipeline. Python · Streamlit · GenAI · Multimodal

JanusGraph YCSB Benchmarking

A custom YCSB Java binding for JanusGraph, with schema and CRUD driven through remote Gremlin traversal. Used to analyze single-node vs. multi-node performance characteristics under standard workload mixes. Java · JanusGraph · Gremlin · YCSB

Experience Highlights

Software Engineer Intern at The Verse (May - Aug 2025): Agentic LLM pipelines across 47 daily workflows, BART entity extractor fine-tuned from 0.74 to 0.91 F1, containerized inference handling 1,247 req/hr at peak.
Software Engineer at Bank of America: Production backend systems and data workflows.
Software Engineering Intern at HighRadius: AI-Enabled FinTech B2B invoice management using XGBoost.

GitHub Stats

Beyond Code

Long-standing interest in reinforcement learning, evolutionary optimization, and applied LLM research.
Thinking a lot about AI for healthcare and education, plus sustainability as a design constraint, not an afterthought.
Curious about the deeper why behind the work: spirituality, meaning, and what good engineering owes the people it touches.

Open to opportunities in ML engineering, software engineering, NLP, and healthcare AI.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly