M.S. Computer Science · Los Angeles, CA
Building AI systems that ship: agentic LLM pipelines, healthcare ML, and the engineering rigor in between.
I'm a recent M.S. Computer Science graduate from USC (May 2026) with a background in software and machine learning engineering. I've worked across banks, fintech, and AI startups, and I like problems that sit at the intersection of well-designed systems and real machine learning, where the work has to be correct, fast, and shippable.
Right now I'm focused on agentic LLM systems, applied NLP, and ML for healthcare. I care a lot about the production side: containerized inference, CI/CD, fine-tuning that actually moves a metric, and pipelines that don't fall over at 1,200 requests an hour.
Outside of code, I think about sustainability, interdisciplinary AI, and the bigger questions about why any of this matters.
A production-style incident response orchestrator built on a 10-node Temporal DAG, with Claude-powered root cause analysis driving the diagnostic path. Tested across 230+ synthetic incidents, with MTTR dropping from 47 minutes to 18 minutes. TypeScript · Next.js · Temporal · Claude API · Railway
A FastAPI service for radiology prior relevance prediction, deployed on Render. TF-IDF with n-gram features feeding a scikit-learn classifier, hitting 98.27% accuracy on the public smoke-test set. Designed to slot into a healthcare ML workflow as a low-latency relevance scorer. Python · FastAPI · scikit-learn · Render
Fine-tuned a 4-bit quantized LLaMA 3.2 Vision model on the MultiUI/GUI dataset using Unsloth, exploring efficient multimodal adaptation under tight memory budgets. PyTorch · Unsloth · LoRA · LLaMA 3.2
A Streamlit app that estimates personal carbon footprints by extracting activity data from natural-language text and images using a multimodal GenAI pipeline. Python · Streamlit · GenAI · Multimodal
A custom YCSB Java binding for JanusGraph, with schema and CRUD driven through remote Gremlin traversal. Used to analyze single-node vs. multi-node performance characteristics under standard workload mixes. Java · JanusGraph · Gremlin · YCSB
- Software Engineer Intern at The Verse (May - Aug 2025): Agentic LLM pipelines across 47 daily workflows, BART entity extractor fine-tuned from 0.74 to 0.91 F1, containerized inference handling 1,247 req/hr at peak.
- Software Engineer at Bank of America: Production backend systems and data workflows.
- Software Engineering Intern at HighRadius: AI-Enabled FinTech B2B invoice management using XGBoost.
- Long-standing interest in reinforcement learning, evolutionary optimization, and applied LLM research.
- Thinking a lot about AI for healthcare and education, plus sustainability as a design constraint, not an afterthought.
- Curious about the deeper why behind the work: spirituality, meaning, and what good engineering owes the people it touches.
Open to opportunities in ML engineering, software engineering, NLP, and healthcare AI.