Sreekant13/README.md

Sreekant Baheti

M.S. Computer Science · Los Angeles, CA

Building AI systems that ship: agentic LLM pipelines, healthcare ML, and the engineering rigor in between.

LinkedIn · Email · Resume · GitHub · Portfolio


About Me

I'm a recent M.S. Computer Science graduate from USC (May 2026) with a background in software and machine learning engineering. I've worked across banks, fintech, and AI startups, and I like problems that sit at the intersection of well-designed systems and real machine learning, where the work has to be correct, fast, and shippable.

Right now I'm focused on agentic LLM systems, applied NLP, and ML for healthcare. I care a lot about the production side: containerized inference, CI/CD, fine-tuning that actually moves a metric, and pipelines that don't fall over at 1,200 requests an hour.

Outside of code, I think about sustainability, interdisciplinary AI, and the bigger questions about why any of this matters.


Technical Skills

Languages: Python · SQL · Java · C++ · TypeScript · Bash

Machine Learning & AI: PyTorch · TensorFlow · scikit-learn · Hugging Face · LangChain · Unsloth

Data & Streaming: pandas · NumPy · Apache Spark · Kafka · DuckDB

Cloud & DevOps: AWS · GCP · Docker · GitHub Actions · FastAPI · Temporal


Featured Projects

Orq: AI Incident Response Operator

A production-style incident response orchestrator built on a 10-node Temporal DAG, with Claude-powered root cause analysis driving the diagnostic path. In tests across 230+ synthetic incidents, MTTR dropped from 47 minutes to 18 minutes.
TypeScript · Next.js · Temporal · Claude API · Railway

relevant-priors-api

A FastAPI service for radiology prior relevance prediction, deployed on Render. It uses TF-IDF n-gram features feeding a scikit-learn classifier and reaches 98.27% accuracy on the public smoke-test set. Designed to slot into a healthcare ML workflow as a low-latency relevance scorer.
Python · FastAPI · scikit-learn · Render

LLaMA 3.2 Vision LoRA Fine-Tuning

Fine-tuned a 4-bit quantized LLaMA 3.2 Vision model on the MultiUI/GUI dataset using Unsloth, exploring efficient multimodal adaptation under tight memory budgets.
PyTorch · Unsloth · LoRA · LLaMA 3.2

EcoMate-AI

A Streamlit app that estimates personal carbon footprints by extracting activity data from natural-language text and images using a multimodal GenAI pipeline.
Python · Streamlit · GenAI · Multimodal

JanusGraph YCSB Benchmarking

A custom YCSB Java binding for JanusGraph, with schema and CRUD driven through remote Gremlin traversal. Used to analyze single-node vs. multi-node performance characteristics under standard workload mixes.
Java · JanusGraph · Gremlin · YCSB


Experience Highlights

  • Software Engineer Intern at The Verse (May–Aug 2025): Agentic LLM pipelines across 47 daily workflows; a BART entity extractor fine-tuned from 0.74 to 0.91 F1; containerized inference handling 1,247 req/hr at peak.
  • Software Engineer at Bank of America: Production backend systems and data workflows.
  • Software Engineering Intern at HighRadius: AI-enabled fintech B2B invoice management using XGBoost.


Beyond Code

  • Long-standing interest in reinforcement learning, evolutionary optimization, and applied LLM research.
  • Thinking a lot about AI for healthcare and education, plus sustainability as a design constraint, not an afterthought.
  • Curious about the deeper why behind the work: spirituality, meaning, and what good engineering owes the people it touches.

Open to opportunities in ML engineering, software engineering, NLP, and healthcare AI.

Pinned

  1. EcoMate-AI (Public)

    Forked from ShamikOfficial/EcoMate-AI

    EcoMate-AI is an AI-powered carbon footprint decoder that transforms your everyday tasks, receipts, bills, and activities into insightful environmental impact reports. Upload text or images and get…

    Python

  2. orq-runtime (Public)

    Python

  3. PocketLLM-Portal (Public)

    Lightweight web portal for CPU-friendly local LLM interaction. React + Fastify + SQLite + Ollama. USC CSCI-578 Software Architecture project.

    TypeScript

  4. relevant-priors-api (Public)

    Python

  5. Streaming-and-Lakehouse-data-pipelines (Public)

    Python

  6. Traveling-Salesman-Problem (Public)

    Projects based on coursework (CSCI561)

    Python