Skip to content
View teslakoile's full-sized avatar

Block or report teslakoile

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
teslakoile/README.md

Hey, I'm Kyle πŸ‘‹

Data Engineer @ Thinking Machines Data Science Β· Philippines

Data Pipelines Generative AI MLOps Backend Software Engineering Cloud Infrastructure

I build data and AI infrastructure for enterprise clients and startups across financial services, investment management, education, and compliance.

Personal Site LinkedIn Email

What I Do

I've worked across enterprise and startup engagements spanning:

  • Data pipelines & warehousing β€” Record linkage on 15M+ records, ETL/ELT across Databricks, Snowflake, BigQuery
  • Agentic AI & MLOps β€” Snowflake Cortex agent workflows, prompt engineering, evaluation, OpenAI integrations
  • Backend engineering β€” 10+ microservices, FastAPI, Django, cross-region architectures on AWS/Azure/GCP
  • Infrastructure β€” Terraform, Kubernetes, CI/CD, production monitoring

Selected Work

πŸ”— Investment Data Platform Backend feature development across 10+ microservices integrating data from Sustainalytics, PitchBook, Bloomberg, and MSCI. FastAPI, Dagster, Snowflake, Kubernetes.

πŸ”— Enterprise Document Intelligence Platform Iterated on Snowflake Cortex AI agent workflows for large-scale document extraction and contextual querying. Agent tooling, prompt engineering, and evaluation across a 15-person cross-functional team.

πŸ”— Single Customer View Pipeline Probabilistic record linkage on ~15M records using Splink. Novel metaphone-based blocking strategies. Optimized runtime from 4+ hours to <1 hour. Serves 10+ business units daily.

πŸ“„ AIComprehend β€” IEEE Publication (ISNCC 2023) Adaptive reading comprehension platform using PFA-based ML. 79% accuracy, 66% AUC. 13.9% improvement in student test scores.

Tech I Work With

Data

Python SQL PySpark Databricks Snowflake BigQuery Airflow Dagster dbt

AI/ML

OpenAI Snowflake Cortex Vertex AI LangChain

Backend & Infra

FastAPI Django AWS Azure GCP Docker Kubernetes Terraform

Certifications

GCP Professional ML Engineer Β· Databricks Data Engineer Associate Β· Azure AI Engineer Associate Β· OpenAI AI Technical Practitioner Β· GCP Generative AI Leader Β· Apache Airflow 3 Fundamentals Β· AWS Cloud Practitioner Β· GCP Cloud Digital Leader Β· Azure Fundamentals

Community & Open Source

  • πŸ› οΈ Apache Airflow contributor
  • 🎀 Speaker at various tech events (Google Developer Groups, Python Philippines, Python Davao, Geeks on a Beach, AWS User Group, ISNCC 2023 Qatar to name a few)
  • πŸ‘₯ Google Developer Group Davao β€” Community Lead / L&D Lead Β· 10+ events Β· 2,000+ participants
  • 🌍 Global Shaper, World Economic Forum β€” Davao Hub

Pinned Loading

  1. ai-comprehend ai-comprehend Public

    Python 1

  2. prettymap-project prettymap-project Public

    Python 1

  3. recipe-blog-trial-project recipe-blog-trial-project Public

    Recipe Blog Trial Project for iOS Development

    Swift 1