Skip to content
View kimsijin33's full-sized avatar
😊
I may be slow to respond.
😊
I may be slow to respond.

Block or report kimsijin33

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
kimsijin33/README.md

I'm AI Engineer Kim Si Jin.

AI Engineer Β· LLM/RAG Β· Government R&D Project Lead Β· End-to-End ML


πŸ‘¨β€πŸ’» About Me

I like deep neural nets. - AI Engineer with 4+ years of specialized AI/ML experience and 19+ years of IT infrastructure background.
I design and build RAG systems, LLM applications, and end-to-end AI pipelines β€” from data collection and model training to cloud deployment and monitoring.

  • πŸ† 1st place β€” Upstage AI OCR Competition (2025.12)
  • πŸ“„ SCOPUS-indexed international journal publication β€” GCN-based Fire Situation Recognition (2024)
  • πŸ› Led government R&D AI projects (ETRI, IITP) as AI Project Lead
  • πŸ€– Built RAG/NLP systems deployed in real-world services (112/119 emergency inference, B2B matching platform)
  • ☁️ Experienced in MLOps: MLflow, GitHub Actions, AWS EC2/S3, Docker-based CI/CD/CT pipelines

πŸ”‘ Core Skills

LLM Β· RAG
LangChain ChromaDB FAISS Prompt Engineering

ML Β· DL
PyTorch TensorFlow Scikit-learn XGBoost GCN BERT

MLOps Β· Cloud
AWS Docker MLflow GitHub Actions

Backend Β· Infra
Python FastAPI Spring Boot Node.js PostgreSQL


πŸš€ Featured Projects

πŸ”¬ GCN-Based Fire Situation Recognition (SCOPUS, 2024)

  • Graph Convolutional Network 기반 ν™”μž¬ 상황인식 λͺ¨λΈ 연ꡬ 개발
  • 95% prediction accuracy achieved Β· SCOPUS κ΅­μ œμ €λ„ λ“±μž¬
  • AI Project Lead Β· 데이터 μˆ˜μ§‘/μ „μ²˜λ¦¬/라벨링/λͺ¨λΈ ν•™μŠ΅ μ „ κ³Όμ • μˆ˜ν–‰

πŸ† Upstage AI OCR Competition β€” 1st Place (2025.12)

  • 영수증 κΈ€μž κ²€μΆœ(Text Detection) λͺ¨λΈ 섀계 및 Polygon μ’Œν‘œ κ²€μΆœ μ΅œμ ν™”
  • 3,600+ 이미지 λŒ€μƒ κ°•κ±΄ν•œ(Robust) κ²€μΆœ νŒŒμ΄ν”„λΌμΈ μ™„μ„±
  • Data augmentation + hyperparameter tuning으둜 μΌλ°˜ν™” μ„±λŠ₯ κ·ΉλŒ€ν™”

🚨 112/119 Emergency Urgency Inference β€” ETRI (2020–2024)

  • κΈ΄κΈ‰μ‹ κ³  ν…μŠ€νŠΈμ—μ„œ κΈ΄κΈ‰μ„±(Code 0~4)Β·μ‚¬κ±΄μœ ν˜• μΆ”λ‘  NLP λͺ¨λΈ 및 API 개발
  • AI Project Lead Β· 데이터 κ³Όν•™μž 3λͺ…, AI 개발자 2λͺ… νŒ€ λ¦¬λ“œ

πŸ“Š μ‹ ν˜Έ 처리 DSP λ°μŠ€ν¬ν†± μ†Œν”„νŠΈμ›¨μ–΄ 개발 β€” ETRI (2022–2024)

  • ν•œκ΅­μ „μžν†΅μ‹ μ—°κ΅¬μ› MATLAB으둜 μ œμž‘λœ μ‹ ν˜Έμ²˜λ¦¬ μ• ν”Œλ¦¬μΌ€μ΄μ…˜μ„ μœˆλ„μš° λ°μŠ€ν¬ν†± μ• ν”Œλ¦¬μΌ€μ΄μ…˜μœΌλ‘œ κ°œλ°œν•˜κΈ° μœ„ν•΄ C++ μ†Œν”„νŠΈμ›¨μ–΄λ‘œ λ³€ν™˜ν•¨
  • AI Project Lead Β· 데이터 κ³Όν•™μž 1λͺ…, AI 개발자 3λͺ… νŒ€ λ¦¬λ“œ

🌐 Global B2B Matching AI Platform β€” IITP μ •λ³΄ν†΅μ‹ κΈ°νšν‰κ°€μ› R&D (2023)

  • 400GB λŸ¬μ‹œμ•„ νŠΉν—ˆ 데이터 ETL νŒŒμ΄ν”„λΌμΈ 섀계 β†’ 30,000건 κ³ μˆœλ„ JSON 확보
  • Multilingual BERT 기반 κΈ°μ—…λ§€μΉ­ λΆ„λ₯˜ λͺ¨λΈ 개발 Β· 90% classification accuracy
  • 3단계 크둜슀 λ§€ν•‘ ν…Œμ΄λΈ”(IPC ↔ KSIC) κ³ μ•ˆ, VPN 기반 크둀링 μ—”μ§„μœΌλ‘œ 100,000건 λŸ¬μ‹œμ•„ κΈ°μ—… DB 확보

πŸ” λ―Έμ•„μ°ΎκΈ° ν”„λ‘œμ νŠΈ YOLO β€” ETRI (2020–2021)

  • ν•œκ΅­μ „μžν†΅μ‹ μ—°κ΅¬μ› YOLO λͺ¨λΈμ„ μ΄μš©ν•œ CCTV λ™μ˜μƒ λ―Έμ•„μ°ΎκΈ° ν”„λ‘œμ νŠΈ
  • AI Project Lead Β· 데이터 κ³Όν•™μž 3λͺ…, AI 개발자 2λͺ… νŒ€ λ¦¬λ“œ

πŸ€– RAG Chatbot β€” LangChain + Upstage Solar LLM (2025)

  • End-to-end RAG μ„œλΉ„μŠ€: 직접 μŠ€ν¬λž˜ν•‘ν•œ LangChain κΈ°μˆ λ¬Έμ„œ β†’ ChromaDB/FAISS 벑터 DB ꡬ좕
  • Top-k 검색 정확도 30% κ°œμ„  Β· Docker Compose CI/CD Β· 슀트리밍 UI κ΅¬ν˜„

πŸ“Š Apartment Price Prediction β€” Seoul (2025)

  • ꡭ토ꡐ톡뢀 λ§€λ§€ 데이터 + μ§€ν•˜μ² /λ²„μŠ€ μž…μ§€ 데이터 κ²°ν•©, Feature Engineering
  • XGBoost/LightGBM 앙상블 λͺ¨λΈ Β· RMSE κΈ°μ€€ 베이슀라인 λŒ€λΉ„ 35% 였차 κ°μ†Œ

πŸ“š Research & Publications

Year Title Venue
2024 Recognition of Fire Situation Using Graph Convolutional Network Model SCOPUS-indexed International Journal
2023 GCN λͺ¨λΈμ„ μ΄μš©ν•œ ν™”μž¬ 상황인식 ν•œκ΅­μ •λ³΄μ²˜λ¦¬ν•™νšŒ ν•™μˆ λ°œν‘œλŒ€νšŒ
2023 Graph Convolutional Network λͺ¨λΈμ„ μ΄μš©ν•œ ν™”μž¬ 상황인식 μ„μ‚¬ν•™μœ„ λ…Όλ¬Έ

πŸ… Certifications & Awards

πŸ₯‡ Upstage AI OCR Competition 1st Place 2025.12
πŸ“œ μ •λ³΄μ²˜λ¦¬κΈ°μ‚¬ 2012
🌐 CCNA (만점 1000/1000) 2006
🐧 LPIC-1 2013

πŸ“¬ Contact

Pinned Loading

  1. ClusterGCN ClusterGCN Public

    Forked from benedekrozemberczki/ClusterGCN

    A PyTorch implementation of "Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks" (KDD 2019).

    Python 1

  2. -AI- -AI- Public

    Jupyter Notebook

  3. 8wk-dl-dano-ai.github.io 8wk-dl-dano-ai.github.io Public

    Forked from dano-ai/8wk-dl-dano-ai.github.io

    8-week deep dive into deep learning

    CSS

  4. AIforallthepeople AIforallthepeople Public

    Shell

  5. nngraph nngraph Public

    Forked from torch/nngraph

    Graph Computation for nn

    Lua

  6. pygcn pygcn Public

    Forked from tkipf/pygcn

    Graph Convolutional Networks in PyTorch

    Python