Hallucination-prevention RAG system with verbatim span extraction. Ensures all generated content is grounded in source documents with exact citations.
-
Updated
Mar 22, 2026 - Python
Hallucination-prevention RAG system with verbatim span extraction. Ensures all generated content is grounded in source documents with exact citations.
Easy modernBERT fine-tuning and multi-task learning
ModernBERT model optimized for Apple Neural Engine.
2nd place · Detect AI-generated text across 6 classes · ModernBERT + LDAM + per-class ensemble · Macro F1 0.95919 — MALTO Hackathon, Politecnico di Torino
ClassyText is a demo for zero-shot text classification using ModernBERT-large from Hugging Face.
Lightweight, Self-Hosted AI Guardrails Model based on ModernBERT.
End-to-end pipeline that identifies specialized research papers through automated classification, demonstrated with an LLMOps use case that includes data ingestion, model training, evaluation, and deployment.
Code for EXIST 2025 Task 1.1, 1.2 and 1.3
Challenge to distinguish whether a sentence from a news article expresses the subjective view of the author behind it or presents an objective view on the covered topic
High-performance, edge-native compliance engine for the Fair Housing Act (FHA). Powered by ModernBERT, providing privacy-first local inference to detect real estate violations in real-time.
Trajectory Classification with the new architecture of BERT, the 'ModernBERT'
We introduce a binarized approach to Lexical Complexity Prediction (Binary LCP) and systematically compare two generations of encoder-only Transformer models: BERT and ModernBERT. Work completed as part of Natural Language Processing, DATASCI 266.
Fine-tuned ModernBERT for software industry related article summaries
Data Processing Utilities And Training Code for r/changemyview Dataset
This is my attempt at writing an ai detector api by fine-tuning modernBert. The story about this project is discussed in the readme. You can find a link to the model playground below.
An example workflow for fine-tuning ModernBERT for a classification task using the IMDB dataset.
A finetuned ModernBERT model for named entity recognition (NER), trained on the CoNLL-2003 dataset to identify persons, organizations, locations, and miscellaneous entities in english text
Detect duplicate & unused Python code via AST hashing, Jaccard similarity, and semantic embeddings (ModernBERT, C2LLM, EmbeddingGemma). CLI + Python API w hybrid synthesis
Semantic code search using vector embeddings. Search your codebase with natural language queries at the function level.
Add a description, image, and links to the modernbert topic page so that developers can more easily learn about it.
To associate your repository with the modernbert topic, visit your repo's landing page and select "manage topics."