Skip to content

SoorejS/DebateLLM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DebateLLM Fallacy Detector - Evaluation Suite

This is the evaluation framework for the DebateLLM Fallacy Detector, a DeBERTa-base model specialized in identifying logical fallacies in text. Developed as part of my GSoC (Google Summer of Code) project.

🚀 Project Overview

The primary aim of this suite is to rigorously evaluate the performance of RowdyI7er/DebateLLM. It includes tools for:

  1. Small-Scale Verification: Initial assessment of the model with 32 statements.
  2. Large-Scale Performance Benchmarking: A comprehensive test using 300 unique logical fallacy statements.
  3. Detailed Reporting: Automated generation of performance metrics, class breakdowns, and confusion analysis.
  4. Word-Level Explainability (SHAP): Heatmap-based analysis showing which specific keywords trigger each fallacy label (GSoC Highlight).

📋 Detected Fallacies

The model covers the following 8 categories:

  • Ad Hominem
  • Appeal to Authority
  • Bandwagon
  • False Dilemma
  • Hasty Generalization
  • No Fallacy (Standard Factual Statement)
  • Slippery Slope
  • Strawman

📁 Repository Structure

├── scripts/             # Main evaluation and report generation logic.
│   ├── evaluate_300.py   # Large-scale (300 statements) inferencing script.
│   └── ...
├── reports/             # Detailed Markdown performance reports.
│   └── performance_report_300.md
├── data/                # Results and label mappings.
│   └── evaluation_results_300.json
└── research/            # Investigation and debug history.

🛠 Quick Start

Installation

pip install torch transformers tqdm

Running the 300-Statement Evaluation

To replicate the performance test and generate a fresh report:

  1. Execute the Evaluation:
    python scripts/evaluate_300.py
  2. Generate the Report:
    python scripts/generate_report_300.py

Check the reports/ folder for the result.

Word-Level Explainability (SHAP)

To understand why the model predicted a certain fallacy:

python scripts/explain_fallacy.py "Your input sentence here"

The script will save a Heatmap HTML report in reports/explanations/. Open this file in your browser to see the word-level saliency map.

About

A LLM model for GSOC built via Deberta-Base

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors