DebateLLM Fallacy Detector - Evaluation Suite

This is the evaluation framework for the DebateLLM Fallacy Detector, a DeBERTa-base model specialized in identifying logical fallacies in text. Developed as part of my GSoC (Google Summer of Code) project.

🚀 Project Overview

The primary aim of this suite is to rigorously evaluate the performance of RowdyI7er/DebateLLM. It includes tools for:

Small-Scale Verification: Initial assessment of the model with 32 statements.
Large-Scale Performance Benchmarking: A comprehensive test using 300 unique logical fallacy statements.
Detailed Reporting: Automated generation of performance metrics, class breakdowns, and confusion analysis.
Word-Level Explainability (SHAP): Heatmap-based analysis showing which specific keywords trigger each fallacy label (GSoC Highlight).

📋 Detected Fallacies

The model covers the following 8 categories:

Ad Hominem
Appeal to Authority
Bandwagon
False Dilemma
Hasty Generalization
No Fallacy (Standard Factual Statement)
Slippery Slope
Strawman

📁 Repository Structure

├── scripts/             # Main evaluation and report generation logic.
│   ├── evaluate_300.py   # Large-scale (300 statements) inferencing script.
│   └── ...
├── reports/             # Detailed Markdown performance reports.
│   └── performance_report_300.md
├── data/                # Results and label mappings.
│   └── evaluation_results_300.json
└── research/            # Investigation and debug history.

🛠 Quick Start

Installation

pip install torch transformers tqdm

Running the 300-Statement Evaluation

To replicate the performance test and generate a fresh report:

Execute the Evaluation:
```
python scripts/evaluate_300.py
```
Generate the Report:
```
python scripts/generate_report_300.py
```

Check the reports/ folder for the result.

Word-Level Explainability (SHAP)

To understand why the model predicted a certain fallacy:

python scripts/explain_fallacy.py "Your input sentence here"

The script will save a Heatmap HTML report in reports/explanations/. Open this file in your browser to see the word-level saliency map.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
reports		reports
research		research
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DebateLLM Fallacy Detector - Evaluation Suite

🚀 Project Overview

📋 Detected Fallacies

📁 Repository Structure

🛠 Quick Start

Installation

Running the 300-Statement Evaluation

Word-Level Explainability (SHAP)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

DebateLLM Fallacy Detector - Evaluation Suite

🚀 Project Overview

📋 Detected Fallacies

📁 Repository Structure

🛠 Quick Start

Installation

Running the 300-Statement Evaluation

Word-Level Explainability (SHAP)

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages