Skip to content

added varied data for training#39

Merged
philion merged 1 commit intoLocal-Connectivity-Lab:llm-redactorfrom
rudra-singh1:llm-redactor
Dec 10, 2025
Merged

added varied data for training#39
philion merged 1 commit intoLocal-Connectivity-Lab:llm-redactorfrom
rudra-singh1:llm-redactor

Conversation

@rudra-singh1
Copy link
Collaborator

changes:

  • added varied training data (more diverse + 500 more tickets)
  • trained for 2000 iterations instead of 1000 (same 3B model though)
  • appended "do not include empty field tasks. only include properties that were actually found and redacted" to system_prompt in redact.py

goal: with these changes, model shouldn't overfit to training data anymore and adaptively redact PII versus trying to fill in a template

@philion philion merged commit baf1801 into Local-Connectivity-Lab:llm-redactor Dec 10, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants