bunkai is a parser for catalogued ROM file names that follow the No-Intro, TOSEC, and GoodTools naming conventions.
bunkai does not use regular expressions and supports a variety of edge cases for each supported naming convention. It is ported from the well-tested shiratsu-naming Rust library, but provides a more ergonomic, unified C# API. Unlike shiratsu-naming, bunkai does not keep trivia such as malformed TOSEC tag orders; it is a lossy parser intended mainly for scraping.
bunkai is built with the Pidgin parser combinator library.
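As a rough illustration of what regex-free, lossy parsing can look like, the sketch below splits a No-Intro style name into a title and its parenthesised tags using plain string scanning. It is not bunkai's API and does not use Pidgin; `RomNameSketch` and `Split` are hypothetical names, and the sample file name is made up. Square-bracket flags and file extensions are ignored for brevity.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical helper for illustration only; not part of bunkai.
public static class RomNameSketch
{
    // Splits "Title (Tag) (Tag)" into the title and the text of each
    // parenthesised tag, without regular expressions.
    public static (string Title, List<string> Tags) Split(string fileName)
    {
        var tags = new List<string>();

        int firstParen = fileName.IndexOf('(');
        if (firstParen < 0)
        {
            // No tags at all: the whole name is the title.
            return (fileName.Trim(), tags);
        }

        string title = fileName.Substring(0, firstParen).Trim();

        int pos = firstParen;
        while (pos < fileName.Length)
        {
            int open = fileName.IndexOf('(', pos);
            if (open < 0) break;

            int close = fileName.IndexOf(')', open + 1);
            if (close < 0) break; // Unterminated tag: dropped (lossy).

            tags.Add(fileName.Substring(open + 1, close - open - 1));
            pos = close + 1;
        }

        return (title, tags);
    }
}
```

For example, `RomNameSketch.Split("Example Game (USA) (Rev 1)")` yields the title `Example Game` and the tags `USA` and `Rev 1`. Anything between or after the tags is silently discarded, which is the sense in which such a parser is lossy.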
bunkai is a work in progress. Not all features available in shiratsu-naming will be implemented (particularly trivia items such as TOSEC warnings), but the following features are intended:
- No-Intro
  - Scene Number
  - BIOS
  - Title
  - Region
  - Language
    - Multi Language
  - Version
    - `Rev` Versions
    - `v` Single Prefixed Versions
    - `Version` Single Prefixed Versions with `Alt`
    - Unprefixed `1.x` versions
    - `Version` prefixed date versions (Redump BIOS versions)
    - Comma-separated version tags
  - Release
  - Bad Dump
  - Redump Disc tag
  - Redump Multi-tap tag
- TOSEC
  - ZZZ-UNK- (Omitted)
  - Demo
  - Dates
    - Undelimited Dates
  - Version
    - `Rev` versions
    - `v` versions
    - Version in flag
  - Title
    - Degenerate titles missing demo or date
    - Unexpected spaces
  - Publisher
    - by-publisher in Title if ZZZ-UNK
    - by-publisher after tags if ZZZ-UNK
  - Region
    - GoodTools region
  - Language
    - Multilanguage
  - System
  - Video
  - Copyright
  - Media
  - Devstatus
  - Dump info
  - More info
- GoodTools
  - Region
  - Year
  - Translation
  - `REV` Revision
  - Version
    - `VWIPX`
    - `VFinal_`
    - `Vunknown`
    - `V x.xx`
    - `V bX`
    - `V nn`
    - `V nnnn`
    - `V_`
  - Language
    - Multilanguage
  - Volume
  - Dump tags
  - Hack tag
  - Other
    - `PD`
    - `NTSC`
    - `PAL`
    - `NTSC-PAL`
    - `PAL-NTSC`
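
To make the No-Intro version prefixes listed above (`Rev`, `v`, `Version`, and unprefixed `1.x`) more concrete, here is a minimal sketch of classifying the text of a single version tag without regular expressions. `VersionTagSketch` and `Classify` are hypothetical names, not bunkai's API, and the exact tag shapes (a space after `Rev` and `Version`, a digit directly after `v`) are assumptions made for illustration. Date versions, `Alt` suffixes, and comma-separated tags are not handled.

```csharp
using System;

// Hypothetical helper for illustration only; not part of bunkai.
public static class VersionTagSketch
{
    // Returns the recognised prefix and the remaining version text,
    // or null when the tag does not look like a version tag.
    public static (string Prefix, string Value)? Classify(string tag)
    {
        if (tag.StartsWith("Version ", StringComparison.Ordinal))
            return ("Version", tag.Substring("Version ".Length).Trim());

        if (tag.StartsWith("Rev ", StringComparison.Ordinal))
            return ("Rev", tag.Substring("Rev ".Length).Trim());

        // Assumed shape: a lowercase 'v' immediately followed by a digit.
        if (tag.Length > 1 && tag[0] == 'v' && char.IsDigit(tag[1]))
            return ("v", tag.Substring(1));

        // Assumed shape for unprefixed versions: leading digit plus a dot.
        if (tag.Length > 0 && char.IsDigit(tag[0]) && tag.IndexOf('.') >= 0)
            return ("", tag);

        return null;
    }
}
```

Under these assumptions, `Classify("Rev 1")` gives the prefix `Rev` with value `1`, `Classify("v1.2")` gives the prefix `v` with value `1.2`, and a region tag such as `USA` returns `null`.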