This is a BPE (Byte-Pair Encoding) Tokenizer that will significantly improve the performance of my upcoming AxiaLM models. It does not support case-sensitivity, it's all lowercase. I couldn't find out how to make case sensitivity without getting super complex or making hundreds of costumes.