Cao Labs - Compact n-gram

Matt-38 • Created November 20, 2025

42 views

Instructions

N-gram + Backoff Model (120 blocks)

This model is an optimized, general-purpose n-gram architecture that uses adaptive backoff.

Context window: 20 characters (sliding window)
Global context: virtually infinite (streaming-compatible)
Generation mode: character-level, with optional multi-token output (1 character at once)

Key Features

• Infinite-context generation: the model can process arbitrarily long inputs without a significant slowdown, thanks to streaming n-gram computation.
• Multi-token generation: although character-based, the model can generate multiple characters at once (“hel” instead of “h”), greatly increasing generation speed.
• On-the-fly inference (no training needed): the model computes n-grams directly from the dataset during generation. This keeps it adaptable, but very large datasets may slow down generation unless multi-token mode is used.

(Data by @Matt-38, text by GPT-5.1)

Project Details

Project ID: 1245700094

Created: November 20, 2025

Last Modified: February 7, 2026

Shared: November 22, 2025

Comments: Allowed