Trait Tokenizer

Source

pub trait Tokenizer: Send + Sync {
    // Required methods
    fn count(&self, text: &str) -> i32;
    fn model_family(&self) -> &str;
    fn encode(&self, text: &str) -> Vec<u32>;
    fn decode(&self, tokens: &[u32]) -> String;
}

Expand description

Trait for counting tokens in text.

Used for token budget management in context assembly. Implementations can provide exact counts (using actual tokenizer) or heuristic estimates based on character ratios.