Tokens per Word
Token problems: 4 of 20
Token problems: the workbook
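Does every language tokenize the same way? No. The ratio of tokens to words varies by language, largely because many tokenizers are trained mostly on English text, so words in other languages tend to be split into more pieces. An agent serving multilingual users will therefore pay different token costs for the same message depending on the language it is written in.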
Problem
An AI agent processes a 20-word passage of English text and counts 26 tokens. What is the token-to-word ratio?
Divide tokens by words
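The hint can be written as a short calculation. This is a minimal sketch in Python; the function name `token_to_word_ratio` is chosen here for illustration and is not part of the workbook.

```python
def token_to_word_ratio(token_count: int, word_count: int) -> float:
    """Return the number of tokens per word for a passage."""
    if word_count <= 0:
        raise ValueError("word_count must be positive")
    return token_count / word_count


# The English passage from the problem: 26 tokens across 20 words.
english_ratio = token_to_word_ratio(26, 20)
print(f"{english_ratio:.2f} tokens per word")  # prints "1.30 tokens per word"
```

A ratio of 1.30 means each English word in this passage costs 1.3 tokens on average.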
Practice 1
An AI agent processes a 24-word passage of Spanish text and counts 36 tokens. What is the token-to-word ratio?
Divide tokens by words
Practice 2
An AI agent processes a 15-word passage of Hindi text and counts 45 tokens. What is the token-to-word ratio?
Divide tokens by words
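One way to check the practice answers, and to see how the cost differs by language, is to compute all three ratios together. The sketch below uses only the word and token counts given in the problems; the names `passages` and `ratios_by_language` are illustrative assumptions, and comparing each ratio against English is just one convenient baseline.

```python
# (language, word count, token count) taken from the three problems above.
passages = [("English", 20, 26), ("Spanish", 24, 36), ("Hindi", 15, 45)]


def ratios_by_language(rows):
    """Map each language to its tokens-per-word ratio."""
    return {language: tokens / words for language, words, tokens in rows}


ratios = ratios_by_language(passages)
baseline = ratios["English"]  # compare every language against English

for language, ratio in ratios.items():
    relative = ratio / baseline
    print(f"{language}: {ratio:.2f} tokens/word, {relative:.2f}x English")
```

With these counts, the Hindi passage uses more than twice as many tokens per word as the English one, which is the cost difference the lesson describes.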
Next:
5. Will It Fit?




