Understanding Why Large Language Models Struggle with Counting Letters in Words
In recent discussions, you might have come across the humorous phenomenon where large language models (LLMs) seem unable to accurately count specific letters—famously, the R’s in the word “strawberry” (there are three, but models often answer two). This has led to some playful criticism, but what’s the underlying reason behind this limitation?
The Inner Workings of Large Language Models
At their core, LLMs process text by first dividing input into smaller segments known as “tokens.” Depending on the model’s tokenizer, these tokens may be whole words, parts of words, or individual characters. Once tokenized, each piece is mapped to a numerical representation called a “vector,” which encodes the token’s meaning in a high-dimensional space. These vectors are then passed through the model’s layers to generate predictions or outputs.
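To make this concrete, here is a minimal sketch of subword tokenization. The vocabulary and token IDs are entirely hypothetical (real tokenizers learn vocabularies of tens of thousands of pieces), but it illustrates the key point: a word like “strawberry” may reach the model as a couple of opaque IDs, not as ten individual characters.

```python
# Toy subword tokenizer -- vocabulary and IDs are made up for illustration.
TOY_VOCAB = {"straw": 101, "berry": 202}  # hypothetical subword pieces

def toy_tokenize(word):
    """Greedily split a word into the longest known subword pieces."""
    tokens = []
    i = 0
    while i < len(word):
        for j in range(len(word), i, -1):  # try longest match first
            piece = word[i:j]
            if piece in TOY_VOCAB:
                tokens.append((piece, TOY_VOCAB[piece]))
                i = j
                break
        else:
            # Unknown character: fall back to a single-character token.
            tokens.append((word[i], None))
            i += 1
    return tokens

print(toy_tokenize("strawberry"))
# -> [('straw', 101), ('berry', 202)]
# The model sees two token IDs, not the letters inside them.
```

In a real model, each ID would then be looked up in an embedding table to produce the vector that the network actually operates on.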
Why Counting Letters is a Challenge
It’s important to understand that LLMs are not designed for character-level precision. Their training optimizes for recognizing contextual patterns and predicting likely continuations based on vast amounts of text. Because the vector representations abstract away from the original textual details, they do not preserve exact character-by-character information. As a result, asking the model to count specific letters within a word falls outside what it was optimized to do, which leads to frequent inaccuracies.
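The contrast is striking when you compare this with ordinary code, which operates directly on characters rather than on learned token representations. Counting letters is trivial when the characters are actually visible to the program:

```python
# Plain string operations see every character, so counting is exact.
word = "strawberry"
r_count = word.count("r")
print(r_count)  # -> 3
```

This is also why LLM-based systems that can call tools (e.g. execute a snippet like the one above) answer such questions reliably: the counting is delegated to code that works at the character level.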
Conclusion
In essence, while LLMs excel at capturing semantic and contextual nuances across language, they are inherently limited when it comes to precise, character-level tasks such as counting specific letters within words. This limitation highlights the broader distinction between language understanding and detailed textual analysis.
Understanding these fundamentals helps us better appreciate both the strengths and limitations of large language models in various applications.