How many R's in strawberry?
Why does it get this question wrong when Claude on the website does not? The bottom one is Claude 3.7 Sonnet and the top one is GPT-4.1 mini. I asked Claude on the website the same thing and it gave the correct answer.


1 Reply
Because LLMs can't really count: they process text as tokens rather than individual letters. So if you ask the same question multiple times, even to the same model, you won't necessarily get the same answer (this goes for a lot of things, but in this case it's more obvious).
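
If you need a count you can rely on, it's usually safer to do it in code than to ask a model. A minimal sketch in Python; the helper name `count_letter` is just made up for illustration, and it assumes you want a case-insensitive count:

```python
def count_letter(word: str, letter: str) -> int:
    """Count case-insensitive occurrences of a single letter in a word."""
    # str.count is deterministic, so the result is the same on every run.
    return word.lower().count(letter.lower())


print(count_letter("strawberry", "r"))  # prints 3
```

Unlike a model's reply, this gives the same result every time you run it.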