• xthexder@l.sw0.com
      2 months ago

      I think the strawberry problem is to ask it how many R’s are in strawberry. Current AI gets it wrong almost every time.

      • Terrasque@infosec.pub
        2 months ago

        That’s because they don’t see the letters; they see tokens instead. A token can be a single letter, but is usually longer. So what the LLM sees might be something like:

        • st
        • raw
        • be
        • r
        • r
        • y

        Seen like that, it’s more obvious why LLMs struggle with it.
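
        To make that concrete, here is a rough Python sketch. It assumes the illustrative split from the list above (a real tokenizer will likely split the word differently), and the integer IDs are made up for the example, not taken from any real vocabulary.

```python
# Hypothetical token split taken from the list above; a real tokenizer
# may split "strawberry" differently.
tokens = ["st", "raw", "be", "r", "r", "y"]

# Character-level view: with the letters visible, counting is trivial.
word = "".join(tokens)
r_count = word.count("r")

# Token-level view: the model receives integer IDs, not letters.
# These IDs are invented for illustration (index in a sorted toy vocabulary).
vocab = {tok: i for i, tok in enumerate(sorted(set(tokens)))}
token_ids = [vocab[tok] for tok in tokens]

print(word, r_count)  # the letters and how many are "r"
print(token_ids)      # what the model would be given under this toy vocabulary
```

        Nothing in the ID sequence says which tokens contain an "r", so the model has to have learned the spelling of each token rather than being able to read it off.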

    • tempest
      2 months ago

      Ask an LLM how many Rs there are in strawberry.