• Catoblepas@piefed.blahaj.zone
    3 days ago

    OCR isn’t a large language model. That’s why poor-quality scans or damaged text sometimes come out as garbled nonsense. It isn’t predicting the statistically most likely next word; it’s matching each piece of the input against the individual characters it could be.
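
    To make the per-character idea concrete, here is a rough toy sketch, not how any real OCR engine is implemented. The 3x5 bitmaps, the `TEMPLATES` table, and the function names are all made up for illustration. Each glyph is matched to its nearest template on its own, so a damaged glyph can flip to the wrong letter and nothing at the word level corrects it.

```python
# Toy per-character matcher. The bitmaps and names here are hypothetical,
# chosen only to illustrate matching input to individual characters.
TEMPLATES = {
    "C": ("###", "#..", "#..", "#..", "###"),
    "O": ("###", "#.#", "#.#", "#.#", "###"),
    "L": ("#..", "#..", "#..", "#..", "###"),
    "D": ("##.", "#.#", "#.#", "#.#", "##."),
}


def distance(a, b):
    """Count the pixels that differ between two same-sized glyphs."""
    return sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))


def match_char(glyph):
    """Return the template character whose bitmap is closest to this glyph."""
    return min(TEMPLATES, key=lambda ch: distance(glyph, TEMPLATES[ch]))


def read_glyphs(glyphs):
    """Classify each glyph independently; no word-level context is used."""
    return "".join(match_char(g) for g in glyphs)


clean = [TEMPLATES[ch] for ch in "COLD"]

# Simulated scan damage: the "O" has lost two pixels on its right edge.
damaged_o = ("###", "#..", "#..", "#.#", "###")
damaged = [TEMPLATES["C"], damaged_o, TEMPLATES["L"], TEMPLATES["D"]]

print(read_glyphs(clean))    # COLD
print(read_glyphs(damaged))  # CCLD
```

    The damaged "O" ends up one pixel away from "C" and two away from "O", so the matcher reports "CCLD". It has no notion that "COLD" is a word, which is the kind of garbling described above.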