Phind is now using V7 of their model on their own platform, since they found that users overall prefer its output to GPT4's. This is extremely impressive because it's not just a random benchmark that can be gamed, but crowdsourced opinion on real tasks.

The one place everything still lags behind GPT4 is question comprehension, but this is still a huge accomplishment.

Blog post: https://www.phind.com/blog/phind-model-beats-gpt4-fast

Note: they've only openly released V2 of their model; hopefully they release newer versions soon… would love to play with them outside their sandbox.

  • Marians@sh.itjust.works
    7 months ago

    This is one of the most plausible claims to date because it is supported by anecdotal data from actual use scenarios rather than only by benchmarks that can be gamed.