• 1rre@discuss.tchncs.de
    link
    fedilink
    English
    arrow-up
    9
    ·
    13 hours ago

    I’ve found Gemini overwhelmingly terrible at pretty much everything, it responds more like a 7b model running on a home pc or a model from two years ago than a medium commercial model in how it completely ignores what you ask it and just latches on to keywords… It’s almost like they’ve played with their tokenisation or trained it exclusively for providing tech support where it links you to an irrelevant article or something

    • brucethemoose@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      edit-2
      13 hours ago

      Gemini 1.5 used to be the best long context model around, by far.

      Gemini Flash Thinking from earlier this year was very good for its speed/price, but it regressed a ton.

      Gemini 1.5 Pro is literally better than the new 2.0 Pro in some of my tests, especially long-context ones. I dunno what happened there, but yes, they probably overtuned it or something.