A screenshot of this question was making the rounds last week, but this article covers testing it against all the well-known models out there.

It also includes outtakes on the ‘reasoning’ models.

  • criticon
    17 hours ago

    Even when they give the correct answer they talk too much. AI responses contain a lot of garbage. When AI gives you an answer it will try to justify itself. Since they won’t give you brief responses the responses will be long.

    • chunes@lemmy.world
      16 hours ago

      I agree with you but found that DeepSeek was succinct.

      You need to bring your car to the car wash, so you should drive it there. Walking would leave your car at home, which doesn’t help.

      • [deleted]@piefed.world
        5 hours ago

        The second sentence is worthless garbage rambling that repeats the same point as the first sentence.

        • chunes@lemmy.world
          5 hours ago

          Yeah, I guess. You could make the argument the answer should just be “Drive.”

          • [deleted]@piefed.world
            5 hours ago

            It could be, although one concise sentence with the reason for the answer is better. The first sentence is optimal.

    • MDCCCLV
      16 hours ago

      Your post is much longer than it needs to be. That is why the models do it: they just copied people.

    • KeenFlame@feddit.nu
      12 hours ago

      It is so funny that the AI haters are the ones fervently ascribing human emotions and human thoughts to the process, and who then proceed to mansplain to you how the models are stochastic parrots. It is glaringly obvious they haven’t researched how it actually works, and this feels like the whole Facebook mom psychosis from way back, when they researched by reading lies. No, their responses can be very short too. It depends on your temp.