• brucethemoose@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      23 hours ago

      It’s honestly not that big a deal, as it’s not like knowing anything about how it was trained (beyond the config) would help you modify it. It’s still highly modifiable. It’s not like anyone can afford to replicate it.

      It would be nice to publish the hyperparameters for research purposes, but… shrug.

      I think a subset of the exact training data/hyperparameters would help with quantization-aware-training, maybe, but that’s all I got.

  • Korkki@lemmy.ml
    link
    fedilink
    English
    arrow-up
    53
    arrow-down
    9
    ·
    edit-2
    2 days ago

    Now that it has been “freedomized” it instead says that Israel is actually 3000+ years old and Palestinians are invaders, and Israel has the right to defend itself like chatgpt does. Also that American style liberal democracy is the peak of human development and civilization. Don’t kid yourself that this has anything to do with truth or making it more “accurate”. Yeah just replace one set of official truths, half thrush with other set of official positions, half truths and outright lies to plug the gaps. Again, who fact checks the factcheckers? Even just out of spite I would not use any model that advertises itself as being trained to answer to the sensibilities of a western techbro liberals so that they can once more outsource their thinking to outside party, because at-least it isn’t the CCP propaganda. Like it’s so absolutely dreadful when deepseek copy-pastes official CPC party line word for word, on Chinese form of government instead of making up some Wikipedia tier wall of text where the word “authoritarian” is about as common of a word as “the”.

    I just don’t get it man. What kind of non thinking cretin purposefully would use this kind of model? It’s probably really the pettiness that gets to me so badly. Or maybe it’s that while this is purposefully made to push a certain narrative, the deepseek’s bias is probably just a result that it’s been trainer on Chinese internet data.

    • Squizzy@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      ·
      12 hours ago

      My chatgpt will list the questionable human righfs record of Israel ad agree with the conclusion that they are, under the same standards and logic, likely guilty of their accused atrocities as thenlikes of China or Russia.

  • biofaust@lemmy.world
    link
    fedilink
    English
    arrow-up
    5
    ·
    1 day ago

    I run an uncensored version on my PC since weeks, there are multiple ones on HuggingFace.

  • brucethemoose@lemmy.world
    link
    fedilink
    English
    arrow-up
    34
    arrow-down
    3
    ·
    edit-2
    2 days ago

    That’s the great thing about open models. Censorship? Once identified, all it takes is one person and a bit of cash to get rid of it, though it seems Perplexity did a particularly good job (unlike some “abliterated” models that are pretty dumbed down).

      • brucethemoose@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        arrow-down
        1
        ·
        2 days ago

        In the 32B range? I think we have plenty of uncensored thinking models there, maybe try fusion 32B.

        I’m not an expert though, as models trained from base Qwen have been sufficient for that, for me.

        • Even_Adder@lemmy.dbzer0.com
          link
          fedilink
          English
          arrow-up
          2
          arrow-down
          1
          ·
          2 days ago

          I just want to mess with this one too. I had a hard time finding an abliterated one before that didn’t fail the Tiananmen Square question regularly.

  • FrankLaskey@lemmy.ml
    link
    fedilink
    English
    arrow-up
    24
    arrow-down
    6
    ·
    edit-2
    2 days ago

    I think we can all agree that modifications to these models which remove censorship and propaganda on behalf of one particular country or party is valuable for the sake of accuracy and impartiality, but reading some of the example responses for the new model I honestly find myself wondering if they haven’t gone a bit further than that by replacing some of the old non-responses and positive portrayals of China and the CPC with a highly critical perspective typified by western governments which are hostile to China (in particular the US). Even the name of the model certainly doesn’t make it sound like neutrality and accuracy is their primary aim here.

    • Aatube@kbin.melroy.org
      link
      fedilink
      arrow-up
      13
      arrow-down
      1
      ·
      2 days ago

      ehhhh, the only thing the model got quite wrong was the level of control on access to media, internet, and especially education. Other than that the article’s example responses seem pretty on-point. (I only otherwise found a blemish where a few words needed further clarification; I found no other errors in my first reading.) Though I do also find the name of the model quite off-putting.

    • brucethemoose@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      2 days ago

      Well you can merge it with the original model, to any degree, to get any sliding scale of “bias” you want.

      Practically, though, I guess that’s not super practical, as very few have the hardware or cash to deploy a custom full R1 themselves.

    • iopq@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      arrow-down
      5
      ·
      2 days ago

      What part is highly critical of China? Facts can’t be critical

      • fruitycoder@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        5
        ·
        2 days ago

        Listen, I’m highly critical of the CCP, but LLMs aren’t facts machines, they are make text like what they are trained on machines.

        They have no grasp of truth, and we can only get some sense of truth of what the average collective text response of its dataset (at best!).

  • ZILtoid1991@lemmy.world
    link
    fedilink
    English
    arrow-up
    10
    arrow-down
    6
    ·
    2 days ago

    IDK, but this seems like wankery to me. Just google it if you want to know about it, the AI isn’t an “all knowing being” nor “the arbitrer of truth”.

    I have a feeling that a new logical fallacy will soon emerge (if it isn’t already widespread on certain places of the internet), that will be “X is true because the LLM said so”.

    • fruitycoder@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      3
      ·
      2 days ago

      It’s really an extension of “Would some really do that? Just lie on the Internet?” But now “Would AI, which is built to create content like what people post on the Internet, really just lie?”