• 3 Posts
  • 218 Comments
Joined 1 year ago
Cake day: August 26th, 2023

  • this is obviously talking about their web app, which most people will be using. In this particular instance, it was clearly not the LLM itself censoring Tiananmen Square, but a layer on top.

    i have not bothered downloading deepseek and asking it about Tiananmen Square, so i cannot know what the model would have generated. however, it is possible that certain biases are trained into any model.

    i am pretty sure this blog is aimed at the average user. while i wouldn’t trust any LLM company with my data, i certainly wouldn’t want the chinese government to have it. anyone who knows how to use [ollama](https://github.com/ollama/ollama) should know this telemetry doesn’t apply when running locally. but for sure, pointing it out in the blog would help.
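
to make the "running locally" point concrete, here is a minimal sketch of querying a model through a local ollama server. it assumes ollama is running on its default port (11434) and that a model tagged `deepseek-r1` has already been pulled; the model name and the helper names `build_request` and `ask_local` are just picked for illustration.

```python
import json
import urllib.request

# Assumption: ollama's default local endpoint. The request only goes to localhost.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt, model="deepseek-r1"):
    """Build a non-streaming generate request for a local ollama server.

    The model tag is an assumption; use whatever `ollama list` shows.
    """
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask_local(prompt, model="deepseek-r1"):
    """Send the prompt to the local server and return the generated text."""
    with urllib.request.urlopen(build_request(prompt, model)) as resp:
        return json.loads(resp.read())["response"]
```

calling `ask_local("What happened at Tiananmen Square in 1989?")` with the server running would show what the model itself generates, without the web app's layer on top.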




  • as @[email protected] already mentioned: GitLab CI

    Jenkins is a CI application from before CI was cool. GitLab CI is integrated and can trigger on certain events. Additionally, you mentioned that you want to publish to a public repo anyway.

    You are probably comfortable with containers, so GitLab CI should be easy for you to learn, as it pretty much starts up a container to do certain tasks. I’ve seen suggestions for Kubernetes, which for sure is the more mature solution. But i would question whether you need the added functionality and complexity of K8s for a home setup.

    To gain access to your local network, you can use the runner for a secure connection (as described by damnthefilibuster). Or you could SSH into the machine, as long as you have it in a DMZ. The drawback is that you have to be more sure about your network infrastructure. The benefit is that it is a more general approach. Obviously you need to store all certs, keys and preferably even addresses in secrets, not in the .gitlab-ci.yml.
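
A rough sketch of the "secrets, not the .gitlab-ci.yml" point, assuming the deploy step is a small Python script run inside the CI job. GitLab exposes CI/CD variables to the job as environment variables; the names `DEPLOY_HOST`, `DEPLOY_USER` and `DEPLOY_SSH_KEY` are made up for this example, and `DEPLOY_SSH_KEY` is assumed to be a file-type variable, so it holds a path to the key file.

```python
import os


def build_ssh_command(remote_command, env=None):
    """Assemble an ssh invocation from CI/CD variables instead of hardcoded values.

    DEPLOY_HOST, DEPLOY_USER and DEPLOY_SSH_KEY are hypothetical variable names.
    A missing variable raises KeyError, so the job fails loudly instead of
    falling back to some default host or key.
    """
    env = os.environ if env is None else env
    host = env["DEPLOY_HOST"]
    user = env["DEPLOY_USER"]
    key_path = env["DEPLOY_SSH_KEY"]  # path to the private key file
    return [
        "ssh",
        "-i", key_path,
        "-o", "StrictHostKeyChecking=yes",  # refuse hosts with unknown keys
        f"{user}@{host}",
        remote_command,
    ]
```

The resulting list can be handed to `subprocess.run(cmd, check=True)`. This way the repository only contains variable names, while addresses and keys stay in the project's CI/CD settings.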

    As you can see from this thread, many roads lead to Rome. My advice is to start with something simple and lightweight that you understand. Adding complexity down the road is easier than removing it.


  • The main angle is not to ‘poison’ the training set. it is to waste time, energy and resources. the site loads deliberately slowly and produces garbage, which has to be filtered out.

    as i said: not a silver bullet. but at least some threads were tied up collecting garbage painfully slowly. as the data is useless, whatever their cleanup process is has more to do. or it might even be tricked into discarding the whole website, as the signal-to-noise ratio is bad.

    so i would still say the author achieved his goal.
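
the "slow garbage" idea can be sketched in a few lines. this is not the author's actual implementation, just an illustration under assumptions: the word list, the function name `garbage_chunks` and all parameters are invented here.

```python
import random
import time

# Assumption: any word list works; the output only has to look like text.
WORDS = ["lorem", "ipsum", "dolor", "sit", "amet", "consectetur", "adipiscing", "elit"]


def garbage_chunks(n_chunks, words_per_chunk=8, delay=0.0, seed=None):
    """Yield lines of meaningless text, pausing `delay` seconds before each.

    A server streaming these chunks keeps the crawler's connection open
    for roughly n_chunks * delay seconds while delivering nothing useful.
    """
    rng = random.Random(seed)
    for _ in range(n_chunks):
        time.sleep(delay)  # the deliberate slowness
        yield " ".join(rng.choice(WORDS) for _ in range(words_per_chunk)) + "\n"
```

with something like `delay=2.0` and a few hundred chunks, one request ties up a crawler thread for minutes, and everything it collected still has to be filtered out afterwards.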











  • you just need to look: Greenpeace is not exactly the cuddly type. if you want a more violent approach, may i introduce you to Sea Shepherd.

    They were pretty much founded by people wanting to give seal hunters a taste of their own medicine. So far they have executed some quite big and well-organised operations.





  • the ceo is just the effect, not the cause. the us laws allow such bullshit and do not protect the weak (at all). what this one ceo did was, like what many other ceos do, immoral but legal. you can’t jail someone for legal stuff.

    change the system and force them to adhere to modern moral standards. if they try to pull some bs then, it is quite easy to lock them away.


  • ToxicWaste@lemm.ee to memes@lemmy.world · Person of the year
    2 months ago

    you are painting an oversimplified picture.

    “i am sure you’d have preferred Gandhi to pick up a gun because he was met with violence?” we can chase each other with such oversimplifications forever.

    reality is much more complicated than such simple statements. so let’s not use their inflammatory nature and focus on the actual problem, which in this case seems to be that people feel abandoned by society to such a degree.