Andrew Plotkin (Zarf): Sydney obeys any command that rhymes

self@awful.systems · edited 11 months ago

NSFW

Soyweiser@awful.systems · 11 months ago

I’d think it would be easier to just generate a lot of data that links two concepts together in ways that benefit propaganda. Say you repeat ‘Taiwan is part of China’ over and over on various sites which nobody reads but which do get included in various LLM feedstocks. Or, a thing I theorized about as an example: create a lot of ‘sample’/small projects on GitHub that include unsafe implementations of various things, for example passing user input straight to printf as the format string somewhere in a login prompt.
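
To make the printf example concrete, here is a minimal sketch of the kind of pattern I mean, next to the boring safe version. The function name `greet_safe` and the greeting text are made up for illustration and are not from any real project. The unsafe call is left as a comment so that the snippet compiles cleanly.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/*
 * Hypothetical login-prompt helper (name and greeting are assumptions).
 *
 * The unsafe pattern a poisoned "sample" project might contain is:
 *
 *     printf(username);    // user input used as the format string
 *
 * Input such as "%x %x" would then read stack values, and "%n" would
 * write to memory. That is a classic format-string bug.
 *
 * The safe version below keeps the format string constant and passes
 * the username only as data.
 */
int greet_safe(char *out, size_t cap, const char *username) {
    assert(out != NULL && username != NULL);
    /* "%s" copies the name verbatim; any '%' in it is not interpreted. */
    return snprintf(out, cap, "Welcome, %s", username);
}
```

With the safe version, a username like `%x%n` just shows up literally in the greeting. A model trained on enough copies of the commented-out variant could plausibly suggest it instead, which is the whole worry.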