finally debunked@slrpnk.net to

Ask Lemmy@lemmy.world · 1 year ago

for ML engineers: why can't you simply exclude the word "fuck"?

So, I’ve heard that ML models manipulate tokens, and that for English corpora specifically, tokens take the place of words. If we want a model to be polite and not use uncomfortable language, couldn’t we remove certain words, for example “fuck”, from the internal array where all tokens and their associated data are stored?
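
A minimal sketch of what “removing a word from the internal array” could look like at generation time: instead of deleting the vocabulary entry, the banned token's score is forced to negative infinity before the scores are turned into probabilities. Everything here is an assumption for illustration: the seven-entry `VOCAB`, the made-up `logits`, and the helper names `softmax` and `ban_tokens` are hypothetical and not taken from any real model. Real tokenizers usually work on subword pieces rather than whole words, which is why the toy vocabulary also contains fragments.

```python
import math

# Hypothetical toy vocabulary. Real vocabularies hold tens of thousands of
# subword tokens, so a word can often be spelled from smaller pieces.
VOCAB = ["the", "duck", "fuck", "f", "uck", "ing", "!"]


def softmax(logits):
    """Turn raw scores into probabilities that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def ban_tokens(logits, banned_ids):
    """Return a copy of logits where banned ids are -inf (probability 0)."""
    return [-math.inf if i in banned_ids else x for i, x in enumerate(logits)]


# Made-up scores for the next token, one per vocabulary entry.
logits = [1.0, 0.5, 2.0, 0.8, 0.7, 0.2, 0.1]

banned = {VOCAB.index("fuck")}
probs = softmax(ban_tokens(logits, banned))

for token, p in zip(VOCAB, probs):
    print(f"{token!r}: {p:.3f}")
```

In this sketch the banned entry ends up with probability 0, but the fragments `"f"` and `"uck"` keep a non-zero probability, so the same string could still be assembled from pieces. That is one reason a single-entry ban is weaker than it first sounds.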


BURN@lemmy.world
1 year ago
ML/generative AI models don’t “store” an internal array of specifics. Instead, it’s a statistical model that predicts the next (or in ChatGPT’s case, 3rd most likely) word in a sentence.

To avoid swearing, or really anything else, it needs to be excluded at the training level, before the model is trained.

As it stands, we have very little to no visibility into why these models work. Even the researchers are still trying to open the black box, but there’s so much in there that it’s nearly impossible to isolate a node that would or would not contain the word “fuck”.
- BetaDoggo_@lemmy.world
  1 year ago
  ChatGPT’s sampling parameters are unknown, and it definitely doesn’t always choose the 3rd most likely word. More complicated sampling methods are probably used, such as temperature, top-p, and top-k.
  - BURN@lemmy.world
    1 year ago
    Correct, but also way over the level of the average reader
    
    I probably should have used a different example other than ChatGPT tbh
    - wispydust@sh.itjust.works
      1 year ago
      That’s alright. You did good simplifying an unrelated idea for the sake of explaining another concept.
- xerox@lemm.ee
  1 year ago
  
  (or in ChatGPT’s case, 3rd most likely)
  
  Why 3rd?
  - BURN@lemmy.world
    1 year ago
    I believe the 3rd (or nth) word is used because it sounds more human. The statistically most likely word ends up sounding very robotic and forced, whereas the 3rd is still very likely correct but leads to variation in responses.
    
    This is all from what I remember from reading a mini-paper about it, so I could be wrong.
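
For readers wondering what temperature, top-k and top-p (the methods BetaDoggo_ names) actually do, here is a rough sketch in plain Python. It is not ChatGPT's real sampler, whose settings are unknown as noted in the thread. The function name `sample_next`, its parameters, and the example scores are all hypothetical. The point it illustrates is that the variation in responses comes from a weighted random draw among the likely candidates, not from always taking a fixed rank.

```python
import math
import random


def sample_next(logits, temperature=1.0, top_k=None, top_p=None, rng=random):
    """Pick a token index from raw scores (hypothetical helper, see lead-in)."""
    # Temperature: values below 1 sharpen the distribution, above 1 flatten it.
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Rank token indices from most to least likely.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)

    # Top-k: keep only the k most likely candidates.
    if top_k is not None:
        ranked = ranked[:top_k]

    # Top-p (nucleus): keep the smallest prefix whose total probability
    # reaches top_p.
    if top_p is not None:
        kept, cumulative = [], 0.0
        for i in ranked:
            kept.append(i)
            cumulative += probs[i]
            if cumulative >= top_p:
                break
        ranked = kept

    # Weighted random draw among the survivors; random.choices treats the
    # weights as relative, so no renormalisation is needed.
    weights = [probs[i] for i in ranked]
    return rng.choices(ranked, weights=weights, k=1)[0]


# Made-up scores for four candidate tokens.
scores = [2.0, 1.0, 0.5, 0.1]
print(sample_next(scores, temperature=0.8, top_k=3, top_p=0.9))
```

With `top_k=1` the function always returns the single most likely index, which corresponds to the "robotic" behaviour described above. Loosening `top_k` or `top_p`, or raising `temperature`, lets lower-ranked candidates through some of the time.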
