I’ve noticed some files I opened in a text editor have all kinds of crazy unrenderable chars

  • cheese_greater@lemmy.worldOP
    11 hours ago

    I do understand I’m not able to read it myself, I’m more curious about the architecture of how that data is represented and stored and conceptually how such representation is practically organized/reified…

    • AbouBenAdhem@lemmy.world
      54 minutes ago

      The original binary data is split into six-bit chunks (e.g., 100101, which is 37 in decimal), so each chunk is an integer from 0 to 63. These are just mapped to characters in order:

      1. 000000 = A,
      2. 000001 = B,
      3. 000010 = C,

      etc. It goes through the capital letters first (0 to 25), then lower-case letters (26 to 51), then digits (52 to 61), then “+” (62) and “/” (63). It’s so simple you could do it by hand from the above description, if you were looking at the data in binary format.
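
To make the by-hand procedure concrete, here is a minimal Python sketch of it. The function name `encode_by_hand` is made up for illustration. It also covers one detail the description above leaves out: when the input is not a multiple of three bytes, standard Base64 fills the last chunk with zero bits and pads the output with "=" to a multiple of four characters.

```python
import base64
import string

# Index 0-63 maps to A-Z, a-z, 0-9, "+", "/" in that order.
ALPHABET = string.ascii_uppercase + string.ascii_lowercase + string.digits + "+/"


def encode_by_hand(data: bytes) -> str:
    """Encode bytes as Base64 by literally slicing a bit string (illustrative)."""
    # Write every byte as 8 bits and join them into one long bit string.
    bits = "".join(f"{byte:08b}" for byte in data)
    # Fill the last chunk with zero bits so the length is a multiple of 6.
    bits += "0" * (-len(bits) % 6)
    # Look up each 6-bit chunk in the alphabet.
    chars = [ALPHABET[int(bits[i:i + 6], 2)] for i in range(0, len(bits), 6)]
    # Pad the output with "=" to a multiple of 4 characters.
    chars.append("=" * (-len(chars) % 4))
    return "".join(chars)


if __name__ == "__main__":
    print(encode_by_hand(b"Man"))  # prints TWFu
    # Cross-check against the standard library encoder.
    print(encode_by_hand(b"Man") == base64.b64encode(b"Man").decode("ascii"))
```

Building a string of "0" and "1" characters is slow, and real encoders use bit shifts instead, but it mirrors the description step by step.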