On Tuesday, Microsoft Research Asia unveiled VASA-1, an AI model that can create a synchronized animated video of a person talking or singing from a single photo and an existing audio track. In the future, it could power virtual avatars that render locally and don’t require video feeds—or allow anyone with similar tools to take a photo of a person found online and make them appear to say whatever they want.

  • cygnus
    7 months ago

    We’re going to need strong digital signatures on everything, and we need them fast, or else we won’t be able to believe anything we see. It will be Steve Bannon’s “flood the zone with shit” dream come true.

    • simple@lemm.ee
      7 months ago

      > We’re going to need strong digital signatures on everything

      That won’t help anything considering how easy it is to strip metadata.

      • cygnus
        7 months ago

        I mean the opposite scenario, where if there’s no signature we assume it’s fake.

        • catloaf@lemm.ee
          7 months ago

          We’ve had email forgery, and signatures to prevent it, for decades, but barely anyone uses those either.