• wwb4itcgas@lemm.ee
    link
    fedilink
    English
    arrow-up
    3
    ·
    1 day ago

    Interesting point, although I don’t see how you’d manage to run modern TTS (the models can get very large, and that’s per voice; as an example Parler-TTS’s mini model is 800Mb, the HQ model is 2.3Gb - for one voice) + a LLM for content synthesis on any personal hardware, console or not. The storage requirements alone would make that grossly infeasible.

      • wwb4itcgas@lemm.ee
        link
        fedilink
        English
        arrow-up
        4
        ·
        12 hours ago

        “…and a great deal of patience as you wait for each NPC to formulate their replies. In the meantime, they’ll just be standing there looking at you with glassy eyes, smiling.”