cross-posted from: https://lemmy.ml/post/15471632

Codeberg was asking about this. The linked toot by a commenter points to :

SEqlite

These are CC-BY-SA 4.0 remixes of the Stack Exchange Creative Commons Data Dumps. 100% Unendorsed by Stack Exchange, Inc.

They are minimal. They provide the data you probably care about and the data you need to comply with the original license in SQLite format.

  • lambalicious@lemmy.sdf.org
    link
    fedilink
    English
    arrow-up
    11
    arrow-down
    1
    ·
    7 months ago

    People who are looking to start a SE alternative but start with the idea of importing the original SE data dumps are already Doing It Wrong. Much of the issue that has led to the desire to fork SE comes due to the license of the posts and content, which lacks the NC (NonCommercial) component of Creative Commons. Without that component, any attempt to make a Fediverse alternative just ends up in Yet Another Endpoint that can be freely siphoned for data by corporations, for AIs, etc.

    • fruitycoder@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      2
      ·
      7 months ago

      Or used by people in any commercial product. Is there really enough people to justify a info exchange of just hobby projects?

      • lambalicious@lemmy.sdf.org
        link
        fedilink
        English
        arrow-up
        6
        arrow-down
        1
        ·
        7 months ago

        See here’s the thing: Creative Commons is not an exclusionary license. If I want to make commercial use of something that has a CC-NC license, I explicitly can ask the author for a secondary license limited to the usage and scope that I need. The important thing here is that the author still retains control, as well as a data point of who is profiting from their stuff and how.

        • fruitycoder@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          7
          ·
          7 months ago

          So if someone wanted to use this for work they would have to have an issue, find an answer, contact a person, and hope they can use the thing they just found to their problem?

          Like, who wants that?

          Heck I don’t want every person on here who found something I said useful to be hounding me about using my code either.

          • lambalicious@lemmy.sdf.org
            link
            fedilink
            English
            arrow-up
            1
            ·
            7 months ago

            Like, who wants that?

            Have you literally missed out on the fact that the protest is happening? The protest is certainly not because SO answers are bad.

    • Miaou@jlai.lu
      link
      fedilink
      English
      arrow-up
      2
      arrow-down
      1
      ·
      7 months ago

      They have already access to SO’s CC content, why would they get it from the fediverse?

      • lambalicious@lemmy.sdf.org
        link
        fedilink
        English
        arrow-up
        1
        ·
        7 months ago

        They already have it.

        I said alternative to SO. As in, likely, a place to post new content (answers, comments). Nothing can really be done with the content OAI already got their hands on other than firing off a few well-placed EMP bombs.

        • Miaou@jlai.lu
          link
          fedilink
          English
          arrow-up
          1
          ·
          7 months ago

          Yes, but you mentioned importing old content is problematic, and I don’t see why?

          • lambalicious@lemmy.sdf.org
            link
            fedilink
            English
            arrow-up
            1
            ·
            7 months ago

            Because to import old content, you have to respect the old license (or get every contributor of back-then to relicense). That would mean having a site with contents under differing licenses depending on date, which is something the corpos can use as an excuse to continue siphoning everything without consequence.

            I’m fine with a mirror / archive of SO. But it shoudl very definitively be a different thing than an active SO alternative, and their users and data storages should be also different.