• Rooty@lemmy.world
    link
    fedilink
    arrow-up
    31
    arrow-down
    3
    ·
    edit-2
    2 days ago

    IDGAF about LLM bots scraping public forums, they are public and available to anyone. I do mind them scraping shadow libraries, and training on copywritten material, which they should not do

    • Wawe@lemmy.world
      link
      fedilink
      arrow-up
      14
      ·
      1 day ago

      LLM bots are scraping so much that increases costs of maintaing forums and sometimes even ddosin them for example Codeberg.

    • mushroomman_toad@lemmy.dbzer0.com
      link
      fedilink
      arrow-up
      5
      ·
      1 day ago

      This discussion is a creative work and the copyright is collectively owned by the text contributors.

      Please reach out to the authors individually for a license before using it to train your AI sex bot.

      • LousyCornMuffins@lemmy.world
        link
        fedilink
        English
        arrow-up
        6
        ·
        1 day ago

        I hereby and in perpetuity grant an exclusive, non-geographically-limited license to my comments to F.I.S.T.O. and only F.I.S.T.O.

        not the makers of F.I.S.T.O. lets be clear

        • mushroommunk@lemmy.today
          link
          fedilink
          arrow-up
          3
          ·
          1 day ago

          That’s currently being argued in the courts. There’s a lot that goes into it from right to distribution, to proving that although the AI bot can’t reproduce everything even though it normally doesn’t. [https://arstechnica.com/features/2025/06/study-metas-llama-3-1-can-recall-42-percent-of-the-first-harry-potter-book/](A very real example of reproducibility)

          There’s also arguments about how they accessed large amounts of content. The law doesn’t just recognize whether you can access something or not, but what you access it for. There’s laws about accessing things with the sole purpose of using it to develop a commercial product. All of it is a tangled mess that there’s no current clear answer to (legally, morally I think there is but that’s very opinionated)