• incognito_mode@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    arrow-down
    1
    ·
    1 year ago

    This is a great point. The user data needs to be enshrined in such a way that it can be easily moved in a bulk migration without requiring a direct opt-in from every user. While at the same time making it clear how it’s being used/kept/sold/not sold/etc.

    I’m not against LLMs using the data generated on sites like this to inform useful answers when I ask ChatGPT a question. It genuinely makes AI a better tool, but I feel like the contributors of such content should know how their answers are being used.

    • lightrush@lemmy.ca
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      1 year ago

      LLMs are likely going to scrape no matter the license. I doubt OpenAI got a copyright license from Reddit to ingest it. In fact I’m not even sure they need one if ingestion can be make similar enough to “reading the web site”. And so making content CC probably won’t affect LLM use of public posts.

    • pwnstar@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 year ago

      What license would be appropriate for that? I’ve always been interested since I do photography, and it seems like any site like that needs nearly full rights so that they can store and distribute as they see fit so that they can do backups, migration, etc. What license would give those, but keep the full rights of the creator intact?

      (I know nothing on the topic, just curious)