A title I never expected to write, but there it is.

NYT updated their TOS to prevent scraping for AI training purposes and might be able to cripple OpenAI with their lawsuit.

  • beef_curds [she/her]@hexbear.net
    link
    fedilink
    English
    arrow-up
    19
    ·
    edit-2
    1 year ago

    This is how it will shake out though.

    If you’re a big corporation (facebook/nyt), ChatGPT will have to pay you massive licensing fees to scrape. If you’re a regular person posting to a platform, you will have no protections because of a platform EULA. The platform will call your work “data” and sell all rights to it, to whatever AI can pay.

    The best AI will cost more than you can afford, because they have paid the most for their datasets (your work.)

  • LanyrdSkynrd [comrade/them, any]@hexbear.net
    link
    fedilink
    English
    arrow-up
    5
    ·
    1 year ago

    NYT also wanted to exclude Google from scrapping their site to build their AI but cannot without completely removing their links from Google search because Google will not provide that option.

    Instead they signed a contract with Google to allow them to use their content in Bard. Then they filed this lawsuit against OpenAI.

  • Dizzy Devil Ducky
    link
    fedilink
    English
    arrow-up
    1
    ·
    1 year ago

    Cannot wait for someone like myself in the future to get a server running with multiple drives so I can set up a chatbot “AI” by going to each individual page and copying the text of the various articles.

    Obviously it’ll be private and only I would have access to it, so what would any of these companies be able to do if they cannot figure out that I am doing it? Send the police after someone who visits their old web articles multiple times throughout the day?