Outside of a web scraper, how sure are we that this poisons reddits actual data being sold to ai companies? It seems trivial for them to have an original comment field in the database that’s invisible to users or just use backed up data. Or even an anonymized copy of all all original comments not linked to any account that is solely for AI training.
The only thing “deleting” your data does is give Reddit an extra data point. Now they can tell their AI model “this comment was written by the kind of person who will try to delete their comment history”, which is an amazing data point for an AI model.
Outside of a web scraper, how sure are we that this poisons reddits actual data being sold to ai companies? It seems trivial for them to have an original comment field in the database that’s invisible to users or just use backed up data. Or even an anonymized copy of all all original comments not linked to any account that is solely for AI training.
The only thing “deleting” your data does is give Reddit an extra data point. Now they can tell their AI model “this comment was written by the kind of person who will try to delete their comment history”, which is an amazing data point for an AI model.
Is it more or less likely to be poisoning the data being sold than doing nothing at all?