- cross-posted to:
- technology@lemmy.world
- technology@lemmy.world
- cross-posted to:
- technology@lemmy.world
- technology@lemmy.world
I mean, here is a thought, if an AI tool uses creative commons data, then it’s derivatives fall under creative commons. I.e. stop charging for AI tools and people will stop complaining.
So what is the stack overflow replacement?
that would be great if they federated and implemented activitypub/atproto!
let’s all go back to experts exchange
Expert sex change?
This feels a little iffy to me. it rings of what happened with reddit.
Based users
Like when I heard reddit was doing the API lockdown, I wrote an automation bot over the weekend that self-destructed my subreddit and the entire post history.
The bot also automatically downloaded and archived all of the content on my local machine, and because at the time reddit had changed their API to only show the first X posts (100 or 1,000 or whatever) as my bot deleted the most recent posts, reddit had no choice but to show me the old content.
And that’s how I archived my subreddit. Reddit banned me two days later for automation, lol. I did not break any of the reddit or reddit api ToS during this process but I guess I upset someone.
I don’t think I’ve been banned, but I did a similar thing. I requested all my data from Reddit, then used that list of comment/post IDs to mass-edit them. I think I’m in the clear because I used the official third party API, with an official “app.” If you used the private API or instrumented this via the browser, that may be why you were banned.
Anyway, if you or someone else wants their full history, Reddit will give it to you via a data export request.
Unfortunately they still have everything. It’s good for the “human” visibility (lack of) but they have the data still
Oh I know, I just wanted a copy too.
Deleting posts from the user PoV was the only way I could come up with to force the API to show them to me.
We can’t even communicate with out being leeched upon. Fuck this is grim
Stack Overflow just earned a place under Reddit in the hosts block list.
Why delete the answer, why not edit it so that a human can see the answer but for AI its a load of nonsense?
People did that. Stack overflow reverted the change.
If that would happen, I assume companies would just grab an older copy of the dumps from before people started editing their stuff because of the AI bullshit.
SA would ban everyone sabotaging their business plans and things would move on like normal, like what happened to Reddit.
So we need to up vote wrong answers only?
Editing any content to reduce its quality is considered vandalism and gets reverted on SO.
There’s no way that would work either, they can just store the full edit history and auto-curate as needed.
There is, I believe, a fundamental misunderstanding as to what exactly a site like Stack Overflow is. It’s not a forum; there’s no such thing as “your posts.” It’s more like Wikipedia, as in a collaborative question-and-answer site, or a knowledgebase. Each question and answer can be edited like a mini wiki page. They aren’t “yours” any more than the Wikipedia page you created ten years ago is; you contributed it to the commons, so (at least in theory) you don’t have the right to take it back.
Whether whatever "Open"AI is doing is right is another question, of course. But, I don’t think destroying or poisoning the commons to strike back at it is any helpful either; it feels like “destroying it to save it.”
Fine, but when coding projects undergo licensing changes that the contributors are against, the code author has to remove those contributions and replace them.
I feel like this content craze is going to evaporate soon because all the new content from here forward is sure to be polluted by LLM output already. AI is fast becoming a snake eating its own tail.
That reminds me. I should go update my licenses to spit in the face of AI training companies.
Good luck with the deleting. It often just means
UPDATE comments SET is_deleted = 1 WHERE ID = 666;
.They are not deleting, they are editing. So the platform would have to undo those edits rather than just flipping the visibility flag.
And they are. 😞
There was similar things done on Reddit during the big exit. I doubt it achieved what people expected it to achieve. Even if they’re not visible externally, I’m sure they can easily access (thereby make deals to license) the data out of their backend / backup; just a matter of how hard they want to try (hint: it’s really not very hard).
Yeah during the reddit exodus, people were recommending to overwrite your comment with garbage before deleting it. This (probably) forces them to restore your comment from backup. But realistically they were always going to harvest the comments stored in backup anyway, so I don’t think it caused them any more work.
If anything, this probably just makes reddit’s/SO’s partnership more valuable because your comments are now exclusive to reddit’s/SO’s backend, and other companies can’t scrape it.
It was to make the data inaccessible to general people, therefore removing the reason people visit reddit. Even if reddit could still get the data, regular people would be inconvenienced (in theory) and look somewhere else.
Does GDPR apply to stackoverflow? Since my data there probably does not identify me as a person?
You van delete your data but I don’t think it magically makes derivative works disappear. Its licenses SA. This is good.
Would be a shame if someone used ChatGPT to generate bad answers and a short script to resubmit them back to Stackoverflow. So awful.
SO has mechanisms in place to filter out AI-generated content.
I don’t believe that.
This says nothing about filtering mechanisms
Ah, I think I got the source of misunderstanding: these mechanisms are not automated, but implemented as moderation guidelines and rules.
Data Rule Numero Uno:
Garbage in, garbage out.
Have fun training your LLM on a big steaming pile of hot garbage. That’s 80% of Stack Overflows content.
One time I was went on there to figure out an issue in Arduino. The answer one guy gave was “I don’t know how to do this in Arduino, here’s how you do this in Java”. Not only the the mods prevent any other answers from being posted, I tried the guy’s suggestion in Java and it didn’t even work
Mostly “this has been answered in another thread” and “why don’t you Google it” comments in my experience.
Can’t wait until the top answer to every Google search is “just google it”
The other 20% is mostly high quality however, and I’m sure they’d filter out the heavily downvoted crud.
You say that as if the garbage gets downvoted
It’s just a matter of time until all your messages on Discord, Twitter etc. are scraped, fed into a model and sold back to you
As if it didn’t happen already
Like AI doesn’t know how to use the way back machine?