Well then I’m not pirating anything, I’m just downloading data and if it happens to correlate to the new Aliens movie then that’s not my problem. 😮
I didn’t pirate anything. I just lopped off a few frames from the original file and check it out, it produces a new hash.
Different hash, different files, so it’s not actually breaking copyright!
It is fucking wild that this is basically what AI companies are arguing. “We did so much piracy it no longer counts as piracy.”
That’s not what they’re arguing, not even close.
The coolest and most frightening thing about all that is the number of books they train the models on are immense, but the model data is very tiny comparatively. And while the compression is amazingly lossy it still has an amazing amount of the data in there.
To nvidas credit, The training models do not contain the contents of the books, but they can still tell you intimate details about the books without it being able to provide a photographic reproduction of everything in the book.
We’ve literally created something that can analyze books in the same way that we read them and retain the same lossy levels of information. That’s honestly pretty f****** amazing.
Obviously intellectual property laws aren’t designed for this. Hell even our concept of intellectual property isn’t designed for this. If this was a corporation that hired a thousand people to read a bunch of books and be on tap for queries about the information in those books nobody would complain. One copy of each book purchased would be enough to cover the intellectual property restrictions for this.
Also obviously this isn’t what happened and people see money lying on the table.
They literally trained their models on the books lol
You’d think they of all people would understand the concept of data leakage.
just sounds like copyright laundering to me.
That books3 data keeps biting AI asses and it makes me so happy.
Like what the fuck did you expect when the data was explicitly always known to have come from private tracker Bibliotik?
Just because you literally downloaded everything from a pirate site, you went so big, that you’re argument is that it’s not piracy anymore? Get the fuck out of here.
Like maybe if they had bought all the books and collated them into a text file themselves, instead of some dweeb online doing it with one of the largest private trackers for ebooks that literally releases tools to remove DRM from ebooks, they’d have a case that it isn’t piracy.
But they chose to get it from literally a private pirate site whose goal is explicitly to share ebooks and remove DRM from ebooks.
Get fucked, corpos.