Adjective_Noun_#### usernames are generated by reddit by default, so it seems they at least upgraded to their own generator.
It doesn’t matter what it tells me. Personal data is clearly defined under GDPR as data that can be used to identify a person. It is irrelevant whether you or I can do it with publicly available data; reddit has the data, and that is enough to qualify it as such.
A DPA might absolutely disagree with my reading of the situation. I would be surprised if a DPA considered usernames non-personally identifiable information, and I know of no such ruling.
Ah, alright. Didn’t check old.reddit
You have to give one while signing up (just checked), unless you go through Apple or Google ID services. Either way, they still log your IP and other metadata, not to mention that your username does exist.
I’d argue it is, but that’s where the judgement of the DPAs comes in. It’s definitely possible that some, if not all of them, wave this through as “it’s fine”. But unless eyes are put on it, any shenanigans will simply occur.
I don’t know how it might go, but giving it a try is basically free.
Also, I appreciate your consideration of my perspective!
It is not enough, no. The LLM might reveal training data, showing the original text, and that is a simple Google search with site:reddit.com away from identifying the user. It’s trivial and thus not anonymized.
It doesn’t matter. As long as the text is supplied as is, a simple Google search with the text and site:reddit.com will reveal the author, keeping it identifiable. True anonymization under GDPR almost does not exist, as it would destroy the dataset and make it unusable.
That is not quite correct. As long as it is possible to identify the user, it is personal data. True anonymization under GDPR is nearly impossible without destroying the data set.
Reddit would have to fully delete it, otherwise simply searching Google with the exact text with site:reddit.com on any comment immediately reveals who the author is.
It doesn’t matter if the dataset in use allows for identification, as long as identification remains possible.
Every post is tied to a username and email address, making it personal information, since each poster can be identified. I’m sure they’re also tracking further metrics such as IP addresses, browser fingerprints, etc. It is immaterial whether we from the outside are able to identify users; it only matters whether it’s possible given the data available to the processor. In this case, it is. Not to mention, there is a good chance texts and posts themselves contain plenty of personal information, such as links to other user profiles, mentions and discussions of people, etc.
DPOs in Europe don’t always work with lawyers. I mainly deal with mid-sized companies, and when it comes to the larger corporations I work with lawyers, absolutely. I was simply clarifying that I am not a lawyer and don’t claim to be one.
Nope, your username and email are required and linked to your data, so it’s entirely personal information. True anonymization is impossible with open text fields, as it’s always possible that people reference other users within their posts, etc.
Of course, what the DPAs do with it is another matter. Doesn’t hurt to try.
I’m not a lawyer, but a certified data protection officer in Germany.
I posted an extensive write-up over here: https://kbin.social/m/reddit@lemmy.world/t/854162/Any-EU-based-users-of-reddit-should-immediately-file-a
Scroll down to the last section for tl;dr instructions :)
If you are in the EU, file a complaint with your supervisory authority, as reddit is illegally processing and selling children’s data for Google to train on, not to mention the data of all other users, who were neither properly informed nor asked for consent. This one is going to be fun.
I managed to get this instead of a pay raise. One of the very best decisions I ever made. Sure, more money is always nice, but it wouldn’t be the difference between buying a house and not anyway. Might as well reclaim life time and enjoy what I got, while I got it.
Awesome, thanks for the pointers! I’ll look into it.
How are you digitizing Blu-rays? I’ve not found a way yet due to the DRM on those fuckers.
This is modern alchemy trying to turn lead into gold. Just change the meaning of the magic words et voilà, you make gold while the other party is robbed blind and can’t do anything about it after the fact.
And of course, it’s totally legal and totally cool.
It’s fine. The PCI-e one is an extra, for a graphics card that requires more connectors to be attached.