| ▲ | mhitza 5 hours ago | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
i haven't read the full study, but its been on my mind for a while. https://en.wikipedia.org/wiki/Stylometry The best course of action to combat this correlation/profiling, seems to be usage of a local llm that rewrites the text while keeping meaning untouched. Ideally built into a browser like Firefox/Brave. | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ▲ | DalasNoin 5 hours ago | parent | next [-] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
We don't use (much) stylometry, so this won't help. This is totally something you could try, but we use interests and clues. Semantic information you reveal about yourself. The blog post might be more approachable if you want to get a quick take: https://simonlermen.substack.com/p/large-scale-online-deanon... | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ▲ | DalasNoin 4 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
There is also a practical issue here that people usually don't write a lot on linkedin, most people just have structured biographical information. We use very limited stylometry in section 6 for matching reddit users who we synthetically split according to time. | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ▲ | patcon 4 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
L33tsp34k also accomplishes this. The original anonymising hacker stylometry :) I am intrigued by the idea that in the future, communities might create a merged brand voice that their members choose to speak in via LLMs, to protect individual anonymity. Maybe only your close friends hear your real voice? Speaking of which, here's a speculative fiction contest: https://www.protopianprize.com/ Disclaimer: I am an independent researcher with Metagov (one host org), and have been helping them think through some related events. EDIT: I've belatedly realized that stylometry isn't involved, but I think some of the above "what if" thought could still hold :) | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ▲ | 5o1ecist 4 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
> seems to be usage of a local llm that rewrites the text while keeping meaning untouched. There are no two ways of expressing something in ways that might create equal impressions. Relevant: https://www.perplexity.ai/search/hey-hey-someone-on-hn-wrote... | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ▲ | IncreasePosts 4 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
I don't think this is working any more, but there was a stylometic analysis of HN users a few years ago, and it was extremely effective (at least, for myself and people who felt the need to post in the comments): https://news.ycombinator.com/item?id=33755016 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ▲ | palmotea 4 hours ago | parent | prev [-] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
> The best course of action to combat this correlation/profiling, seems to be usage of a local llm that rewrites the text while keeping meaning untouched. A problem with that is then your post may read like LLM slop, and get disregarded by readers. Another reason why LLMs are destruction machines. | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||