#AI is thwarting the study of human #language.
https://www.404media.co/project-analyzing-human-language-usage-shuts-down-because-generative-ai-has-polluted-the-data/
( #paywalled)
"The creator of an #OpenSource project that scraped the internet to determine the ever-changing popularity of different words in human language usage says that they are sunsetting the project because generative AI spam has poisoned the internet… 'Now the web at large is full of #slop generated by #LLMs, written by no one to communicate nothing. Including this slop in the data skews the word frequencies.'"