#Google Books Is Indexing #AI-Generated Books

👉 #GoogleBooks is indexing low-quality, AI-generated books that will turn up in search results and could affect the Google #Ngram Viewer, an important tool researchers use to track #language use throughout history.

https://timesofindia.indiatimes.com/technology/tech-news/google-books-important-source-for-academics-may-have-a-bot-problem/articleshow/109089043.cms

#GoogleNgram #NgramViewer #linguistics #diachrony #diachroniclinguistics #research #languages #aigeneratedcontent #AIgeneratedBooks

Google Books, important source for academics, may have a ‘bot’ problem - Times of India

Google Books faces issues with low-quality AI books affecting Ngram viewer. Recent additions not impacting Ngram results but may in future updates.


I had not touched the #GoogleNgram viewer in a while and a colleague is asking about it. Started poking around and found this nice overview of different methods for getting data beyond the viewer.

"Working with Google Ngrams: A Data-Wrangling Tale," Blair Fix, @blair_fix

https://economicsfromthetopdown.com/2020/10/19/working-with-google-ngrams-a-data-wrangling-tale/

The post also shares an #RStats package, ngramr, for working with the tool. Nice.

Working With Google Ngrams: A Data-Wrangling Tale – Economics from the Top Down

Here's a tale about empirical work that demonstrates a simple rule: the deeper you go, the harder it gets.
