Tetsuro Miyatake (@tmiyatake1)

Anthropic가 AI 학습용으로 전 세계의 책을 수집해 전 페이지를 스캔하고 일부를 파기하는 내부 프로젝트 'Project Panama' 관련 문서와 책 창고 사진이 공개되었다는 워싱턴포스트 보도. 학습 데이터 수집 방식과 저작권·윤리 논란을 불러일으킬 소지가 있음.

https://x.com/tmiyatake1/status/2018475269158215855

#anthropic #projectpanama #dataset #bookscanning #aitraining

Tetsuro Miyatake (@tmiyatake1) on X

AnthropicのAIトレーニングのために世界中の本を集めて全ページをスキャンする取り組み「Project Panama」の社内資料で公開された本の倉庫の様子がすごい。 https://t.co/pXUwxRosxE

X (formerly Twitter)

Hackaday: A Scanner For Arduino-Powered Book Archiving. “Scanners for loose papers have become so commonplace that almost every printer includes one, but book scanners have remained frustratingly rare for non-librarians and archivists. [Brad Mattson] had some books to scan, but couldn’t find an affordable scanner that met his needs, so he took the obvious hacker solution and built his own.”

https://rbfirehose.com/2025/07/07/hackaday-a-scanner-for-arduino-powered-book-archiving/

"While destructive scanning is a common practice among some book digitizing operations, Anthropic's approach was somewhat unusual due to its documented massive scale. By contrast, the Google Books project largely used a patented non-destructive camera process to scan millions of books borrowed from libraries and later returned. For Anthropic, the faster speed and lower cost of the destructive process appears to have trumped any need for preserving the physical books themselves, hinting at the need for a cheap and easy solution in a highly competitive industry.

Ultimately, Judge William Alsup ruled that this destructive scanning operation qualified as fair use—but only because Anthropic had legally purchased the books first, destroyed each print copy after scanning, and kept the digital files internally rather than distributing them. The judge compared the process to "conserv[ing] space" through format conversion and found it transformative. Had Anthropic stuck to this approach from the beginning, it might have achieved the first legally sanctioned case of AI fair use. Instead, the company's earlier piracy undermined its position."

https://arstechnica.com/ai/2025/06/anthropic-destroyed-millions-of-print-books-to-build-its-ai-models/

#AI #GenerativeAI #Anthropic #Claude #FairUse #BookScanning #Copyright #IP

Anthropic destroyed millions of print books to build its AI models

Company hired Google's book-scanning chief to cut up and digitize "all the books in the world."

Ars Technica
Anthropic destroyed millions of print books to build its AI models

Company hired Google's book-scanning chief to cut up and digitize "all the books in the world."

Ars Technica

Does anyone on the Fediverse do book scanning? If so, what is your setup?

I'm not looking for a howto as much as I am curious what book scanners use? Their phone? A flatbed scanner? An inexpensive book scanner like the CZUR? A professional book scanning setup?

#BookScanning #Books

Feels like a missed opportunity that the Document Scan feature of #Apple Notes doesn’t use the #iPhone depth camera to un-curl a page when #BookScanning
@Rachel_Thorn ah, it looks like the book scanning forum has been down for months, but it's been around for long enough - since 2009 - that I have some faith it will make it back.
Instead of flipping the books for each page, I'd get all the right-hand pages, then all the left-hand pages.
That Japanese man ... I think he did something creative with a folding footstool to hold a phone, but I can't find it right now.
I think #bookscanning is a better hashtag.

Is it bad that I get a book, that isn't available as digital, and immediately cut it up and scan it? That is what I do though.....and I kinda like it.

#bookscanning

IntelligenceGo on Instagram: "Book scanning 📚 Follow @intelligencego . . #book #library #librarian #livros #books #scanner #scanning #biblioteca #livraria #engineers #engineering #technology #intelligencego #intelligence #engenheiro #engenharia #machine #archtecture"

IntelligenceGo shared a post on Instagram: "Book scanning 📚 Follow @intelligencego . . #book #library #librarian #livros #books #scanner #scanning #biblioteca #livraria #engineers #engineering #technology #intelligencego #intelligence #engenheiro #engenharia #machine #archtecture". Follow their account to see 523 posts.

Instagram