From The #Resistance #Garden on #BlueSky:
"#Liberatory #economics will necessarily include creating ways to #subsist that don't rely on #extracting #resources and #labor from elsewhere, not simply #distributing #imperial spoils more equitably."
From The #Resistance #Garden on #BlueSky:
"#Liberatory #economics will necessarily include creating ways to #subsist that don't rely on #extracting #resources and #labor from elsewhere, not simply #distributing #imperial spoils more equitably."
Extracting memorized pieces of books from open-weight language models
https://arxiv.org/abs/2505.12546
#HackerNews #Extracting #memorized #pieces #of #books #from #open-weight #language #models #languagemodels #AIresearch #bookextraction #openweightmodels #arxiv

Plaintiffs and defendants in copyright lawsuits over generative AI often make sweeping, opposing claims about the extent to which large language models (LLMs) have memorized plaintiffs' protected expression in their training data. Drawing on both machine learning and copyright law, we show that these polarized positions dramatically oversimplify the relationship between memorization and copyright. To do so, we extend a recent probabilistic extraction technique to measure memorization of 50 books in 17 open-weight LLMs. Through thousands of experiments, we show that the extent of memorization varies both by model and by book. With respect to our specific extraction methodology, we find that most LLMs do not memorize most books -- either in whole or in part. However, we also find that Llama 3.1 70B entirely memorizes some books, like the first Harry Potter book and 1984. In fact, the first Harry Potter is so memorized that, using a seed prompt consisting of just the first few tokens of the first chapter, we can deterministically generate the entire book near-verbatim. We discuss why our results have significant implications for copyright cases, though not ones that unambiguously favor either side.
Extracting content from an LCP "protected" ePub
https://shkspr.mobi/blog/2025/03/towards-extracting-content-from-an-lcp-protected-epub/
#HackerNews #Extracting #ePub #LCP #Content #eBook #DigitalRights #Hacking
As Cory Doctorow once said "Any time that someone puts a lock on something that belongs to you but won't give you the key, that lock's not there for you." But here's the thing with the LCP DRM scheme; they do give you the key! As I've written about previously, LCP mostly relies on the user entering their password (the key) when they want to read the book. Oh, there's some deep cryptographic…
I think that the thing that will successfully decouple me from commercial social media and all its traumas and teacup tempests is when I can apply the concept of cui bono every time I feel like either bloviating, sharing my pointless daily struggles, or wading into some silly argument that'll go nowhere and serve no purpose. Who benefits? Usually only the beancounters.
Design patterns for #extracting from #REST APIs
https://blog.sequin.io/design-patterns-for-extracting-from-rest-apis/
WATSON's 🌸 spring gift 🌷 for fans and followers!
🤩 Check out the page dedicated to our paper on #sampling, #extracting, and #analyzing #water to study the use of water by #vegetation 🤩
Never mind #SeaLevelRise: human activity can make the ground go down faster than the seas rise.
"Some land #subsidence, Bekaert said, is related to deep natural processes over long periods of time, such as responding to plate #tectonic activity or to the retreating of the #glaciers from the last Ice Age. Other sinking is linked to human activity, including #extracting oil, #water or minerals from underground. In cities, buildings can also add weight and push land down."
https://www.washingtonpost.com/climate-environment/2023/05/30/land-sinking-us-subsidence-sea-level/
#Toronto based #miner #BrazilPotash is working to keep a $2.5 billion #potash project on schedule, as #LegalChallenges are mounting to its plans for #extracting the #fertilizer ingredient from beneath the #Amazon #rainforest .
#PotassioDoBrasil #ChiefExecutive Adriano Espeschit described a protracted licensing process hinging on #CourtSupervised talks with #Mura #Indigenous people.
#EnvironmentalRacism #Canada #CorporateGreed #NativeLand #StopEcocide #NativeRights