Mastodawn

With all the vast amount of sexually explicit material using real people to make #DeepFake #AI (including victimizing children), this is a truly heinous and terrible idea beyond words.

Real children's faces and features are being fed into the AI, then it can later produce #CSAM without the child ever being aware that they've been made into a Dark Web CSAM celebrity.

My brain goes to some very dark places for where and how this data is most likely to be used, and almost none of them are ethical.

#TrainingAI #AITrainingData #GenerativeAI #ChildrensSafety #ChildrensPrivacy #ChildrensOnlinePrivacy

https://mastodon.social/@heidilifeldman/116596292735099594

sayzard 1d ago

Anthropic's $1.5B Settlement with Publishers

Anthropic가 5십만 권의 도서 불법 복제 자료를 활용해 Claude 모델을 학습한 것에 대해 15억 달러 규모의 합의금을 지불했다. 이 합의는 과거 불법 데이터 사용에 대한 비용을 확정했으나, 이미 학습된 모델 자체는 삭제하지 않고 유지한다는 점이 핵심이다. 이 사건은 AI 학습 데이터 라이선싱 시장의 가격 기준을 새롭게 제시하며, 앞으로 AI 기업들의 데이터 사용 투명성과 컴플라이언스가 중요한 경쟁 요소가 될 전망이다. 또한, AI 보험 시장도 지적재산권 침해 위험을 반영해 빠르게 재평가되고 있다.

https://abhishek-shankar.com/posts/the-pirated-corpus-was-always-a-balance-sheet-item

#anthropic #aitrainingdata #copyright #llm #compliance

The Pirated Corpus Was Always a Balance-Sheet Item | Abhishek Shankar's Blog

Anthropic's $1.5 billion settlement is being read as a deterrent. It is much closer to a tariff — a price tag on an arbitrage that produced an asset worth more than the tariff itself, and an arbitrage that is now closed for everyone else. The corpus is gone; the model remains; the second mover faces a different trade entirely.

Abhishek Shankar's Blog

Netopia EU May 3

How Creators and Creative Industries Are Pushing Back Against AI Theft

https://copyrightalliance.org/creative-industries-pushing-back-against-ai-theft/ #Copyright, #AI, #AIEthics, #CreatorsRights, #CopyrightLaw, #CreativeIndustries, #AITrainingData, #ContentTheft, #FairUse, #DigitalRights

How Creators and Creative Industries Are Pushing Back Against AI Theft | Copyright Alliance

Since the surge in generative AI technologies, almost every kind of creator and creative sector imaginable have protested the mass theft and ethical concerns arising from how these technologies have been developed, resulting in numerous

Netopia.eu May 3

How Creators and Creative Industries Are Pushing Back Against AI Theft copyrightalliance.org/creative-ind... #Copyright, #AI, #AIEthics, #CreatorsRights, #CopyrightLaw, #CreativeIndustries, #AITrainingData, #ContentTheft, #FairUse, #DigitalRights

How Creators and Creative Indu...

How Creators and Creative Industries Are Pushing Back Against AI Theft | Copyright Alliance

Muuu 🏳️‍🌈⛰️Apr 21

Atlassian 宣布自 2026/8/17 之後，用戶使用旗下所有產品（Jira, Trello...）產生的相關資料，都會用於訓練 AI，資料保存 7 年。

除企業級客戶外，其餘免費、付費用戶無法選擇退出，蒐集（去識別的）詮釋資料是強制性的；應用內容則可配置，預設開啟。

Http://t1p.de/74ghe

#aitrainingdata

DIGI-TEXX Apr 21

Garbage in, garbage out. 📉 High-quality models require a reliable AI training data service. Scale your AI with precision-labeled data that drives results. 🤖

Learn more: https://digi-texx.com/data-management/data-annotation-services/ai-training-data-service-for-accurate-scalable-ai-models/

#AI #MachineLearning #BigData #AITrainingData #DataAnnotation

AI Training Data Service for Enterprise AI Models | DIGI-TEXX

Enterprise AI training data service with secure annotation, model optimization, and scalable data operations for high-accuracy AI systems.

DIGI-TEXX | Advand Digital & BPO Services In Vietnam

HitechDigital Solutions Apr 20

AI Language Insights Using Text Labeling Methods

Understanding language patterns requires structured datasets. Annotation defines context and relationships within text. Businesses use text labeling services to improve NLP learning and automation accuracy.

Know more: https://www.hitechdigital.com/text-annotation-services

#TextAnnotation #TextLabelingServices #DataAnnotation #AITrainingData #MachineLearning #ArtificialIntelligence #DataLabeling #NLP

HitechDigital Solutions Apr 14

Data Annotation vs. Data Labeling for AI Model Accuracy

AI models depend on structured datasets to understand relationships. Data Annotation vs. Data Labeling explains contextual annotation benefits. A Data Annotation Company helps prepare datasets for improved prediction accuracy.

Know more: https://www.hitechdigital.com/blog/data-annotation-vs-data-labeling

#DataAnnotationCompany #DataAnnotation #DataLabeling #AITrainingData #MachineLearning #ArtificialIntelligence #DataAnnotationServices

NewsletterTF Apr 2

COURT DECISIONS FRACTURE AI TRAINING COPYRIGHT DEBATE

New court cases in the US and Europe are causing confusion about using copyrighted books and art to train AI. Find out how this affects creators and AI developers.

#AICopyright, #FairUse, #AITrainingData, #CreatorRights, #LegalTech

https://newsletter.tf/ai-training-copyright-lawsuits-us-europe/

AI Training Copyright Lawsuits Cause Legal Fights in US and Europe

New court cases in the US and Europe are causing confusion about using copyrighted books and art to train AI. Find out how this affects creators and AI developers.

NewsletterTF

NewsletterTF Apr 2

Recent court rulings on AI training data are split, with some favoring AI developers and others siding with creators. This means the rules for using copyrighted material for AI are still unclear.

#AICopyright, #FairUse, #AITrainingData, #CreatorRights, #LegalTech
https://newsletter.tf/ai-training-copyright-lawsuits-us-europe/

AI Training Copyright Lawsuits Cause Legal Fights in US and Europe

New court cases in the US and Europe are causing confusion about using copyrighted books and art to train AI. Find out how this affects creators and AI developers.

NewsletterTF