HackerNewsTop5 (@hackernewstop5)
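A post on the future of Portuguese-language LLMs, introducing the Amália model, which is specialized for European Portuguese, and its surrounding ecosystem. It is a worthwhile research and model discussion that stresses both the need for and the feasibility of building models for regional languages, moving away from the English-centric LLM mainstream.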
RE: https://toot.cafe/@baldur/116550228055233931
> If this had been a real dataset, groups with no discernible differences could easily have ended up being reported as wildly divergent, purely based on the underlying large language model's pre-existing notions of what different demographic groups are like.
That's why human validation needs to be a part of any serious application of large language models in #NLP.
🚨 New Article - Plagiarism Ex Machina: Structural Appropriation in Large Language Models
This article examines the transformation of human-authored textual corpora into predictive generative capacity without transparent source attribution.
https://zenodo.org/records/20070859
#LLM #MedicalNLP #LegalTech #MedTech #AIethics #AIgovernance #cryptoreg
#healthcare #ArtificialIntelligence #NLP #aifutures #LawFedi #lawstodon
#tech #finance #business #agustinvstartari #medical #linguistics #ai #LRM
The paper examines the transformation of human-authored textual corpora into predictive generative capacity without transparent source attribution or recoverable provenance. It shifts the AI plagiarism debate from copying and memorization toward structural appropriation, recombinative authorship, and generative provenance. Large language models introduce a form of plagiarism that cannot be reduced to verbatim copying or copyright infringement. Their central operation is structural appropriation: the absorption, recombination, and redeployment of human intellectual labor under conditions of referential opacity and attribution collapse.
Keywords: Structural appropriation; Recombinative plagiarism; Referential opacity; Attribution collapse; Synthetic originality; Predictive authorship; Latent intellectual debt; Corpus parasitism; Invisible intellectual labor; Generative provenance.
fly51fly (@fly51fly)
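A paper has been released that addresses the "frequency confound" in the relationship between language-model surprisal and […] scores. It is a 2026 arXiv paper from researchers at Bielefeld University and may have important implications for […] evaluation and for the interpretation of language models.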
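As a rough illustration of what a frequency confound in surprisal can look like (this is not the paper's method, which the post does not describe): surprisal is the negative log probability a model assigns to a token, and rare words tend to receive high surprisal simply because they are rare. One common control, assumed here, is to regress surprisal on log unigram frequency and keep only the residual. All probabilities and frequencies below are made-up toy values, and the helper names are hypothetical.

```python
import math


def surprisal_bits(prob):
    """Surprisal of an event with probability `prob`, in bits."""
    return -math.log2(prob)


def residualize(y, x):
    """Residuals of a simple least-squares regression of y on x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]


# Toy values (assumptions, not data from any paper):
# in-context probabilities a model assigned to four tokens ...
token_probs = [0.5, 0.25, 0.125, 0.01]
# ... and the corpus-level relative frequencies of the same tokens.
unigram_freqs = [0.05, 0.02, 0.001, 0.0005]

surprisals = [surprisal_bits(p) for p in token_probs]
log_freqs = [math.log2(f) for f in unigram_freqs]

# Surprisal with the linear effect of log frequency removed.
residual_surprisal = residualize(surprisals, log_freqs)
```

By construction the residuals are uncorrelated with log frequency, so any remaining association with an external score cannot be attributed to a linear frequency effect alone.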
CancionesCortasIA (@CancionesChorri)
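The post proposes that if Zipf's law and the redundancy of human language are what cause embedding collapse, it would make sense to design a symbolic protocol in which the geometry can be controlled, producing a factored representation that separates facts, actions, and logical structure. It is a technical proposal about AI representation learning and embedding design.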

@anirudhbv_ce @OpenAI @GeminiApp @sentra_app “If the collapse comes from Zipf and human-language redundancy, wouldn't it make sense to design a symbolic protocol for embeddings where geometry is controlled? Something like a factored representation: Concept, Action, Logical structure
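One way to read the "factored representation" idea is as an embedding split into separate blocks, one per factor, so that each block's geometry can be inspected or constrained on its own. The sketch below is only an interpretation of the post; the class name, field names, and toy vectors are all hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class FactoredEmbedding:
    """Hypothetical embedding with one sub-vector per factor."""
    concept: tuple
    action: tuple
    logic: tuple

    def as_vector(self):
        # Concatenate the blocks; each factor keeps its own coordinates.
        return self.concept + self.action + self.logic


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def factor_similarities(p, q):
    """Compare two embeddings factor by factor instead of as one blob."""
    return {
        "concept": cosine(p.concept, q.concept),
        "action": cosine(p.action, q.action),
        "logic": cosine(p.logic, q.logic),
    }


# Toy items: same concept and logical structure, different action.
a = FactoredEmbedding(concept=(1.0, 0.0), action=(1.0, 0.0), logic=(0.0, 1.0))
b = FactoredEmbedding(concept=(1.0, 0.0), action=(0.0, 1.0), logic=(0.0, 1.0))
sims = factor_similarities(a, b)
```

Here the two items agree on concept and logic but differ on action, a distinction that a single similarity over the concatenated vector would blur.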