Andreas Wagner

@anwagnerdreas@hcommons.social
1,078 Followers
1.4K Following
5.3K Posts

I am #DigitalHumanities Coordinator at Max Planck Institute for #LegalHistory and #LegalTheory (#mpilhlt) #Frankfurt

Also collaborator of salamanca.school project of #adwl #Mainz

While I mostly toot about work, I do have hobbies...
#Capoeira #Stratocaster

#LawFedi #Histodons
#NLP #TEIXML #Golang #Python #Elm #XQuery

If you're reading this on bsky, follow @ap.brid.gy so i can see your replies.

orcidhttps://orcid.org/0000-0003-1835-1653
hcommonshttps://hcommons.org/members/anwagnerdreas/
pronounshe/him
How do content moderation systems treat Indigenous languages online? CDT’s latest research report from Dhanaraj Thakur dives into how platforms moderate Quechua, a widely spoken but low-resource Indigenous language of South America—and reveals serious linguistic inequities. https://cdt.org/insights/moderating-quechua-content-on-social-media/

More on the crisis in open-source maintenance as exemplified by libxml2: https://lwn.net/SubscriberLink/1025971/73f269ad3695186d/

Some money *has* to start flowing into this community or the foundations we all rely on will start rotting away. Given the many-billions-per-quarter in Big Tech profits and the trillion-dollar valuations, it is insulting and absurd to claim a lack of budget for these companies to do the equivalent of keeping their car tires inflated.

Libxml2's 'no security embargoes' policy

Libxml2, an XML parser and toolkit, is an almost perfect example of the successes and failures [...]

LWN.net
In the last five years, we've gone from "employees will never have to go into an office" to "employees need to be in the office because creative and innovative work can only be done face-to-face between humans" to "lol we don't need humans"
@zackbatist Anyway, I spent a bunch of time figuring out how to turn off copilot auto suggestions in a bunch of different contexts, because it is just too distracting, constantly takes my mind away from the task at hand.
They make it way too hard to have copilot on ONLY when you ask for it. VS Code just keeps adding more places for it to "helpfully" popup and distract you
@zackbatist I have gotten copilot suggesting new rows in a CSV of digital collection metadata. The suggestions are very interesting, but clearly missing the context of describing an object that you have digitized! I guess next step it will auto generate objects for the collection at the same time...

So I'm using a CLI-based qualitative data analysis system (it's great! https://qualitative-coding.readthedocs.io) that I integrate into my VSCode workspace. I happened to start experimenting with copilot last week and today when I started qualitative coding I noticed it was offering me suggestions. Seems to be based on my prior codings rather than the actual file from the corpus, but still a bit eerie and not sure if I want to keep this on or not.

#CAQDAS #QualitativeDataAnalysis

@kfitz Congrats, well deserved!!
Ein Symbol für die freie Entfaltung der Persönlichkeit (Artikel 2 GG), für die Menschenwürde (Art. 1) und für gleiche Rechte gegen Diskriminierung (Art. 3) kann ja auch schwerlich die Neutralität des Staates des Grundgesetzes verletzen.

RE: https://bsky.app/profile/did:plc:x7vfp7rt4gbywara7ljm5e2s/post/3lsjblyjnqk2n

I'm between jobs and open for short-term missions until the end of the year :) . I like jumping in old #opensource code bases, tackling tech debt, working on #governance, and trying out new things! Get in touch :)

Lately I've mostly been working in #Java and #Rust. Based in #Leipzig, open for in-person or remote freelancing gigs.

In that sense, LLMs will abide by one view or another depending on how you prompt them.

But it's another thing entirely to pretend that inconsistent world views are self-consistent just because authoritative figures pretend they are. If some political discourse is incoherent, don't blame a model for missing the coherence it lacks.

The King is naked, and it's not the tool's fault if it can't measure the thickness of his clothes.

9/9

×
seems to imply that the liability in that case would rest with Hugging Face or whatever service runs the model / uses it to infringe. Having created a model that *could* infringe is not actionable here. Finally, not a practical point, but this analogy will piss ppl off and I'm glad.
Oh, and if you want to consult the original, see below. Glad this did not settle out of court. That would have really screwed over academic researchers and libraries without big legal staffs. bsky.app/profile/eriq...

RE: https://bsky.app/profile/did:plc:i6w2uuipovifyhfu62viemv7/post/3lse3ncj4qk2q
My overall reading: if this is the law, we can feel confident that 1) libraries that physically possess copyrighted books 2) can let researchers train models on them 3) without a lot of fuss about exactly what % of any book a model would be able to reproduce 4) and even share those models! +
All of that, by the way, before we even address the additional factor that the intended research use is not-for-profit. It's a slam-dunk — and unfortunately these things have to be a slam-dunk when you're an underfunded institution worried about lawsuits.
I feel a little bit of uncertainty about point (4). Alsup's ruling does not directly address the legality of, say, open-sourcing Claude. So maybe, to be maximally cautious, don't share models that could produce an infringing copy. (But academic budgets aren't v likely to produce models that big.)