Kati Lassila-Perini

@KatiLassila
22 Followers
129 Following
21 Posts
Particle physicist - data preservation, reusability, open data, education
CMS Experiment at CERN, from Helsinki Institute of Physics

- Runner,XC-skier, hiker: most comfortable uphill.

While I see value in AI tools for correcting spelling, assisting with code, or using machine learning on well-defined data to find correlations my three-dimensional reasoning cannot detect, I fail to see how broader AI solutions or autonomous agents can be compatible with the concept of #researchintegrity.

Reliability. Honesty. Respect. Accountability.

Am I just old-fashioned?

(end)

(…cont. from https://allea.org/code-of-conduct)

“Respect
for colleagues, research participants, research subjects, society, ecosystems, cultural heritage, and the environment.”

“Accountability
for the research from idea to publication, for its management and organisation, for training, supervision, and mentoring, and for its wider societal impacts.”

(3/4)

The European Code of Conduct for Research Integrity - ALLEA

From https://allea.org/code-of-conduct

“Reliability
in ensuring the quality of research, reflected in the design, methodology, analysis, and use of resources.”

“Honesty
in developing, undertaking, reviewing, reporting, and communicating research in a transparent, fair, full, and unbiased way.”

(2/4)

The European Code of Conduct for Research Integrity - ALLEA

I'd like to hear the scientific community talk more about research integrity, in particular, when promoting AI.

Take, for example, the European Code of Conduct for Research Integrity (https://allea.org/code-of-conduct/):

Reliability. Honesty. Respect. Accountability.

(1/4)

The European Code of Conduct for Research Integrity - ALLEA

📣 Exciting news from CDS (CERN Document Server): https://cds-blog.web.cern.ch/2026/04/new-on-cds-archive-your-code-community-collections-simpler-search/

Code archival with DOI🎉

This is going to be a wonderful tool for #OpenScience and #AnalysisPreservation

Now up to the CERN communities to make the best use of it!

New on CDS: archive your code, community collections, simpler search – CDS | What's up on CDS

You know, it isn't even that tools like this are useless. There are absolutely things they could be good at. I've personally seen Claude find stupid little bugs you'd spend an hour figuring out and hating yourself for afterwards with great efficiency. I tried the first iteration of Copilot, back when it was just an aggressive autocomplete, and while I had to stop using it because it was overconfidently trying to finish my programs for me without being asked, it was great for filling in boilerplate and maybe even a couple lines of real code for the basic stuff. We have models nowadays that are actually trained to find bugs and security issues in code rather than having the entire internets thrown at them to produce something Altman & Amodei can sell to the gullible as AGI.

But there's the problem. The technology has been around for a while, we have a good idea of what it's good for and, more importantly, what it's not. "Our revolutionary expert system for finding bugs in your code" isn't nearly as marketable to the general public, and the CEO class especially, as "our revolutionary PhD level sentient AI that will solve all the world's problems if you only give us another couple trillion dollars, and also wants to be your girlfriend." And so we get Claude and ChatGPT and RAM shortages and AI psychosis and accelerated climate change instead of smaller, focused models that are actually good at their specialist subjects. Because those don't produce as much shareholder value.

the enshittification of arxiv has started. this is very worrisome in my opinion. And very very bad for open science and academic integrity in general. I guess that's our own academic fault for not funding arxiv sufficiently, now we get late capitalist distopia instead https://jobs.chronicle.com/job/37961678/chief-executive-officer
Chief Executive Officer - New York City, New York (US) job with arXiv | 37961678

arXiv seeks its first CEO to champion open, free scientific discovery and guide the platform’s next chapter as an independent nonprofit.

The Chronicle of Higher Education Jobs

RE: https://mstdn.social/@jschauma/115662916873125794

Read it, read all 6297 words. While you can still take that many.

I wrote about AI again. On some level I don't now why I do this to myself

https://jenniferplusplus.com/what-is-a-token/

Actually, I wrote most of it months ago, for work. But, it was well received, so I put it on my blog. This is the more generic version.

The short version is that AI is not magic. It's a real phenomenon with real behavior and tradeoffs. I'm deeply tired of *****ALL***** the tradeoffs being handwaved away. And so much imagination fills in for the actual behavior. So I tried to describe how it's built, because that informs how it works, which informs what it actually does. And to be clear, it does things. It's not useless. But that's not the same as being useful, or worthwhile.

Anyway, I already put ~4k words on this in the article, so I'll shut up and let it speak for itself.

What is a token

AI is meant to seem like magic. But there's no such thing as magic. It's all illusion. So, allow me to spoil that illusion for you.

Jennifer++

Before: "Don't spend too long documenting things so people can know what's going on, write code."
After: "Don't spend too long writing code, document things so the agents know what's going on."

Before: "Don't spend too long trying to set up the culture and processes of the team so the develop/test/ship loop is smooth, write code."
After: "Don't spend too long writing code, make sure the agents are in a smooth develop/test/ship loop."

We weren't allowed to do it for humans, we _must_ do it for the robots.