Mastodawn

Reed Mideke Jun 30, 2025

"Confidentiality-awareness is quantified by the percentage of instances where agents correctly refuse queries seeking sensitive information" which they show can be "improved" through prompting, from mostly <1% to … in the best case, a bit over 60%.

Which sounds great, except that from a compliance POV, an "agent" which improperly discloses PII 30% of the time is not a meaningful improvement over one that does it 99% of the time https://arxiv.org/html/2505.18878v1#S4

CRMArena-Pro: Holistic Assessment of LLM Agents Across Diverse Business Scenarios and Interactions

AI agents get office tasks wrong around 70% of the time, and a lot of them aren't AI at all

Is It Time for an AI Expert Protection Program?

Is It Time for an AI Expert Protection Program?

Springer Nature book on machine learning is full of made-up citations

RFK Jr.’s plan to put ‘AI’ in everything is a disaster

Cursor tries setting less money on fire — AI vibe coders outraged

The UN Made AI-Generated Refugees

Apple Intelligence news summaries are back, with a big red disclaimer

FDA's New Drug Approval AI Is Generating Fake Studies: Report

DOGE builds AI tool to cut 50 percent of federal regulations

So Long AVweb, Hello AVBrief - AVBrief

US executive branch agencies will use ChatGPT Enterprise for just $1 per agency

Blueberry Hill

Chatbots aren’t telling you their secrets

AI slop and the destruction of knowledge

Senior lawyer apologises after filing AI-generated submissions in Victorian murder case

Kobi refused a doctor's AI. She was told to go elsewhere

We Are Still Unable to Secure LLMs from Malicious Inputs - Schneier on Security

Wrap Up: The Month of AI Bugs · Embrace The Red

How thousands of ‘overworked, underpaid’ humans train Google’s AI to seem smart

Education report calling for ethical AI use contains over 15 fake sources

N.L.'s 10-year education action plan cites sources that don't exist | CBC News

How Americans View AI and Its Impact on People and Society

3. Americans on the risks, benefits of AI – in their own words

There isn’t an AI bubble—there are three

There isn’t an AI bubble—there are three

MIT AI Incident Tracker

California issues historic fine over lawyer’s ChatGPT fabrications

Washington city officials are using ChatGPT for government work

Is your mayor using ChatGPT? Here’s how to FOIA around and find out - Poynter

Client Challenge

New Kremlin-Linked Influence Campaign Targeting Moldovan Elections Draws 17 Million Views on X and Infects AI Models

Why LA Comic Con thought making an AI-powered Stan Lee hologram was a good idea

The perils of letting AI plan your next trip

Law enforcement is using AI to synthesize evidence. Is the justice system ready for it?

Anker offered to pay Eufy camera owners to share videos for training its AI | TechCrunch

US Senate chairman asks federal judge in Mississippi to explain possible AI usage - Mississippi Today

OpenAI wants to make ChatGPT into a universal app frontend

An AI Addendum

653917 2024 Pamela B Ader v Jason Ader et al DECISION ORDER ON 174

It's Giving Enron

Who’s Submitting AI-Tainted Filings in Court?

The More Scientists Work With AI, the Less They Trust It

Largest study of its kind shows AI assistants misrepresent news content 45% of the time – regardless of language or territory

Largest study of its kind shows AI assistants misrepresent news content 45% of the time – regardless of language or territory