#AIEngineering #airisk #aisafety

Sorry, AV is just not ready for real world applications. “Oops, we didn’t think of that” is a naive, unscientific, and negligent way to design systems. These are not software bugs; they are serious design flaws and limitations of the technology. AV is still an open research area; public roads should not be used for research and experimentation.

https://www.pcmag.com/news/waymo-recalls-3800-robotaxis-to-keep-them-out-of-freeway-construction-zones

AI robots can go rogue – a researcher on how easily it happens | The-14

AI robots can be manipulated into dangerous actions. Researchers warn safety systems, laws and accountability are lagging behind innovation.

The-14 Pictures
New Chinese AI Governance white paper, from two days ago english.scio.gov.cn/whitepapers/... #AISE26 Chinese ambassador just talked about it, seems to carry on from the French/Indian #AIAction philosophy, not the transhumanist/regulatory evasive #AISafety one.

Full text: More Just and Equit...
Full text: More Just and Equitable Global Governance: China's Principles, Proposals and Actions

MicrosoftがOpenAIのモデルを中国企業に販売しているというニュース、かなり複雑な気持ちになりました。OpenAIやAnthropicはIP保護や悪用リスクを理由に自ら中国市場に参入しない方針を掲げている一方で、Microsoftが代理店のように提供している。

企業価値として「AIの安全性」を掲げるのであれば、この二枚舌的な構造は整合性を問われるべきだと思います。ビジネス拡大と倫理の板挟みは理解しつつも、投資家やユーザーに対して透明性のある説明が必要ではないでしょうか。

個人的には、この動きが中国のAIエコシステムにどのような影響を与えるのか、そして米中AI競争をどう変質させるのか、引き続き注視したいと思います。

#AI #Microsoft #China #OpenAI #AISafety

What happens when an AI company argues the government should be able to block unsafe models, then has its own models blocked?

Anthropic's new policy agenda calls for binding pre-deployment testing and a government power to block frontier releases, through a process it says is fair and fact-based. Soon after, the US used export controls to suspend its two newest models over a jailbreak Anthropic says other models already match.

https://benjaminhan.net/posts/20260618-policy-on-the-ai-exponential/?utm_source=mastodon&utm_medium=social

#Anthropic #AISafety #Policy #AI

Policy on the AI Exponential – synesis

Dario Amodei argues that AI capability is outrunning the policy apparatus, and lays out recommendations across regulation, macroeconomics, scientific innovation, civil liberties, and geopolitics.

synesis
Anthropic just pulled its most powerful models under White House pressure 🏛️🔗🛑 The US government directly intervened on frontier AI safety — a first. Biosecurity concerns cited at the G7 summit. The era of unchecked AI deployment is ending. #AI #Anthropic #AISafety

First time the CEOs of OpenAI, Anthropic and Google sat at the same table as G7 heads of state.

Évian, June 17. Sam Altman, Demis Hassabis and Dario Amodei joined Trump, Macron and the rest for a closed-door AI governance session. About a dozen more tech CEOs tagged along.

Macron told Washington not to keep cutting-edge AI to itself. Altman called for a global forum to set the rules.

https://youtu.be/YqUmblj8Wxs

#G7 #AISafety #AIGovernance

Trump, G7 leaders meet OpenAI, Google, Anthropic CEOs on AI safety

YouTube

Anthropic pulled its top AI models offline Friday. The US government told them to.

Commerce gave Anthropic 90 minutes to pull Fable 5 and Mythos 5. Export-control letter, national security.

Amazon found Fable 5 would patch exploits when asked to "fix this code" — the very capability it was meant to suppress.

Anthropic complied. First major US AI lab forced to disable a flagship model on government order.

https://youtu.be/JojP3Hy0hQs

#Anthropic #AISafety #NationalSecurity

Anthropic pulls Fable and Mythos AI models after US government directive

YouTube
#ChatGPT: The latest public version of ChatGPT generates scenes of gruesome graphic violence with a simple prompt, British AI security startup @mindgard researchers have told the BBC, concerned that AI created gore "of its own volition":
#AISafety
👇
https://www.bbc.co.uk/news/articles/c802ldjdklzo
OpenAI works to stop ChatGPT generating 'sex crime scene' images

But researchers say its still possible to trick the AI chatbot into producing graphic content.

BBC News
NVIDIA has released SkillSpector, an open-source tool that scans AI skills for security risks before deployment. The tool uses static analysis, custom detectors and SARIF reporting to identify vulnerabilities in AI agents. https://www.marktechpost.com/2026/06/17/nvidia-skillspector-guide-scanning-ai-skills-for-security-risks-with-static-analysis-and-sarif-reports/ #AIagent #AI #GenAI #AISafety
NVIDIA SkillSpector Guide: Scanning AI Skills for Security Risks with Static Analysis and SARIF Reports

SkillSpector scans AI skills for risks using static analysis, custom detectors, risk visualization, and SARIF reporting before deployment.

MarkTechPost