Can AI be hacked into going rogue?
Can we really trust large language models like ChatGPT?

In our latest Neuro Sec Ops episode, we expose the wild world of LLM jailbreaks, dive into AI guardrails, and unpack the trade-off between security and usability.

🔊 Buckle up — this is AI safety like you’ve never heard it.

🎧 Listen now: https://open.spotify.com/episode/6jw1aKK8qE6bnnLiKj8Lz2?si=1X8Kav6yQS6aaOwgGO7c9w

#AIsecurity #LLMjailbreak #CyberThreats #Guardrails #AIsafety #GPT4 #MachineLearning #CyberPodcast

Guardrails for AI: Can We Stop LLMs from Going Rogue?
Neuro Sec Ops · Spotify
Poll: Which AI vulnerability worries you the most?
#AIsecurity #LLMjailbreak #AIsafety #Cybersecurity
- Jailbreak attacks on LLMs: 0%
- Prompt injection hacks: 0%
- Over-blocking legit queries: 0%
- Hidden model biases: 100%
Novel Universal Bypass for All Major LLMs

HiddenLayer’s latest research uncovers a universal prompt injection bypass that affects GPT-4, Claude, Gemini, and other major models, exposing significant gaps in LLM security.

HiddenLayer | Security for AI