Mastodawn

Traditional QA frameworks break under probabilistic AI systems. Discover how to test with AI agents and rigorously evaluate them before they hit production. https://hackernoon.com/testing-ai-agents-and-testing-with-ai-agents-are-two-sides-of-the-same-coin #aitesting

Testing AI Agents and Testing With AI Agents Are Two Sides of the Same Coin | HackerNoon

imbus Canada Corporation Jun 18

⏰ Last Day to Register: Don't miss your chance to attend our upcoming webinar:
Topic: Testing AI-Based Systems: Considerations for Quality and Risk

Join us to learn:
✅ How test maturity impacts AI system testing
✅ Why conventional test techniques may not be sufficient for AI components
✅ Key quality and risk considerations for AI-based systems

📅 Last day to register — reserve your spot now! https://zurl.co/FsRSl

#AITesting #AI #SoftwareTesting #QualityAssurance #QA #Webinar

Seasia Infotech Jun 12

AI Quality Engineering: The Future of Faster, Smarter Software Delivery

AI-powered quality engineering is transforming software delivery by automating testing, predicting defects, and accelerating release cycles. Discover how intelligent QA strategies help businesses improve software quality while reducing time-to-market.

#AIQualityEngineering #SoftwareTesting #QualityAssurance #AITesting #SoftwareDevelopment

Priti Gaikwad Jun 2

Here are 9 automation testing practices that help teams ship faster, reduce maintenance, and improve release confidence. 🚀

Which one has made the biggest difference for your team?

#AutomationTesting #QA #TestAutomation #SoftwareTesting #AITesting #QualityEngineering

Analyst207 May 22

Cisco Tests AI for Incident Reports, Finds Mixed Results

Cisco's experiment with AI-generated incident reports yielded mixed results, with large language models producing significant inaccuracies, unusual conclusions, and inconsistent writing styles when used for long-form technical content. The findings revealed four predictable failure modes, highlighting the need for guardrails…

https://osintsights.com/cisco-tests-ai-for-incident-reports-finds-mixed-results?utm_source=mastodon&utm_medium=social

#ArtificialIntelligence #LargeLanguageModels #IncidentResponse #AiTesting #CiscoTalos

Aurélien Lair May 20

🤖 How do you actually know if your AI agent is any good? Great practical read on evaluating AI agent performance: metrics, methods & the traps to avoid. A must for anyone moving into LLM evals.

👉 https://tinyurl.com/26pfmobc

#AITesting #LLMEvals #QualityEngineering #AIagents

Seasia Infotech May 18

AI-generated code passed QA… but still failed in production

Traditional testing workflows are struggling to keep up with AI-assisted development. Passing unit tests no longer guarantees reliability, security, or scalability in real-world environments.
This blog explores why AI-generated code can “look correct” while still introducing hidden risks — and what modern QA teams must do differently.

#AI #SoftwareTesting #QA #AITesting #CodeQuality #Automation

Analyst207 May 11

Anthropic's Mythos AI Falls Short in Bug-Hunting Test

Anthropic's highly hyped Mythos AI failed to impress in a recent bug-hunting test against cURL's codebase, and its results were largely dismissed as overhyped marketing. The limited test, run by cURL developer Daniel Stenberg, showed Mythos falling short of expectations.

https://osintsights.com/anthropics-mythos-ai-falls-short-in-bug-hunting-test?utm_source=mastodon&utm_medium=social

#AiTesting #BugHunting #OpenSource #ProjectGlasswing #LinuxFoundation

LBHuston May 5

Strong ethical justification throughout.

Evaluation Report: Qwen-3 1.7B in LMStudio on M1 Mac

I tested Qwen-3 1.7B in LMStudio 0.3.15 (Build 11) on an M1 Mac. Here are the ratings and findings: Final Grade: B+ Qwen-3 1.7B is a capable and well-balanced LLM that excels in clarity, ethics, an…

Not Quite Random

HackerNoon Apr 23

Most AI testing tools reset after every run. Learn how a unified data layer gives intelligent test automation the memory it needs to get smarter with every test. https://hackernoon.com/your-ai-testing-tool-has-no-memory-heres-why-thats-a-problem #aitesting

Your AI Testing Tool Has No Memory: Here's Why That's a Problem | HackerNoon
