我的場景下 #DeepSeekV4 Pro 和 #OpenCode 已經接近完美,例如我想 (from scratch) 開發一個 #Rust 版本的RocketChat Client,可以叫它參考官方文檔開發真實測試 #TestSuite 運行並收集數據,再做 Rust Client,它會完全自己測試自己修Bug,例如中間我要求它加入替換用戶名和觸發is typing,這兩個功能官方文檔都沒有寫的,它是直接查看rocketchat的源碼研究,結果亦很完美,未來要維護也有充足的測試

#DataFlowDiagram

有了 #DFD#TestSuite 作為基礎,再要求AI一邊優化DFD一邊開發代碼,中間用大量的測試作為驗證,可以幾乎零人手參與AI原生進行開發,我使用的是 #DeepSeekV4#OpenCode

#DataFlowDiagram

Nebraska.Code 2025 hosted on Whova

July 23 – 25, 2025, Lincoln, NE

Nebraska.Code 2025 hosted on Whova

July 23 – 25, 2025, Lincoln, NE

🎩🕵️‍♂️⏰ Someone just spent 1,250 words trying to convince us why we need a "test suite" for TOTP codes, as if the tech gods are holding their breath for this groundbreaking revelation. It's a real cliffhanger concerning the three big players (Google, Apple, Yubico) not playing nice in the digital sandbox. 🌐🔐👨‍💻
https://shkspr.mobi/blog/2025/03/towards-a-test-suite-for-totp-codes/ #TOTPcodes #TestSuite #DigitalSecurity #TechDebate #BigTech #HackerNews #ngated
Towards a test-suite for TOTP codes

Because I'm a massive nerd, I actually try to read specification documents. As I've ranted ad nauseam before, the current TOTP spec is irresponsibly obsolete. The three major implementations of the spec - Google, Apple, and Yubico - all subtly disagree on how it should be implemented. Every other MFA app has their own idiosyncratic variants. The official RFC is infuriatingly vague. That's no…

Terence Eden’s Blog
Towards a test-suite for TOTP codes

Because I'm a massive nerd, I actually try to read specification documents. As I've ranted ad nauseam before, the current TOTP spec is irresponsibly obsolete. The three major implementations of the spec - Google, Apple, and Yubico - all subtly disagree on how it should be implemented. Every other MFA app has their own idiosyncratic variants. The official RFC is infuriatingly vague. That's no…

Terence Eden’s Blog
Micropub Rocks!

Over the weekend, I put together an open-source Test Suite for GenAI to help me test the LLMs in the multi-modal GenAI framework. It's a simple tool designed to help developers test and validate Generative AI models across platforms like Ollama, OpenAI, Anthropic, and Amazon Bedrock. Testing is essential in GenAI because it ensures that models generate accurate, reliable, and unbiased outputs—crucial given their complexity and impact.

https://github.com/joelee/GenAI-Prompt-Test-Suites

#GenAI #LLM #TestSuite

GitHub - joelee/GenAI-Prompt-Test-Suites: Test Suites for GenAI Prompt Testing

Test Suites for GenAI Prompt Testing. Contribute to joelee/GenAI-Prompt-Test-Suites development by creating an account on GitHub.

GitHub

If you can’t run your #testsuite as part of the #pipeline, then what’s the point? A consistent method of building an unknown quantity for use in #production? That sounds marginal at best.

What isn’t the #documentation for the pipeline telling me? Or what haven’t I seen in it yet? I don’t want to #debug the #testsuite itself. We already know it works from #CLI.

The recording of our Fediverse Developer Network meeting yesterday is now online: https://fedidevs.org/notes/2024-03-07/ and there are meeting notes, too.

Main subject was an early show-and-tell-and-feedback of FediTest, developing a #testsuite for the #Fediverse.

Thanks @andypiper running the meeting and making the recording.

For more updates on #FediTest, follow @feditest.

Fediverse Developer Network | 2024-03-07 Online meeting (focused on FediTest)

Fediverse Developer Network