"We envision a more helpful generation of coding agents to both carry out their tasks and formally prove their implementations against strict specifications. Instead of debugging machine-generated logic, humans dictate what they want. Today, we are taking the first major step toward that vision."

https://mistral.ai/news/leanstral

#formalVerification #mistralAI #artificialintelligence #softwaredevelopment

Leanstral: Open-Source foundation for trustworthy vibe-coding | Mistral AI

First open-source code agent for Lean 4.

BOOTOSHI (@KingBootoshi)

작성자가 TLA+를 접하고 에이전틱(agentic) 코딩에 매우 유용하다고 감탄하고 있습니다. TLA+로 설계 상태(state)의 모든 가능한 시나리오를 수학적으로 검증해 버그와 충돌을 예방할 수 있으며, 문제가 발견되면 에이전트가 즉시 피드백을 받아 반복적으로 수정한다고 설명합니다.

https://x.com/KingBootoshi/status/2032775953479520306

#tla+ #formalverification #agentic #verification

BOOTOSHI 👑 (@KingBootoshi) on X

HOLY FUK I JUST LEARNED ABOUT TLA+ AND IT'S SO GOOD FOR AGENTIC CODING ur telling ME that i can mathematically fact check every possible scenario of my design STATE to prevent bugs and crashes AND IF IT FINDS SOMETHING THE AGENTS GET INSTANT FEEDBACK AND LOOP FIXING IT TILL IT

X (formerly Twitter)

🔬 I released Benelos, a lightweight model checker for C developers.

You describe your system in plain C — state struct, transition functions, invariants, liveness goals — and Benelos does a breadth-first exploration of the full reachable state space to verify correctness.

No new language. No external toolchain. Just a single header file.

🔗 codeberg.org/cdsoft/benelos

#OpenSource #C #FormalVerification #EmbeddedSystems

#Exploitation in the era of #formalverification A peek at a new frontier with adacore/Spark

https://av.tib.eu/media/62268

Exploitation in the era of Formal Verification: A peek at a new frontier with adacore/Spark

For decades, software vulnerabilities have remained an unsolvable security problem regardless of years of investment in various mitigations, hardening and fuzzing strategies. In the last years there have been moves to formal methods as a path toward better security. Verification and formal methods can produce rigorous arguments about the absence of the entire classes of security bugs, and are a powerful tool to build highly secure software. AdaCore/SPARK is a formally defined programming language intended for the development of high integrity software used in systems where predictable and highly reliable operation is crucial. The formal, unambiguous, definition of SPARK allows a variety of static analysis techniques to be applied, including information flow analysis, proof of absence of run-time exceptions, proof of termination, proof of functional correctness, and proof of safety and security properties. In this talk we will dive-into AdaCore/SPARK, cover the blind spots and limitations, and show real-world vulnerabilities which we met during my work and which are still possible in the formally proven software. We will also show an exploit targeting one of the previously described vulnerabilities.

Abstract Interpretation With Professor Patrick Cousot | Lecture Series on AI #11 | J.P. Morgan

YouTube

🚀 We’re growing! Cryspen is hiring.

Are you passionate about high-assurance software or cutting-edge cryptography? We are looking for talented individuals to join our team and help us build the next generation of secure-by-design systems.

🔹 Compiler and Verification Tools Engineer
🔹 Cryptography Engineer

Join us in our mission to make the digital world more secure.

👉 Apply here: https://join.com/companies/cryspen

#Hiring #Cryptography #FormalVerification #SoftwareEngineering #TechJobs

Jobs at Cryspen | JOIN

Jobs at Cryspen. Browse all our open positions and become part of our growing team! We are currently looking for additions to our company. Apply today!

**Theorem proving with the real numbers**
https://www.cl.cam.ac.uk/techreports/UCAM-CL-TR-408.html
by John Robert Harrison
November 1996

The author used #HOL to formalise real numbers, including metric, sequences and limits, continuity and differentiation, power series and transcendental functions, integration.

There is also a #CAS and IEEE floating standard verification tools.

#ProofAssistant #Math #FormalVerification #Logic #PLT

Department of Computer Science and Technology – Technical reports: UCAM-CL-TR-408

@jbz You might be joking, but this is actually not so far fetched as one might think:

Code Migration with Formal Verification for Performance Improvement of Legacy Code:

https://www.codemetal.ai/research/code-migration-with-formal-verification-for-performance-improvement-of-legacy-code

Combining AI with Formal Verification for Efficient Migration of Legacy Code:

https://www.codemetal.ai/research/combining-ai-with-formal-verification-for-efficient-migration-of-legacy-code

IMHO, this is one of the most useful applications of #LLMs.

Either way, the future of #AI will belong to automated reasoning/symbolic AI (maybe in combination with LLMs).

#LLM #FormalVerification

Code Metal - Verifiable Code Translation

Verifiable Code Translation for Mission Critical Industries.

Code Metal
🔥🤖 Wow, someone's trying to make "formal verification" the next hip thing, because apparently AI is now the cool kid handing out billion-dollar lunch money 💸. Meanwhile, they forget that "testing" isn't just a 13-minute read or a new year's resolution! 🎉🚀
https://alperenkeles.com/posts/test-dont-verify/ #formalverification #AIinnovation #billiondollarlunchmoney #testingmatters #techtrends #HackerNews #ngated
Test, don't (just) verify

"LLM-Verifier systems get a boost with 4/$\delta$ Bound theorem! #FormalVerification #LLM #SoftwareVerification"

The introduction of Formal Verification tools with large language models (LLMs) has revolutionized software verification, but current methods lack reliability due to the absence of a solid theoretical foundation. This research addresses this gap by developing an LLM-Verifier Convergence...

#FormalVerification #LargeLanguageModels #LLM-VerifierSystems #SoftwareVerification