RegexPSPACE: Regex LLM Benchmark

RegexPSPACE는 LLM의 공간 복잡도 한계를 평가하기 위해 PSPACE-완전 정규표현식 문제(동등성 결정과 최소화)를 기반으로 한 최초의 벤치마크를 제안한다. 100만 개 이상의 정규표현식 인스턴스를 포함하는 대규모 데이터셋을 구축하고, 6개 LLM과 5개 LRM을 대상으로 평가를 수행해 LLM의 장황함과 반복 같은 공통 실패 패턴을 발견했다. 이 연구는 LLM과 LRM의 고급 추론 능력과 공간 계산 한계를 정량적으로 분석하는 새로운 평가 프레임워크를 제공한다.

https://arxiv.org/abs/2510.09227

#llm #benchmark #regex #reasoning #pspace

RegexPSPACE: A Benchmark for Evaluating LLM Reasoning on PSPACE-complete Regex Problems

Large language models (LLMs) show strong performance across natural language processing (NLP), mathematical reasoning, and programming, and recent large reasoning models (LRMs) further emphasize explicit reasoning. Yet their computational limits, particularly spatial complexity constrained by finite context windows, remain poorly understood. While recent works often focus on problems within the NP complexity class, we push the boundary by introducing a novel benchmark grounded in two PSPACE-complete regular expression (regex) problems: equivalence decision (RegexEQ) and minimization (RegexMin). PSPACE-complete problems serve as a more rigorous standard for assessing computational capacity, as their solutions require massive search space exploration. We perform a double-exponential space exploration to construct a labeled dataset of over a million regex instances with a sound filtering process to build the benchmark. We conduct extensive evaluations on 6 LLMs and 5 LRMs of varying scales, revealing common failure patterns such as verbosity and repetition. With its well-defined structure and quantitative evaluation metrics, this work presents the first empirical investigation into the spatial computational limitations of LLMs and LRMs, offering a new framework for evaluating their advanced reasoning capabilities. Our code is available at https://github.com/hyundong98/RegexPSPACE .

arXiv.org
Can be useful
regex101: build, test, and debug regex
https://regex101.com/
#regex #regularexpression #coding
regex101: build, test, and debug regex

Regular expression tester with syntax highlighting, explanation, cheat sheet for PHP/PCRE, Python, GO, JavaScript, Java, C#/.NET, Rust.

regex101

Yep. It's a SpamAssassin/Email/character set thing. Because character sets aren't reliable then the default behaviour is to leave it as-is and do byte matching.

https://stackoverflow.com/questions/49130946/spamassassin-matching-han-chinese-characters

Edit: Except that the docs say the default in 4.x is to enable the conversion. So apparently I'm just dealing with matching UTF-8 bytes rather than character codes 😑

#SpamAssassin #Regex

Spamassassin matching Han/chinese characters

I'm trying to implement a rule that matches all chinese characters (Han) with SpamAssassin version 3.3.1 running on Perl version 5.10.1 So far I tried the following rules: body SPAM44 /\p{Han}...

Stack Overflow

I can't even get `[\x{0400}-\x{058F}]` to match (Cyrillic and Armenian characters) 🤨

#SpamAssassin #Regex #WTF

#ActuallyAutisticElder #AutisticAdults #AdultAutistic
#LateDiagnosedAutism #LateDiagnosed

I was a great technical editor and decent technical writer. I didn't recognize my finding #RegEx unintuitive was a symptom. Bewildered me. #Perl programming brain smash.

https://en.wikipedia.org/wiki/Obfuscated_Perl_Contest

Obfuscated Perl Contest - Wikipedia

#AutismSpectrumDisorder #Autistic #FWakeAutismLanguage
#FWakeHyperLexia #FWakeRegEx #ActuallyAutistic

"Opel was hosting friends at his studio on July 7, 1979, when two armed men, Robert E. Kelly and Maurice Keenan, stormed the studio in an attempted robbery. They demanded drugs and money; Opel denied he had the former and proceeded to kick them out."

My entire life this use of English stack smashes my #autism : "A and B, denied the former."

It's like #RegEx catastrophic backtracking in my #HyperLexia

Finally spent my birthday book tokens at a new bookshop in Stroud. I must have unintentionally walked in and done a search for “The.*of.*”.

#WernerHerzog #UrsulaLeGuin #Stroud #Regex

🧠 #Regex doesn’t have to be scary. At #PSConfEU 2025, @[email protected] broke it down with: ✔ Match, replace, split ✔ Greedy vs lazy ✔ Real-world regex tips in #PowerShell & VS Code 🎟️ Early bird 2026 → psconf.eu #TextParsing #VSCode

- YouTube
Home - PSConfEU

Discover PowerShell scripting & automation at psconf.eu. Join experts, learn, & boost productivity. Elevate your skills today!

PSConfEU

I was inspired to do this by the work "A German Word Family Dictionary" by Howard H. Keller (1978). The words (3000 I believe) are ordered alphabetically by root in that work.

Btw, if you want to get this book, get the print version. The ebook edition’s formatting is horribly fucked up to the point that it's useless. It's mangled OCR slop that was never reviewed by a human.

#german #deutsch #linguistics #language #regex #languageLearning

Many words feature two prefixes. I decided to consider the innermost prefix the "main" prefix, resulting in these sorts of groupings. #german #deutsch #linguistics #language #regex #languageLearning