fly51fly (@fly51fly)
AI 추론 과정을 얼마나 읽기 쉽게 표현할 수 있는지 측정하는 연구 논문이 공개되었습니다. 사람의 이해가 다른 사람의 추론 과정을 가르치는 데 도움이 되는지 검토하며, 모델의 reasoning trace 해석 가능성과 교육 가능성을 다룹니다.
fly51fly (@fly51fly)
AI 추론 과정을 얼마나 읽기 쉽게 표현할 수 있는지 측정하는 연구 논문이 공개되었습니다. 사람의 이해가 다른 사람의 추론 과정을 가르치는 데 도움이 되는지 검토하며, 모델의 reasoning trace 해석 가능성과 교육 가능성을 다룹니다.

Transformers are the dominant architecture in AI, yet why they work remains poorly understood. This paper offers a precise answer: a transformer is a Bayesian network. We establish this in five ways. First, we prove that every sigmoid transformer with any weights implements weighted loopy belief propagation on its implicit factor graph. One layer is one round of BP. This holds for any weights -- trained, random, or constructed. Formally verified against standard mathematical axioms. Second, we give a constructive proof that a transformer can implement exact belief propagation on any declared knowledge base. On knowledge bases without circular dependencies this yields provably correct probability estimates at every node. Formally verified against standard mathematical axioms. Third, we prove uniqueness: a sigmoid transformer that produces exact posteriors necessarily has BP weights. There is no other path through the sigmoid architecture to exact posteriors. Formally verified against standard mathematical axioms. Fourth, we delineate the AND/OR boolean structure of the transformer layer: attention is AND, the FFN is OR, and their strict alternation is Pearl's gather/update algorithm exactly. Fifth, we confirm all formal results experimentally, corroborating the Bayesian network characterization in practice. We also establish the practical viability of loopy belief propagation despite the current lack of a theoretical convergence guarantee. We further prove that verifiable inference requires a finite concept space. Any finite verification procedure can distinguish at most finitely many concepts. Without grounding, correctness is not defined. Hallucination is not a bug that scaling can fix. It is the structural consequence of operating without concepts. Formally verified against standard mathematical axioms.
Just discovered a hep-th article on #arXiv that is very likely AI-generated.
The items in bibliography are basically made-up.

We investigate the thermodynamic phase transitions of a four-dimensional charged anti-de Sitter black hole endowed with a non-minimal coupling of the form $F^{αβ}F^{γλ}R_{αγβλ}$. Using perturbative methods, we derive a consistent black hole solution and analyze its thermodynamics through both conventional equilibrium techniques and a topological defect classification approach. The system displays van der Waals-like critical behavior, with a swallow-tail structure in the free energy and distinct phase branches. The topological analysis independently confirms the existence of critical points and classifies the system within the universal topological scheme for black hole thermodynamics.
fly51fly (@fly51fly)
대규모 언어모델을 활용한 형식적 반례 생성 연구 'Learning to Disprove: Formal Counterexample Generation with Large Language Models'가 소개되었다. 정리·검증이 필요한 수학/논리 문제에서 LLM을 반례 생성 도구로 쓰는 새로운 연구 방향을 제시한다.
La plateforme de preprints arXiv devient une structure indépendante à but non lucratif
https://next.ink/230231/la-plateforme-de-preprints-arxiv-devient-une-structure-independante-a-but-non-lucratif/
Après 35 ans au sein du laboratoire de Los Alamos puis de l’université Cornell, la plateforme de preprint s’institutionnalise pour diversifier ses sources budgétaires mais aussi avoir une organisation des ressources humaines plus claires.
Avi Chawla (@_avichawla)
TinyLoRA 논문(arXiv: 2602.04118) 소개 트윗으로, 초소형 파라미터 조정만으로 대형 모델의 추론 성능을 개선하는 방법을 시각적으로 설명한다. 새로운 모델 출시보다는 연구 결과 공유에 가깝지만, 경량 파인튜닝과 효율적 적응 기술에 관심 있는 개발자에게 유용하다.
[ENG] ⋅ ArXiv, the pioneering preprint server, declares independence from Cornell
−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
[FR] ⋅ La plateforme de preprints arXiv devient une structure indépendante à but non lucratif
Vectorization of Verilog Designs and its Effects on Verification and Synthesis

Vectorization is a compiler optimization that replaces multiple operations on scalar values with a single operation on vector values. Although common in traditional compilers such as rustc, clang, and gcc, vectorization is not common in the Verilog ecosystem. This happens because, even though Verilog supports vector notation, the language provides no semantic guarantee that a vectorized signal behaves as a word-level entity: synthesis tools still resolve multiple individual assignments and a single vector assignment into the same set of parallel wire connections. However, vectorization brings important benefits in other domains. In particular, it reduces symbolic complexity even when the underlying hardware remains unchanged. Formal verification tools such as Cadence Jasper operates at the symbolic level: they reason about Boolean functions, state transitions, and equivalence classes, rather than about individual wires or gates. When these tools can treat a bus as a single symbolic entity, they scale more efficiently. This paper supports this observation by introducing a Verilog vectorizer. The vectorizer, built on top of the CIRCT compilation infrastructure, recognizes several vectorization patterns, including inverted assignments, assignments involving complex expressions, and inter-module assignments. It has been experimented with some Electronic design automation (EDA) tools, and for Jasper tool, it improves elaboration time by 28.12% and reduces memory consumption by 51.30% on 1,157 designs from the ChiBench collection.