fly51fly (@fly51fly)
AI 추론 과정을 얼마나 읽기 쉽게 표현할 수 있는지 측정하는 연구 논문이 공개되었습니다. 사람의 이해가 다른 사람의 추론 과정을 가르치는 데 도움이 되는지 검토하며, 모델의 reasoning trace 해석 가능성과 교육 가능성을 다룹니다.
fly51fly (@fly51fly)
AI 추론 과정을 얼마나 읽기 쉽게 표현할 수 있는지 측정하는 연구 논문이 공개되었습니다. 사람의 이해가 다른 사람의 추론 과정을 가르치는 데 도움이 되는지 검토하며, 모델의 reasoning trace 해석 가능성과 교육 가능성을 다룹니다.
Extended Daily March 24, 2026 - Accelerating Research and Application in the AI Generation Era
In the past 24 hours, prominent research includes single-agent robots, methodological debates in computational social science, frameworks for societal behavior with LLMs, and molecular/social digit...
https://oct-rick-brick.com/en/articles/2026-03-24-extended-daily-2026-03-24/

In the past 24 hours, prominent research includes single-agent robots, methodological debates in computational social science, frameworks for societal behavior with LLMs, and molecular/social digit...

Transformers are the dominant architecture in AI, yet why they work remains poorly understood. This paper offers a precise answer: a transformer is a Bayesian network. We establish this in five ways. First, we prove that every sigmoid transformer with any weights implements weighted loopy belief propagation on its implicit factor graph. One layer is one round of BP. This holds for any weights -- trained, random, or constructed. Formally verified against standard mathematical axioms. Second, we give a constructive proof that a transformer can implement exact belief propagation on any declared knowledge base. On knowledge bases without circular dependencies this yields provably correct probability estimates at every node. Formally verified against standard mathematical axioms. Third, we prove uniqueness: a sigmoid transformer that produces exact posteriors necessarily has BP weights. There is no other path through the sigmoid architecture to exact posteriors. Formally verified against standard mathematical axioms. Fourth, we delineate the AND/OR boolean structure of the transformer layer: attention is AND, the FFN is OR, and their strict alternation is Pearl's gather/update algorithm exactly. Fifth, we confirm all formal results experimentally, corroborating the Bayesian network characterization in practice. We also establish the practical viability of loopy belief propagation despite the current lack of a theoretical convergence guarantee. We further prove that verifiable inference requires a finite concept space. Any finite verification procedure can distinguish at most finitely many concepts. Without grounding, correctness is not defined. Hallucination is not a bug that scaling can fix. It is the structural consequence of operating without concepts. Formally verified against standard mathematical axioms.
Just discovered a hep-th article on #arXiv that is very likely AI-generated.
The items in bibliography are basically made-up.

We investigate the thermodynamic phase transitions of a four-dimensional charged anti-de Sitter black hole endowed with a non-minimal coupling of the form $F^{αβ}F^{γλ}R_{αγβλ}$. Using perturbative methods, we derive a consistent black hole solution and analyze its thermodynamics through both conventional equilibrium techniques and a topological defect classification approach. The system displays van der Waals-like critical behavior, with a swallow-tail structure in the free energy and distinct phase branches. The topological analysis independently confirms the existence of critical points and classifies the system within the universal topological scheme for black hole thermodynamics.
fly51fly (@fly51fly)
대규모 언어모델을 활용한 형식적 반례 생성 연구 'Learning to Disprove: Formal Counterexample Generation with Large Language Models'가 소개되었다. 정리·검증이 필요한 수학/논리 문제에서 LLM을 반례 생성 도구로 쓰는 새로운 연구 방향을 제시한다.
La plateforme de preprints arXiv devient une structure indépendante à but non lucratif
https://next.ink/230231/la-plateforme-de-preprints-arxiv-devient-une-structure-independante-a-but-non-lucratif/
Après 35 ans au sein du laboratoire de Los Alamos puis de l’université Cornell, la plateforme de preprint s’institutionnalise pour diversifier ses sources budgétaires mais aussi avoir une organisation des ressources humaines plus claires.
Extended Daily March 24, 2026 - Accelerating Research and Application in the AI Generation Era
In the past 24 hours, prominent research includes single-agent robots, methodological debates in computational social science, frameworks for societal behavior with LLMs, and molecular/social digit...
https://oct-rick-brick.com/en/articles/2026-03-24-extended-daily-2026-03-24/

In the past 24 hours, prominent research includes single-agent robots, methodological debates in computational social science, frameworks for societal behavior with LLMs, and molecular/social digit...
Avi Chawla (@_avichawla)
TinyLoRA 논문(arXiv: 2602.04118) 소개 트윗으로, 초소형 파라미터 조정만으로 대형 모델의 추론 성능을 개선하는 방법을 시각적으로 설명한다. 새로운 모델 출시보다는 연구 결과 공유에 가깝지만, 경량 파인튜닝과 효율적 적응 기술에 관심 있는 개발자에게 유용하다.