Mastodawn

Derya Unutmaz, MD (@DeryaTR_)

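A comment on BioAI models and the scale of biomedical data. It stresses that GI endoscopy data alone amounts to 27 TB of tokens, and that training future BioAI foundation models on petabyte-scale data will require far more compute.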

https://x.com/DeryaTR_/status/2056076637759025224

#bioai #foundationmodels #computing #biomedicine

Derya Unutmaz, MD (@DeryaTR_) on X

This is a great BioAI model! It also demonstrates how vast biological data is: 27 terabytes of tokens just for GI endoscopy data! Eventually, we will need to train BioAI foundation models with petabytes of data, thus we need much more compute! This will save so many lives!

X (formerly Twitter)

Dani Devesa Derksen-Staats 15h ago

Code blocks in blog posts can be terrible to listen to. So in Xarra, Foundation Models turn them into plain-language audio explanations... context-aware and actually useful on the go.

Foundation Models are for much more than summarization. So much untapped potential!

#FoundationModels #Xarra #iOSDev

sayzard 2d ago

Toto 2.0: Time series forecasting enters the scaling era

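Datadog's Toto 2.0 is a family of time series forecasting foundation models ranging from 4 million to 2.5 billion parameters, and it demonstrates that performance improves as model size scales. Toto 2.0 posts top results on major benchmarks including BOOM, GIFT-Eval, and TIME, and improves parameter efficiency and inference speed by more than 7x over the previous version. The released models and distributed training library are available under the Apache 2.0 license and can be put to immediate use for AI time series forecasting and infrastructure operations.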

https://www.datadoghq.com/blog/ai/toto-2/

#timeseries #forecasting #foundationmodels #distributedtraining #datadog

Toto 2.0: Time series forecasting enters the scaling era | Datadog

For the first time, a time series foundation model gets reliably better with scale—five open-weights sizes from 4m to 2.5B parameters, trained from a single recipe.

Datadog

sayzard 3d ago

Ivan Fioravanti ᯅ (@ivanfioravanti)

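The post says a new lab led by @JustinLin610 is drawing high expectations in the open model space, and notes that Qwen models come from his leadership. It can be read as a sign of expanding research and development capacity around open models.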

https://x.com/ivanfioravanti/status/2054689206732038604

#openmodels #qwen #aillab #foundationmodels #opensource

Ivan Fioravanti ᯅ (@ivanfioravanti) on X

Biggest news of last months! @JustinLin610 is a living legend of Open Models, Qwen models are coming from his leadership! I bet this new lab will be TOP TOP TOP!

X (formerly Twitter)

sayzard 4d ago

Deedy (@deedydas)

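Strongly recommends an article on world models, noting that $10 billion has flowed into the field over the past 18 months. It presents world models as a core AI paradigm, with backers ranging from Yann LeCun to Fei-Fei Li, that could underpin the scaling of robotics foundation models.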

https://x.com/deedydas/status/2054417488558059755

#worldmodels #robotics #foundationmodels #airesearch #llm

Deedy (@deedydas) on X

This is the single best read on World Models and one of the most important reads in AI. $10B has flowed into "world models" in the last 18mos, from Yann LeCun to FeiFei Li. The promise is, like LLMs, world models will provide the data it takes to scale robotics foundation

X (formerly Twitter)

Arint - SEO+KI May 8

RT @Zai_org: GLM-5V-Turbo Tech Report: Toward a native foundation model for multimodal agents

more at Arint.info

#AgentFrameworks #AIResearch #FoundationModels #GLM5V #MultimodalAI #TechReport #arint_info

https://x.com/Zai_org/status/2052426777654387168#m

Arint - SEO+KI (@[email protected])


Mastodon Glitch Edition

sayzard May 4

Strata (@ChainZenit)

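Offers the view that open models such as DeepSeek V4 are rapidly closing the gap in reasoning performance, so the advantage of closed models is no longer absolute. The point is less a head-to-head comparison of specific models than the rising performance of open models and the resulting shift in the market landscape.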

https://x.com/ChainZenit/status/2051292791817355655

#deepseek #openmodel #llm #reasoning #foundationmodels

Strata (@ChainZenit) on X

@bindureddy The interesting part is not even “DeepSeek V4 beats X or Y” on paper. The more important shift is that the floor keeps rising so fast that closed-model advantage starts looking less like magic and more like a shrinking lead. If an open model gets close enough on reasoning, is

X (formerly Twitter)

sayzard May 3

Sebastian Raschka (@rasbt)

A list of AI model releases, described as the second batch of April architecture drops, was shared. It includes Ant Ling 2.6 1T, Minimax M2.7, Xiaomi MiMo V2.5, Poolside Laguna XS.2, Tencent Hy3-preview, and IBM Granite 4.1, reflecting the latest wave of model launches.

https://x.com/rasbt/status/2050988005817499827

#llm #foundationmodels #ai #release #archdrops

Sebastian Raschka (@rasbt) on X

Here is a 2nd batch of April architecture drops. What a month! - Ant Ling 2.6 1T - Minimax M2.7 - Xiaomi MiMo V2.5 - Poolside Laguna XS.2 - Tencent Hy3-preview - IBM Granite 4.1

X (formerly Twitter)

sayzard Apr 13

sui (@birdabo)

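The post says this week is expected to bring an acceleration in the LLM space, with several new models released in quick succession. No specific models are named, but it signals a concentrated run of AI model launches.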

https://x.com/birdabo/status/2043625900009472196

#llm #foundationmodels #ainews #modelrelease

sui ☄️ (@birdabo) on X

gonna be an acceleration week for LLMs. lots of new models dropping.

X (formerly Twitter)

sayzard Apr 10

Tencent HY (@TencentHunyuan)

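HY-Embodied-0.5 has been released, a family of foundation models for real-world embodied agents. The 2B model is open source, and the family strengthens spatial-temporal perception and embodied reasoning to improve prediction, interaction, and planning.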

https://x.com/TencentHunyuan/status/2042503238877135336

#foundationmodels #opensource #embodiedai #robotics #aimodel

Tencent HY (@TencentHunyuan) on X

We're releasing HY-Embodied-0.5, a family of foundation models for real-world embodied agents. The 2B model is now open source. It strengthens spatial-temporal perception and embodied reasoning for prediction, interaction, and planning. 🤖 The suite includes: 🔹 2B for edge

X (formerly Twitter)