Mastodawn

나나 May 27

스무디에 바나나 한 개, 베리 항산화 흡수를 84%나 줄입니다

바나나를 스무디에 넣으면 베리의 플라바놀 흡수가 84% 감소합니다. UC Davis 연구 결과와 함께, 플라바놀을 살리는 스무디 조합법을 영양사가 알려드립니다.

https://habitdays.com/posts/12325

Kraken Apr 16

🔴 Drone Attack | 9/10
🇺🇦

Drone attack continues: new targets in Cherkasy, Kyiv, and other oblasts
A drone attack campaign continues across Ukraine, with new targets reported in several regions. In Cherkasy Oblast, the cities of Cherkasy, Chyhyryn, and Shpola were reportedly targeted. The Kyiv region also came under fire, with strikes reported in the areas of Rokytne and Yahotyn. In Poltava Oblast, the towns of Lubny and Zavodske were hit. Northern regions were not spared, with attacks reported in Sumy Oblast, specifically Konotop and Lypova Dolyna, and in Chernihiv Oblast, including Talalaiivka and Ponornytsia. Kharkiv Oblast was also a target of the ongoing assault.

💬 SENTINEL: Хвильова атака, що охоплює значну територію. Можлива спроба перевантажити ППО та завдати максимальних збитків інфраструктурі.

🔗 https://newsgroup.site/prodovzhennya-ataky-dronamy-novi-tsili-na-cherkashchyni-ky/

#Ukraine #Shahed #PPO #Infrastructure

Продовження атаки дронами: нові цілі на Черкащині, Київщині та інших областях

Повітряна атака з використанням дронів-камікадзе триває на території України. За повідомленнями, новими цілями стали об'єкти у Черкаській області, зокрема міста Черкаси, Чигирин та Шпола

NewsGroup

Hacker News Mar 15

Tree Search Distillation for Language Models Using PPO

https://ayushtambde.com/blog/tree-search-distillation-for-language-models-using-ppo/

#HackerNews #TreeSearchDistillation #LanguageModels #PPO #AIResearch #MachineLearning

Tree Search Distillation for Language Models using PPO

Personal website of Ayush Tambde

Show thread

Jonathan Galou Feb 11

@jog Le CLEE Lyon Est s'engage résolument dans le combat de l'égalité femmes-hommes dans l'orientation.
#parcoursavenir #CLEE #PPO

Habr Feb 8

Продвинутые RL алгоритмы: Normal Policy, TRPO, PPO

Большой конспект по продвинутым RL алгоритмам: TRPO и PPO. Автор слегка упоролся в формулах, но это из любви к прозрачности алгоритмов.

https://habr.com/ru/articles/991622/

#Policy_gradient_methods #ActorCritic #reinforcementlearning #ppo #trpo

Продвинутые RL алгоритмы: Normal Policy, TRPO, PPO

Продолжение постов про RL: 1) Intro Reinforcement Learning 2) Reinforcement Learning: Model-free & Deep RL 3) Reinforcement Learning: Policy gradient methods Если вы заметите ошибки в формулах или...

Хабр

Habr Oct 19, 2025

RL (RLM): Разбираемся вместе

Всем привет! Недавно я познакомился с курсом по глубокому обучению с подкреплением от HuggingFace Deep Reinforcement Learning Course и захотел сделать выжимку самого интересного. Эта статья — своего рода шпаргалка по основам Reinforcement Learning (RL) и одному из ключевых алгоритмов — PPO, который лежит в основе тонкой настройки современных LLM (Large Language Models).

https://habr.com/ru/articles/958062/

#Искуственный_интеллект #Машинное_обучение #Алгоритмы #RLHF #LLM #Большие_языковые_модели #RL #Reinforcement_learning #PPO #Proxi

RL (RLM): Разбираемся вместе

Хабр

Edwin G.

Sep 27, 2025

A Vulnerable Sector Check (VSC) pre-employment screening can take over 3 months because of a backlog at the OPP.

https://www.cbc.ca/news/canada/toronto/opp-background-check-backlog-1.7643394
- - -
La vérification des antécédents en vue d’un travail auprès de personnels vulnérables (VATPV) peut prendre plus de 3 mois à cause de retards chez la PPO.

// Article en anglais //

#Ontario #OPP #PPO

Ontario police background check backlog strands social worker in Labrador without a job | CBC News

A social worker from Ontario moved provinces for a new job but can’t begin work without a required background check from the OPP. The agency’s backlog means she could go without any income for months.

CBC

Hacker News Apr 22, 2025

Does RL Incentivize Reasoning in LLMs Beyond the Base Model?
https://limit-of-rlvr.github.io/
#ycombinator #Qwen #Deepseek_R1 #PPO #GRPO #AIME #RLVR #Tsinghua_University

Limit of RLVR

Reasoning LLMs Are Just Efficient Samplers: RL Training Elicits No Transcending Capacity

Dr Priya Sammani ( MBBS ,DFM )Dec 19, 2024

The gentle chords of a familiar song drifted through the living room as sunlight spilled across the kitchen table. I stirred my coffee absently, lost in thought about my neighbor, Emily. #AffordableCareAct #familyhealthplans #financialsecurity #HDHP #healthcoverage #healthinsurance #HMO #insuranceproviders #PPO #USAhealthcare

https://priya.health/best-health-insurance-plans-for-families-in-the-usa/

Best Health Insurance Plans for Families in the USA: A Journey Through Choices and Challenges

The gentle chords of a familiar song drifted through the living room as sunlight spilled across the kitchen table. I stirred my coffee absently, lost in thought

Health With Priya

Putin's IBS Jun 22, 2024

💥 “This is our air defence! What should we do now, Olezha? They won’t catch missiles now!”: #Pantsir_S1 was destroyed near #Belgorod

The #PPO installation was located near #Dubovoy.

After the explosion, residents of the nearby area observed a “rain” of shrapnel.

#ukraine #putinisamasskiller #putinisawarcriminal @kardinal691