Mastodawn

I built a tiny LLM to demystify how language models work

#HackerNews #tinyLLM #languageModels #AI #demystify #coding #guppylm

GitHub - arman-bd/guppylm: A ~9M parameter LLM that talks like a small fish.

A ~9M parameter LLM that talks like a small fish. Contribute to arman-bd/guppylm development by creating an account on GitHub.

GitHub

AI Daily Post Nov 15, 2025

Google’s new supervised reinforcement‑learning approach lets tiny language models learn sequential decision‑making as well as larger systems. The open‑source‑friendly method could close the performance gap for small models, making AI more accessible. Curious how it works? Read the full story. #SupervisedRL #TinyLLM #OpenSourceAI #GoogleAI

🔗 https://aidailypost.com/news/google-introduces-supervised-reinforcement-learning-close-gap-small

Reddit Tech VN Bot Oct 19, 2025

Chuyển tự động tải lại sau timeout (TTL 300-600s) + cấu hìnhenciar nhiều mô hình không cần giải phóng tất cả. Tùy chỉnh quy tắc tải mô hình đồng thời? Cấu hình mẫu có 7 mô hình với TTL khác nhau. #LLM #AI #LlamaSwap #TinyLLM #MôHìnhNgônNgữ #TựDộng

https://www.reddit.com/r/LocalLLaMA/comments/1oaoprv/llamaswap_automatic_unloading_after_timeout/

Hacker News Apr 28, 2025

Tiny-LLM – a course of serving LLM on Apple Silicon for systems engineers

https://github.com/skyzh/tiny-llm

#HackerNews #TinyLLM #AppleSilicon #LLMsystems #EngineersCourse #MachineLearning #GitHub

GitHub - skyzh/tiny-llm: A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen. - skyzh/tiny-llm

GitHub

stackdpodcast Jan 15, 2025

Did you listen to #75 yet? @javajuneau @kito99 @dhinojosa and guest @nraychaudhuri (founder of @tublian) discuss #JakartaEE 12, software dev agents, #TabNine, #LangGraph, #TinyLLM, #NVIDIA’s new model, #JDK 23, Tublian using #AI to empower the devs. https://www.pubhouse.net/2024/12/stackd-75-hes-a-mystery-man.html

Stackd 75: He’s a mystery man – Pub House Network

stackdpodcast Dec 4, 2024

Stackd #75: "He's a mystery man" is out! @javajuneau, @kito99, and @dhinojosa are joined by special guest @nraychaudhuri, founder of @tublian and author of #Scala in Action. They discuss the retirement of James Gosling, #JakartaEE 12, software development agents, #TabNine, #LangGraph for Java, #TinyLLM, #NVIDIA’s nvidia/Llama-3.1-Nemotron-70B-Instruct model, #JDK 23, and Tublian’s use of #AI to empower the next generation of software developers. https://buff.ly/4f2u3Rf

Stackd 75: He’s a mystery man – Pub House Network

Kyle Leaders Apr 27, 2024

The first model in my #tinyllm review series is #qwen 1.5 0.5b. A truly tiny model that can run comfortably on my Lichee Pi4a #riscv64 single board system. Read the article and let me know if you can think of anything else I should test on #qwen and the future models.

https://kyle.works/blog/tiny-llm-reviews-qwen-1-5/

#python #llm #genai #qwen #ChatGPT #llama #phi

Tiny LLM Reviews: Qwen 1.5 0.5b

Lets explore Qwen 1.5 0.5b, the new tiniest LLM from Alibaba

Kyle.works the site of Kyle Leaders