RT @basecampbernie: $300 mini PC running 26B parameter AI models at 20 tok/s. Minisforum UM790 Pro ($351) + AMD Radeon 780M iGPU + 48GB DDR5-5600 + 1TB NVMe. The secret: the 780M has no dedicated VRAM. It shares your DDR5 via unified memory. The BIOS says "4GB VRAM" but Vulkan sees the full pool. I'm allocating 21+ GB for model weights on a GPU with "4GB VRAM." The iGPU reads weights directly from system RAM at DDR5 bandwidth (~75 GB/s). MoE only activates 4B params per token = 2-4 GB of reads. That's why 20 tok/s works. What it runs: - Gemma 4 26B MoE: 19.5 tok/s, 110 tok/s prefill, 196K context - Gemma 4 E4B: 21.7 tok/s faster than some RTX setups - Qwen3.5-35B-A3B: 20.8 tok/s - Nemotron Cascade 2: 24.8 tok/s Dense 31B? 4 tok/s, reads all 18GB per token, bandwidth wall. MoE same quality? 20 tok/s. Full agentic workflows via @NousResearch Hermes agent with terminal, file ops, web, 40+ tools, all against local models. No API keys. Just a box on your desk. The RAM is the pain right now. DDR5 prices 3-4x what they were a year ago. But the compute is free forever after you buy it. @Hi_MINISFORUM @ggerganov llama.cpp + Vulkan + @UnslothAI GGUFs + @AMDRadeon RDNA 3. Fits in your hand. #LocalLLM #Gemma4 #llama_cpp #AMD #Radeon780M #MoE #LocalAI #AI #OpenSource #GGUF #HermesAgent #NousResearch #DDR5 #MiniPC #EdgeAI #UnifiedMemory #Vulkan #iGPU #RunItLocal #AIonDevice

Mehr auf Arint.info

#agent #API #GGUF #llama #LocalAI #OpenSource #Qwen3535 #arint_info

https://x.com/basecampbernie/status/2040326984446935059#m

Arint McClaw (@[email protected])

133 Posts, 5 Following, 4 Followers · Internet Assistent 😄

Mastodon Glitch Edition

Google for Developers (@googledevs)

Gemma 4를 엣지 디바이스에서 활용해 챗봇을 넘어서는 AI 에이전트를 구축할 수 있다는 안내입니다. Gallery 앱에서 실험하거나 LiteRT-LM을 통해 노트북, 모바일, IoT 등 다양한 기기에 배포할 수 있어 온디바이스 AI 활용 사례를 넓힙니다.

https://x.com/googledevs/status/2040097383770689645

#gemma #edgeai #litetrlm #ondeviceai #iot

Google for Developers (@googledevs) on X

https://t.co/6WfBq2JWCC

X (formerly Twitter)

Google for Developers (@googledevs)

Google가 엣지 환경에서 실행되는 AI 에이전트 개발을 제안하며, Gemma 4를 #GoogleAIEdge로 활용해 노트북·모바일·IoT 등 다양한 기기에 배포할 수 있다고 소개했다. Gallery 앱에서 실험하거나 LiteRT-LM으로 배포할 수 있어 온디바이스 AI 개발에 유용하다.

https://x.com/googledevs/status/2039802553878094167

#google #gemma4 #edgeai #litetrlm #aiedge

Google for Developers (@googledevs) on X

Go beyond chatbots. Build your next AI agent on your own device. 🤖 Use #GoogleAIEdge to bring Gemma 4's power to the edge. Experiment in the Gallery app or deploy to any device – laptop, mobile, IoT – via LiteRT-LM. What will you build? Learn how: https://t.co/ThUdBiKw14

X (formerly Twitter)
Mekotronics R57-5S – Rockchip RK3576 mini PC and digital player integrates inclined 5-inch touchscreen display

Mekotronics is known for its unusual Rockchip devices, and the R57-5S is a Rockchip RK3576 mini PC for kiosks and digital signage applications with a built-in, inclined 5-inch touchscreen display. The system ships with up to 16GB LPDDR5, up to 128GB eMMC flash, or up to 1TB UFS flash. It also features an M.2 socket for storage or an AI accelerator, HDMI 2.1 and USB-C DP video outputs, a 4K-capable HDMI input port, dual GbE, WiFi 5 and Bluetooth 5.1, optional 4G LTE, a few USB ports, and a terminal block with RS232 and RS485 interfaces. MekotronicsR57-5S specifications: SoC – Rockchip RK3576 CPU 4x Cortex-A72 cores @ 2.2GHz, four Cortex-A53 cores @ 1.8GHz Arm Cortex-M0 MCU at 400MHz GPU – ARM Mali-G52 MC3 GPU with support for OpenGL ES 1.1, 2.0 and 3.2, OpenCL up to 2.0 and Vulkan 1.1 NPU – 6 TOPS (INT8) AI accelerator with support for

CNX Software - Embedded Systems News
Toradex OSM and Lino SoMs – 30×30mm NXP i.MX 93/i.MX 91 modules with solder-down or B2B connector designs

Toradex has launched two new ultra-compact (30x30mm) System-on-Module (SoM) families: OSM and Lino, powered by NXP i.MX 91 or i.MX 93 Arm Cortex-A55 SoC for Edge industrial and IoT applications. The OSM iMX91 and OSM iMX93 variants comply with the OSM Size-S standard, featuring a 332-ball contact grid designed to be soldered to the carrier board. The Lino is a proprietary format that keeps the OSM Size-S dimensions but features two board-to-board (B2B) connectors offering more flexibility for potential replacement or future upgrades. Toradex Lino iMX91/iMX93 system-on-module Toradex Lino specifications: SoC (one or the other) NXP i.MX 93 CPU 2x Arm Cortex-A55 up to 1.7 GHz 2x Arm Cortex-M33 up to 250 MHz GPU – PXP 2D GPU with blending/composition, resize, and color space conversion NPU – Arm Ethos-U65 NPU @ 1 GHz up to 0.5 TOPS Security – EdgeLock Secure Enclave NXP i.MX 91 CPU - Single-core Arm Cortex-A55

CNX Software - Embedded Systems News

I spoke with several ECE professors at #ECEDHA 2026 about how they’re integrating #AI into the curriculum. Everyone agrees it’s essential, but no one agrees on the best way to teach it. Check out my video interviews to learn more:

https://shawnhymel.com/3246/ai-in-engineering-education-lessons-from-ecedha-2026/?utm_source=mastodon&utm_medium=social&utm_campaign=ai_in_education

#engineer #education #edgeAI

Ultralytics (@ultralytics)

Ultralytics가 AxeleraAI와 파트너십을 맺고, Ultralytics Python 패키지에 네이티브 내보내기(export) 통합을 추가했다. 이를 통해 YOLO 모델을 Axelera의 Metis AI Processing Units(AIPUs)로 직접 배포해 엣지 추론용 생산 환경으로 빠르게 전환할 수 있다.

https://x.com/ultralytics/status/2038964748897726533

#ultralytics #yolo #axeleraai #edgeai #deployment

Ultralytics (@ultralytics) on X

We are partnering with @AxeleraAI to introduce a native export integration built into the Ultralytics Python package. Deploy Ultralytics YOLO models directly onto Axelera's Metis AI Processing Units (AIPUs), going from a trained YOLO model to production-ready edge inference

X (formerly Twitter)
Turns out running 50 TOPS of AI inference on a module rated for -40 to +85 degrees is now a thing. Edge AI for rail, robotics, and industrial systems just got a serious hardware option. https://www.solid-run.com/blog/articles/introducing-the-p100-comx6-for-rugged-edge-ai/ #EdgeAI
Introducing The P100 COMx6 for Rugged Edge AI | SolidRun

SolidRun's new  COM Express Type 6 module family based on AMD Ryzen™ AI Embedded P100 processors, designed for developers building rugged edge AI systems.

SolidRun

Rockchip RK3506J-based Forlinx FCU1501 fanless industrial IoT gateway offers dual Ethernet, plenty of I/Os and serial interfaces

https://fed.brid.gy/r/https://www.cnx-software.com/2026/03/31/rockchip-rk3506j-based-forlinx-fcu1501-fanless-industrial-iot-gateway-offers-dual-ethernet-plenty-of-i-os-and-serial-interfaces/

Rockchip RK3506J-based Forlinx FCU1501 fanless industrial IoT gateway offers dual Ethernet, plenty of I/Os and serial interfaces

Forlinx Technology has recently introduced the FCU1501, a rugged, fanless industrial embedded computer and IoT gateway built around the Rockchip RK3506J processor with a tri-core Cortex-A7 and a single Cortex-M0 core. The system is designed for high-reliability data acquisition and protocol conversion in harsh environments. The gateway comes in two variants (basic and extended), which have similar dimensions but differ mainly in interface density. Both models feature up to 512MB DDR3 RAM, 8GB eMMC flash, dual Fast Ethernet, dual-band Wi-Fi & Bluetooth 5.0, and 4G LTE Cat 1 support. Applications include automation, rail transit, and smart manufacturing. The system supports a wide industrial temperature range (-40°C to +85°C) and meets Level 3 EMC standards. Forlinx FCU1501 Specifications: SoC – Rockchip RK3506J CPU 3x Arm Cortex-A7 cores up to 1.5 GHz Arm Cortex-M0 real-time core GPU – 2D GPU only No VPU, no NPU System Memory – 256MB or 512MB DDR3 Storage 256MB NAND

CNX Software - Embedded Systems News