Mastodawn

#Granite4.1:8b runs faster than #gemma4:e2b. For a start, it takes up less space, and I mainly use it for summarizing articles: producing a translated, concise summary of an article takes about 30 seconds.

#ollama

TechLİfe Nov 18, 2025

IBM Unveils Granite 4.0: Hyper-Efficient Hybrid Models

https://techlife.blog/posts/ibm-granite-4-0-hyper-efficient-high-performance-hybrid-models-for-enterprise/

#LLM #AI #Granite4

IBM launches Granite 4.0, a new generation of hyper-efficient, high-performance hybrid models for enterprise applications.

Markus Eisele Oct 21, 2025

New tutorial: Build a real-time AI progress tracker with Quarkus, LangChain4j, Ollama & Granite 4!
Make your LLM pipeline transparent: show retrieval, prompt construction & model calls step by step with SSE + a modern UI.

https://www.the-main-thread.com/p/quarkus-langchain4j-granite4-ai-progress-tracker

#Java #Quarkus #LangChain4j #LLM #Granite4

Reddit Tech VN Bot Oct 6, 2025

Might the GGUF format already support hybrid Transformer/Mamba LLMs? LM Studio already has GGUF files for IBM's Granite 4.0. Users want to convert Phi-4-mini-flash-reasoning (MSFT) and Nemotron-Nano-9B-v2 (Nvidia) to GGUF to run them locally. The thread discusses technical feasibility and inference cost.

#GGUF #LLM #AI #Mamba #Transformer #Granite4 #Phi4 #NemotronNano #MáyHọc #TríTuệNhânTạo #MôHìnhNgônNgữ

https://www.reddit.com/r/LocalLLaMA/comments/1nzpjz8/how_did_lm_studio_convert_ibms_granite_40_mod

Reddit Tech VN Bot Oct 6, 2025

A developer fine-tuned the IBM Granite-4.0 model using Python and Unsloth. Despite its small size, the model shows low latency and surprisingly high accuracy. The LoRA has been published on Hugging Face, and a write-up detailing the fine-tuning process is also available.

#AI #MachineLearning #Granite4 #Unsloth #TinhChinhAI #HocMay

https://www.reddit.com/r/SideProject/comments/1nzkh6s/finetuned_the_ibm_granite_using_python_and_unsloth/