⚠️ Lỗi khi chạy model **Deepseek-R1-Distill-Qwen-14B-Q4_0** trên Mac: "unknown pre-tokenizer type: 'deepseek-r1-qwen'". Có thể llama.cpp chưa hỗ trợ loại pre-tokenizer này?
#LLaMA #Deepseek #AIError #LỗiMôHình #MáyTính #AI
⚠️ Running Deepseek-R1-Distill-Qwen-14B-Q4_0 on Mac fails due to "unknown pre-tokenizer type". Llama.cpp may lack support for this model's tokenizer.
#LLaMA #Deepseek #ModelError #TechIssue #AI #VTech
https://www.reddit.com/r/LocalLLaMA/comments/1pim110/unknown_pretoke
**Post:**
Cập nhật: Fehler tải'argento GLM-4.6-UD-IQ2_M từ Reddit do tensor "blk.92.nextn.embed_tokens.weight" bị missing, đồng thời các tensor "unused" Wallis. Gây ngại do quant hóa khuy-cuốn hoặc tải dữ liệu chậm. Tag: #AI #ModelError #GLM46 #TechProblem #RedditCommunity
https://www.reddit.com/r/LocalLLaMA/comments/1o5pu88/glm46udiq2_m_b0rked/
New open-access article on "Decomposition of the mean absolute error (MAE) into systematic and unsystematic components" in
#PLOSONE, if you're into that sort of thing.
#Statistics #ModelEvaluation #ModelErrorhttps://dx.plos.org/10.1371/journal.pone.0279774
Decomposition of the mean absolute error (MAE) into systematic and unsystematic components
When evaluating the performance of quantitative models, dimensioned errors often are characterized by sums-of-squares measures such as the mean squared error (MSE) or its square root, the root mean squared error (RMSE). In terms of quantifying average error, however, absolute-value-based measures such as the mean absolute error (MAE) are more interpretable than MSE or RMSE. Part of that historical preference for sums-of-squares measures is that they are mathematically amenable to decomposition and one can then form ratios, such as those based on separating MSE into its systematic and unsystematic components. Here, we develop and illustrate a decomposition of MAE into three useful submeasures: (1) bias error, (2) proportionality error, and (3) unsystematic error. This three-part decomposition of MAE is preferable to comparable decompositions of MSE because it provides more straightforward information on the nature of the model-error distribution. We illustrate the properties of our new three-part decomposition using a long-term reconstruction of streamflow for the Upper Colorado River.