fly51fly (@fly51fly)
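The paper 'SimMerge: Learning to Select Merge Operators from Similarity Signals' proposes a method that uses similarity signals to learn which merge operator to apply when merging models. Authored by O. Bolton et al. (Cohere, Google) and published on arXiv, it could influence research on model merging and parameter consolidation as well as MLOps practice.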
🎯 Zero accuracy loss - preserves what matters: errors, anomalies, high-scoring items & query-relevant content using BM25/embedding similarity
✅ Full provider support: #OpenAI, #Anthropic, #Google, #Cohere, #Mistral & #LiteLLM with exact token counting
📈 Performance: Search results (1000 items) 45K→4.5K tokens (90%), Log analysis 22K→3.3K tokens (85%), API responses 15K→2.25K tokens (85%)
Manuel Faysse (@ManuelFaysse)
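ViDoRe V3 paper released: it details how AI agents and more than 12,000 hours of human annotation were used to scale a 'realistic' retrieval benchmark. V3 scores have already been reported in recent Visual Document Retrieval releases from Cohere and Alibaba_Qwen, and the paper is posted on arXiv.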

The ViDoRe V3 paper is out! We detail how we scaled *realistic* retrieval benchmarking using AI agents and >12k hours of human annotation. V3 scores are already reported in recent Visual Document Retrieval releases from @cohere @Alibaba_Qwen ! https://t.co/9NrGmS3ZfK
30. A US federal district court just ruled that paraphrases or summaries by the #AI tool #Cohere might infringe publisher copyrights on the original full texts.
https://copyrightlately.com/court-rules-ai-news-summaries-may-infringe-copyright/
Here's the Nov 17 decision by the federal district court for the Southern District of NY.
https://www.courtlistener.com/docket/69636122/59/advance-local-media-llc-v-cohere-inc/
PS: This could undermine my thesis in this thread. But it doesn't undermine it yet. As I pointed out in the second post, "If a paraphrase doesn't use the original expression or track it too closely, then it doesn't infringe. If it does track the original too closely, it might count as a derivative work." The question in this case is whether some Cohere summaries were too close to the originals. Cohere lost a motion to dismiss, and now the court will investigate the "substantial similarity" claims on the merits. If the publishers win, we'll learn more about where the line is, not that there is no line.
Press Gazette: News publishers win first round of copyright claim against AI start-up Cohere. “News publishers have celebrated a victory in the first stage of their copyright lawsuit against Canadian AI start-up Cohere. A judge has rejected in full Cohere’s motion to dismiss, saying publishers had ‘adequately alleged’ that outputs from the AI provider were ‘quantitatively and qualitatively […]
Warning: Anyone who uses AI services as a permanent foundation for knowledge, archives, or societal decisions implicitly adopts the priorities of their operators: structures close to capital, the cloud, and the state. The result: loss of commons, dependency, dwindling plurality.
Affected ecosystems (structurally, not morally):
Hyperscalers / cloud monocultures: #AWS #Azure #Microsoft #GoogleCloud #GCP #OracleCloud #IBMCloud #AlibabaCloud #TencentCloud
LLM platforms (hosted, not local): #OpenAI #ChatGPT #Gemini #Anthropic #Claude #Cohere #MistralHosted #StabilityAICloud
State-aligned / security-industry AI: #Palantir #ThalesAI #DefenseAI #GovCloudAI
Commercial AI systems coupled to consumption and advertising: #MetaAI #InstagramAI #FacebookAI #YouTubeAI #TikTokAI #AmazonAI #AdsDrivenAI
Shopping & payment integrations: #WalmartAI #ShopifyAI #PayPalAI #StripeAI #RetailAI
Ecosystems with deep user lock-in: #AppleAI #GoogleEcosystem #Microsoft365AI #SamsungAI
Profiling / behavioral intelligence: #GoogleAds #MetaAds #SnowflakeAI #DatabricksAI #SalesforceEinstein
GPU / compute oligopoly: #NvidiaAI #CoreWeave
Bottom line: Digital sovereignty does not come from using these services, but from decentralized, open, locally controllable systems.
#Infologie #Commons #Souveränität #SmallWeb #DeCloud #DigitalAutonomy #Dorfzwockel
If, like me, you are not an AI expert but are curious about it, this interview with Joëlle Pineau is fun to watch and listen to.