Introducing #LMEval – a tool that helps AI researchers & developers compare the performance of different #LLMs.

Designed to be accurate, multimodal, and easy to use, LMEval has already been used to evaluate major models in terms of safety and security.

Dive deeper: https://bit.ly/3T7fgfk

#AI #opensource #Google #InfoQ

🔍 What is Google LMEval? Discover the new AI testing framework!

Unified model evaluation
Multimodal & cross-provider
Efficient, incremental testing

#ai #ki #artificialintelligence #Google #LMEval #LLM

LIKE, share, READ, and FOLLOW now!

https://kinews24.de/google-lmeval-llms-ki-modelle-2025-clever-testen/

At Giskard, we've integrated LMEval into our Phare LLM benchmark (phare.giskard.ai) to independently and rigorously evaluate the safety and security of popular models.

Read the announcement: https://opensource.googleblog.com/2025/05/announcing-lmeval-an-open-ource-framework-cross-model-evaluation.html

#LMEval #AISecurity #LLMEvaluation #OpenSource

Announcing LMEval: An Open Source Framework for Cross-Model Evaluation

Announcing LMEval, an open source framework that simplifies cross-model evaluation and cross-provider model benchmarking.

Google Open Source Blog