🧠 #SpreadsheetLLM di #Microsoft mira a risolvere un limite nel lavoro con i #LLM: i task sui fogli di calcolo.
🤔 Ma non avevamo già Code Interpreter? NO. In quel caso viene usato un LLM per sviluppare ed eseguire uno script #Python che legge i dati da un foglio e li può elaborare.

💡 In questo caso, il modello permette di usare un LLM per interagire direttamente con i dati nel foglio.

🔗 Il paper: https://arxiv.org/abs/2407.09025 

#AI #GenAI #GenerativeAI #IntelligenzaArtificiale 

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Spreadsheets are characterized by their extensive two-dimensional grids, flexible layouts, and varied formatting options, which pose significant challenges for large language models (LLMs). In response, we introduce SpreadsheetLLM, pioneering an efficient encoding method designed to unleash and optimize LLMs' powerful understanding and reasoning capability on spreadsheets. Initially, we propose a vanilla serialization approach that incorporates cell addresses, values, and formats. However, this approach was limited by LLMs' token constraints, making it impractical for most applications. To tackle this challenge, we develop SheetCompressor, an innovative encoding framework that compresses spreadsheets effectively for LLMs. It comprises three modules: structural-anchor-based compression, inverse index translation, and data-format-aware aggregation. It significantly improves performance in the spreadsheet table detection task, outperforming the vanilla approach by 25.6% in GPT4's in-context learning setting. Moreover, fine-tuned LLM with SheetCompressor has an average compression ratio of 25 times, and achieves a state-of-the-art 78.9% F1 score, surpassing the best existing models by 12.3%. Finally, we propose Chain of Spreadsheet for downstream tasks of spreadsheet understanding and validate it in a new and demanding spreadsheet QA task. We methodically leverage the inherent layout and structure of spreadsheets, demonstrating that SpreadsheetLLM is highly effective across a variety of spreadsheet tasks.

arXiv.org

https://arxiv.org/abs/2407.09025
12 Jul 2024

#SpreadsheetLLM: Encoding #Spreadsheets for Large Language Models

"we introduce SpreadsheetLLM, pioneering an efficient encoding method designed to unleash and optimize LLMs' powerful understanding and reasoning capability on spreadsheets."

https://www.theregister.com/2024/07/16/microsoft_research_llms_grok_spreadsheets/

"to analyze the spreadsheet structure and discard anything outside of a table"

It's really TabularDataLLM, throwing away the really useful working part of a spreadsheet, the #Excel formulas!

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Spreadsheets are characterized by their extensive two-dimensional grids, flexible layouts, and varied formatting options, which pose significant challenges for large language models (LLMs). In response, we introduce SpreadsheetLLM, pioneering an efficient encoding method designed to unleash and optimize LLMs' powerful understanding and reasoning capability on spreadsheets. Initially, we propose a vanilla serialization approach that incorporates cell addresses, values, and formats. However, this approach was limited by LLMs' token constraints, making it impractical for most applications. To tackle this challenge, we develop SheetCompressor, an innovative encoding framework that compresses spreadsheets effectively for LLMs. It comprises three modules: structural-anchor-based compression, inverse index translation, and data-format-aware aggregation. It significantly improves performance in the spreadsheet table detection task, outperforming the vanilla approach by 25.6% in GPT4's in-context learning setting. Moreover, fine-tuned LLM with SheetCompressor has an average compression ratio of 25 times, and achieves a state-of-the-art 78.9% F1 score, surpassing the best existing models by 12.3%. Finally, we propose Chain of Spreadsheet for downstream tasks of spreadsheet understanding and validate it in a new and demanding spreadsheet QA task. We methodically leverage the inherent layout and structure of spreadsheets, demonstrating that SpreadsheetLLM is highly effective across a variety of spreadsheet tasks.

arXiv.org
SpreadsheetLLM by Microsoft: A Game Changer for Data Analysts and Accountants

Microsoft introduces SpreadsheetLLM, a powerful AI model designed to revolutionize spreadsheet data management and analysis. Promising enhanced efficiency and intelligent user interactions, SpreadsheetLLM is set to significantly impact data analysts and accountants. Learn about its groundbreaking features and how it aims to transform the finance world.

Tech Chill

Microsoft unveils prototype LLM designed to analyse spreadsheets

SpreadsheetLLM excels at tabulated number crunching, Redmond's researchers claim

https://www.computing.co.uk/news/4335791/microsoft-unveils-prototype-llm-designed-analyse-spreadsheets

#technews #microsoft #llm #ai #spreadsheetllm #spreadsheet #genai

Microsoft unveils prototype LLM designed to analyse spreadsheets

Microsoft has unveiled a new experimental large language model (LLM), dubbed SpreadsheetLLM, specifically designed to tackle the challenge of spreadsheets.