Reasoning skills of large language models are often overestimated
https://news.mit.edu/2024/reasoning-skills-large-language-models-often-overestimated-0711
#ycombinator #MIT_CSAIL #MIT_IBM_Watson_AI_Lab #MIT_Quest_for_Intelligence #large_language_models_LLMs #computer_reasoning #abstract_task_solving #Counterfactual_tasks #general_AI #Zhaofeng_Wu #Yoon_Kim #Jacob_Andreas
MIT CSAIL researchers developed an evaluation framework that tests large language models on counterfactual tasks. They found that LLMs can often recite answers to familiar problems, but struggle to reason abstractly when the task conditions are changed.
Hacker News