Posts

Showing posts with the label fancy

Big language models can't plan, even if they write fancy essays

Image
This article is part of our coverage of the latest AI research. Large language models such as GPT-3 have grown to the point where it is difficult to measure the limits of their capabilities. When you have a very large neural network that can produce articles, write software code, and engage in conversations about feelings and life, you should expect it to be able to reason about tasks and plans like humans do, right? Wrong. A study by researchers at Arizona State University, Tempe, showed that when it comes to planning and methodical thinking, LLMs perform very poorly, and suffer from many of the same failures observed in today’s deep learning systems. Regards, humanoids Subscribe to our newsletter now for weekly recaps of our favorite AI stories in your inbox. Interestingly, this study found that, although very large LLMs such as GPT-3 and PaLM pass many tests intended to evaluate reasoning abilities and artificial intelligence systems, they do so because these benchmarks are too si