Friday, December 30, 2022

GPT-3 displayed a surprisingly strong capacity for abstract pattern induction, matching or even surpassing human capabilities in most settings; large language models have acquired an emergent ability to find zero-shot solutions to a broad range of analogy problems

Emergent Analogical Reasoning in Large Language Models. Taylor Webb, Keith J. Holyoak, Hongjing Lu. Dec 19 2022. https://arxiv.org/abs/2212.09196v1


Abstract: The recent advent of large language models - large neural networks trained on a simple predictive objective over a massive corpus of natural language - has reinvigorated debate over whether human cognitive capacities might emerge in such generic models given sufficient training data. Of particular interest is the ability of these models to reason about novel problems zero-shot, without any direct training on those problems. In human cognition, this capacity is closely tied to an ability to reason by analogy. Here, we performed a direct comparison between human reasoners and a large language model (GPT-3) on a range of analogical tasks, including a novel text-based matrix reasoning task closely modeled on Raven's Progressive Matrices. We found that GPT-3 displayed a surprisingly strong capacity for abstract pattern induction, matching or even surpassing human capabilities in most settings. Our results indicate that large language models such as GPT-3 have acquired an emergent ability to find zero-shot solutions to a broad range of analogy problems.


---

GPT-3 “has been forced to develop mechanisms similar to those thought to underlie human analogical reasoning — despite not being explicitly trained to do so […] through a radically different route than that taken by biological intelligence.”


No comments:

Post a Comment