Large language models surpass human experts in predicting neuroscience results

Scientific discoveries often hinge on synthesizing decades of research, a task that potentially outstrips human information processing capacities. Large language models (LLMs) offer a solution. LLMs trained on the vast scientific literature could potentially integrate noisy yet interrelated findings to forecast novel results better than human experts. Here, to evaluate this possibility, we created BrainBench, a forward-looking benchmark for predicting neuroscience results. We find that LLMs surpass experts in predicting experimental outcomes. BrainGPT, an LLM we tuned on the neuroscience literature, performed better yet. Like human experts, when LLMs indicated high confidence in their predictions, their responses were more likely to be correct, which presages a future where LLMs assist humans in making discoveries. Our approach is not neuroscience specific and is transferable to other knowledge-intensive endeavours. Large language models (LLMs) can synthesize vast amounts of information. Luo et al. show that LLMs-especially BrainGPT, an LLM the authors tuned on the neuroscience literature-outperform experts in predicting neuroscience results and could assist scientists in making future discoveries.

Source Title

Nature Human Behaviour

Publisher

NATURE PORTFOLIO

Permalink

https://hdl.handle.net/11693/116653

Published Version (Please cite this version)

https://dx.doi.org/10.1038/s41562-024-02046-9

Rights

https://creativecommons.org/licenses/by/4.0/

Collections

Scholarly Publications - UMRAM

Language

English

Type

Article

Full item page

Large language models surpass human experts in predicting neuroscience results

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Rights

Collections

Language

Type

Large language models surpass human experts in predicting neuroscience results

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Share

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Rights

Collections

Language

Type