Brown University study finds AI language models show basic understanding of real-world events

Christina H. Paxson President
Christina H. Paxson President
0Comments

AI language models can distinguish between events that are common, improbable, impossible, or nonsensical, according to a new study from Brown University released on April 22. The research will be presented at the International Conference on Learning Representations in Rio de Janeiro.

The findings matter as they suggest that artificial intelligence systems are developing an internal representation of how the world works—one that aligns with human reasoning about plausibility and causality. This could influence future development of more reliable and trustworthy AI tools.

The study examined several open-source AI language models by analyzing their responses to sentences describing various scenarios. Examples included commonplace situations like “Someone cooled a drink with ice,” improbable ones such as “Someone cooled a drink with snow,” impossible events like “Someone cooled a drink with fire,” and nonsensical statements such as “Someone cooled a drink with yesterday.” Researchers then used mechanistic interpretability—a method likened to neuroscience for machines—to analyze the mathematical states generated inside each model when processing these sentences.

“This work reveals some evidence that language models have encoded something like the causal constraints of the real world,” said Michael Lepori, a Ph.D. candidate at Brown who led the research. “Beyond just encoding these constraints, they do so in a way that is predictive of human judgments of these categories.”

By comparing differences in internal representations across multiple AI systems—including OpenAI’s GPT-2, Meta’s Llama 3.2, and Google’s Gemma 2—the researchers found distinct patterns strongly correlated with each plausibility category. These patterns allowed the models to differentiate even closely related categories (such as improbable versus impossible) with about 85% accuracy.

Lepori said: “Mechanistic interpretability can be appropriately characterized as something like neuroscience for AI systems… You could kind of think about it as understanding what is encoded in the ‘brain state’ of the machine.” He added that when people disagreed over whether certain statements were impossible or merely unlikely—such as “Someone cleaned the floor with a hat”—the models mirrored this uncertainty by assigning similar probabilities.

According to Lepori, “What we show is that the models actually capture that human uncertainty pretty well… In cases where, say, 50% of people said a statement was impossible and 50% said it was improbable, the models were assigning roughly 50% probability as well.”

These results suggest modern AI language systems can develop an understanding reflective of human thought processes even at relatively modest scales (over two billion parameters). The researchers believe further mechanistic interpretability studies may help create smarter and more trustworthy artificial intelligence.



Related

Christina H. Paxson President

Brown University study links teen violence exposure to increased tobacco use

A new Brown University study finds adolescent exposure to various forms of violence increases their likelihood of smoking cigarettes or using e-cigarettes. Researchers suggest routine assessment by educators or healthcare providers could aid early intervention.

Christina H. Paxson President

Researchers at Brown and Michigan create new structural state of matter

Brown University and University of Michigan researchers report creating a novel nanoparticle superlattice that stabilizes an elusive phase between two common metal crystal arrangements. Their findings reveal unique quantum optical properties potentially useful for quantum technologies.

Christina H. Paxson President

Brown University researchers develop method to compress and stream 3D volumetric video

Brown University researchers have developed PackUV—a new technique enabling efficient compression of large-scale volumetric (3D) videos for easier storage and streaming using standard codecs. Their method could bring immersive experiences like navigable sports or concert footage closer to mainstream devices.