Large language models use a surprisingly simple mechanism to retrieve some stored knowledge

Apr 7, 2024

By: Adam Zewe

Originally published in MIT News, March 25, 2024

Researchers demonstrate a technique that can be used to probe a model to see what it knows about new subjects.

Large language models, such as those that power popular artificial intelligence chatbots like ChatGPT, are incredibly complex. Even though these models are being used as tools in many areas, such as customer support, code generation, and language translation, scientists still don’t fully grasp how they work.

Researchers who studied how these models retrieve stored knowledge found a surprising result: Large language models (LLMs) often use a very simple linear function to recover and decode stored facts. Moreover, the model uses the same decoding function for similar types of facts. Linear functions, equations with only two variables and no exponents, capture the straightforward, straight-line relationship between two variables.

The researchers showed that, by identifying linear functions for different facts, they can probe the model to see what it knows about new subjects, and where within the model that knowledge is stored.
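To make the idea of a shared linear decoding function concrete, here is a minimal sketch in Python. It is an illustration under stated assumptions, not the researchers' procedure: the vectors are synthetic rather than taken from a real model, the dimensions and the names `W_true`, `b_true`, `W_fit`, and `b_fit` are hypothetical, and ordinary least squares is used here only as a convenient way to fit the function. The sketch fits one linear function on some (subject, object) pairs for a single type of fact and then applies it to subjects it has not seen.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: subject vectors of length 8, object vectors of length 4.
d_subject, d_object, n_facts = 8, 4, 50

# Assumption: for one type of fact, objects are an exact linear
# function of subjects (object = W @ subject + b). Synthetic data only.
W_true = rng.normal(size=(d_object, d_subject))
b_true = rng.normal(size=d_object)
subjects = rng.normal(size=(n_facts, d_subject))
objects = subjects @ W_true.T + b_true

# Fit the linear function on the first 40 facts with least squares.
# A column of ones lets the solver estimate the bias term b as well.
n_train = 40
X_train = np.hstack([subjects[:n_train], np.ones((n_train, 1))])
coef, *_ = np.linalg.lstsq(X_train, objects[:n_train], rcond=None)
W_fit, b_fit = coef[:-1].T, coef[-1]

# Apply the same function to the 10 held-out subjects.
predicted = subjects[n_train:] @ W_fit.T + b_fit
max_error = float(np.abs(predicted - objects[n_train:]).max())
print(f"largest error on held-out facts: {max_error:.2e}")
```

Because the synthetic data is exactly linear and noise-free, the fitted function reproduces the held-out objects almost perfectly. Real model representations would only be approximately linear, so any error measured on them would reflect how well the linear picture holds for that type of fact.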


10 thoughts on “Large language models use a surprisingly simple mechanism to retrieve some stored knowledge”

Fred Newman on April 19, 2024 at 9:11 am said:
Hey, thank you!

Ramsey Morgan on April 20, 2024 at 4:11 pm said:
Researchers demonstrate a technique that can be used to probe a model to see what it knows about new subjects.

Lyly on May 14, 2024 at 4:13 am said:
By identifying these linear functions, scientists can probe LLMs to understand what they know about new subjects and pinpoint where this knowledge is stored, shedding light on the inner workings of these complex AI systems.

Jessica emma on May 18, 2024 at 3:56 am said:
This is exactly what I want to talk about.

Solar Cat on May 24, 2024 at 9:04 am said:
Thank you for the interesting read. Great blog!

Jack Hunter on June 28, 2024 at 6:43 am said:
Let the styling be even more innovative as the Donatella Versace Varsity Jacket is the most impressive option to have in the closet.

Qasim Latif on August 16, 2024 at 6:17 pm said:
Pull and Wears are built with premium components and meticulous attention to detail, ensuring a superb fit and long lifespan. Their excellent workmanship ensures long-lasting comfort and beauty.

Magnet Alice on September 5, 2024 at 5:31 am said:
LLMs are trained on massive datasets that include a wide range of text from books, articles, websites, and other sources. During training, the model learns patterns, structures, and facts from this data.

David Miller on December 14, 2024 at 8:10 pm said:
This appears to be quite comfortable and ideal for the next cold days. Excellent decision!

Edward Johnson on February 14, 2025 at 6:57 am said:
Researchers have discovered that large language models (LLMs), like those used in AI chatbots, often utilize simple linear functions to decode and retrieve stored information. This surprising finding reveals that these models use the same decoding function for similar types of facts.

The Machine Learning Times © 2025 • 1221 State Street • Suite 12, 91940 • Santa Barbara, CA 93190
Produced by: Rising Media & Prediction Impact
