Machine Learning Times
Machine Learning Times
EXCLUSIVE HIGHLIGHTS
Why Alphabet’s Clean Energy Moonshot Depends On AI
 Originally published in Forbes Note: Ravi Jain, Chief Technology Officer...
Predictive AI Only Works If Stakeholders Tune This Dial
 Originally published in Forbes I’ll break it to you gently:...
The Rise Of Large Database Models
 Originally published in Forbes Even as large language models have...
3 Predictions For Predictive AI In 2025
 Originally published in Forbes GenAI’s complementary sibling, predictive AI, makes...
SHARE THIS:

11 months ago
Large language models use a surprisingly simple mechanism to retrieve some stored knowledge

 

Originally published in MIT News, March 25, 2024

Researchers demonstrate a technique that can be used to probe a model to see what it knows about new subjects.

Large language models, such as those that power popular artificial intelligence chatbots like ChatGPT, are incredibly complex. Even though these models are being used as tools in many areas, such as customer support, code generation, and language translation, scientists still don’t fully grasp how they work.

They found a surprising result: Large language models (LLMs) often use a very simple linear function to recover and decode stored facts. Moreover, the model uses the same decoding function for similar types of facts. Linear functions, equations with only two variables and no exponents, capture the straightforward, straight-line relationship between two variables.

The researchers showed that, by identifying linear functions for different facts, they can probe the model to see what it knows about new subjects, and where within the model that knowledge is stored.

To continue reading this article, click here.

10 thoughts on “Large language models use a surprisingly simple mechanism to retrieve some stored knowledge

  1. By identifying these linear functions, scientists can probe LLMs to understand Pokerogue what they know about new subjects and pinpoint where this knowledge is stored, shedding light on the inner workings of these complex AI systems.

     
  2. Researchers have discovered that large language models (LLMs), like those used in AI chatbots, often utilize simple linear functions to decode and retrieve stored information. This surprising finding reveals that these models use the same decoding function for similar types of facts. Invisalign Doctor Site

     

Leave a Reply