Originally published in MIT News, March 25, 2024
Researchers demonstrate a technique that can be used to probe a model to see what it knows about new subjects.
Large language models, such as those that power popular artificial intelligence chatbots like ChatGPT, are incredibly complex. Even though these models are being used as tools in many areas, such as customer support, code generation, and language translation, scientists still don’t fully grasp how they work.
They found a surprising result: Large language models (LLMs) often use a very simple linear function to recover and decode stored facts. Moreover, the model uses the same decoding function for similar types of facts. Linear functions, equations with only two variables and no exponents, capture the straightforward, straight-line relationship between two variables.
The researchers showed that, by identifying linear functions for different facts, they can probe the model to see what it knows about new subjects, and where within the model that knowledge is stored.
To continue reading this article, click here.
You must be logged in to post a comment.
Hey, thank you!
Researchers demonstrate a technique that can be used to probe a model to see New York Knicks OVO Varsity Jacket
By identifying these linear functions, scientists can probe LLMs to understand Pokerogue what they know about new subjects and pinpoint where this knowledge is stored, shedding light on the inner workings of these complex AI systems.
This is exactly what I want to talk about. moto x3m
Thank you for the interesting read. Great blog! Solar
Let the styling be even more innovative as the Donatella Versace Varsity Jacket is the most impressive option to have in the closet.
Pull and Wears are built with premium components and meticulous attention to detail, ensuring a superb fit and long lifespan. Their excellent workmanship ensures long-lasting comfort and beauty.
LLMs are trained on massive datasets that include a wide range of text from books, articles, websites, and other fnf sources. During training, the model learns patterns, structures, and facts from this data.