Productizing Large Language Models

data analytics, Deep Learning, language models, Large Language Models, LLM, Machine Learning, Predictive Analytics

By: Amjad Masad, Samip Dahal, Luis Héctor Chávez

Originally posted on Replit.com, Sept 21, 2022.

Large Language Models (LLMs) are known for their near-magical ability to learn from very few examples, as few as zero, and create language wonders. LLMs can chat, write poetry, write code, and even do basic arithmetic. However, the same properties that make LLMs magical also make them challenging from an engineering perspective.

At Replit we have deployed transformer-based language models of all sizes: ~100M-parameter models for search and spam, 1-10B-parameter models for a code autocomplete product we call GhostWriter, and 100B+ parameter models for features that require a higher reasoning ability. In this post we’ll talk about what we’ve learned about building and hosting large language models.

Nonsense

Any sufficiently advanced bullshit is indistinguishable from intelligence, or so the LLM thought. LLMs are super suggestible; in fact, the primary way to interact with LLMs is via “prompting.” Basically, you give the LLM a string of text and it generates a response, mostly in text form, although some models can also generate audio or even images. The problem is, you can prompt the LLM with nonsense and it will generate nonsense. Garbage in, garbage out. Also, LLMs tend to get stuck in loops, repeating the same thing over and over again, since they have a limited attention span when dealing with novel scenarios that were not present during training.
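To make the looping failure mode concrete, here is a minimal sketch of one way to flag a generated response that ends in a repetition loop. It is not the approach described in the original post; the function name `has_repetition_loop`, its parameters, and the use of whitespace-separated words as stand-ins for model tokens are all assumptions made for illustration.

```python
def has_repetition_loop(text: str, max_period: int = 8, min_repeats: int = 3) -> bool:
    """Return True if `text` ends with the same word sequence repeated back to back.

    Hypothetical helper: words are split on whitespace, which only
    approximates how a real model tokenizes its output.
    """
    tokens = text.split()
    # Try every candidate loop length, from a single word up to max_period words.
    for period in range(1, max_period + 1):
        needed = period * min_repeats
        if len(tokens) < needed:
            break  # longer periods need even more tokens, so stop here
        tail = tokens[-needed:]
        unit = tail[:period]
        # The tail is a loop if every period-sized chunk equals the first chunk.
        if all(tail[i:i + period] == unit for i in range(0, needed, period)):
            return True
    return False


if __name__ == "__main__":
    print(has_repetition_loop("the cat sat on the mat"))
    print(has_repetition_loop("I am fine I am fine I am fine"))
```

A check like this could run on streamed output so that generation is cut off, or retried with different sampling settings, once the tail starts repeating. The thresholds are arbitrary and would need tuning for real use.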

To continue reading this article, click here.

10 thoughts on “Productizing Large Language Models”

compton sosa on October 23, 2022 at 10:08 pm said:

Thanks for sharing. That’s what interests me the most. If you love entertainment like me, you can play it here basket random
Danial Berku on February 5, 2023 at 1:59 pm said:

Well said. And yes, it is one big BS and nonsense! I can’t say any positive s words about this.
- Danial Berku on February 11, 2023 at 9:40 am said:
  Thinking about how many positive t words they said about LLM and what a BS it really is, it is laughable.

The Machine Learning Times © 2020 • 1221 State Street • Suite 12, 91940 • Santa Barbara, CA 93190
Produced by: Rising Media & Prediction Impact
