Originally published on the-decoder.com, Oct 18, 2024.
Nvidia has introduced a new large language model that tops several alignment benchmarks. The company attributes this to a training procedure that combines two kinds of reward models: one that rates individual answers and one trained on preference comparisons.
The new model, called Llama-3.1-Nemotron-70B-Instruct, is based on Meta’s open-source Llama 3.1 model. Nvidia optimized it to provide helpful answers to user queries by combining different training methods.
However, the results only show that the answers align better with human preferences, not that the content is necessarily more accurate. In fact, the Nemotron variant performs slightly worse than the base model on the MMLU Pro benchmark, which tests factual knowledge.
Nvidia created two new datasets for training: HelpSteer2 and HelpSteer2-Preference. HelpSteer2 contains over 20,000 prompt-response pairs. Multiple annotators rated each response on a 1-5 scale for criteria like helpfulness, correctness, and coherence. HelpSteer2-Preference adds comparisons between two answers to the same prompt. Annotators indicated which answer they preferred and how strong their preference was.
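To make the structure of the two datasets concrete, here is a minimal sketch of what records of this kind could look like. The field names and values are illustrative assumptions based on the description above, not the datasets' actual schema.

```python
# Illustrative only: field names are assumptions, not the exact schema
# of Nvidia's HelpSteer2 / HelpSteer2-Preference datasets.

# HelpSteer2-style record: one response, rated 1-5 on several criteria.
helpsteer2_example = {
    "prompt": "Explain how a reward model is trained.",
    "response": "A reward model is trained on human ratings of answers ...",
    "helpfulness": 4,   # 1-5 scale
    "correctness": 4,   # 1-5 scale
    "coherence": 5,     # 1-5 scale
}

# HelpSteer2-Preference-style record: two responses to the same prompt,
# plus which one annotators preferred and how strongly.
helpsteer2_preference_example = {
    "prompt": "Explain how a reward model is trained.",
    "response_a": "A reward model is trained on human ratings of answers ...",
    "response_b": "Reward models use gradient descent ...",
    "preferred": "a",
    "preference_strength": 2,  # e.g. 1 = slight, 3 = strong preference
}
```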
Combining reward models
Nvidia used these datasets to train two types of reward models: regression models and Bradley-Terry models. Regression models like SteerLM learn to assign values for different criteria to individual responses. Bradley-Terry models learn from preference comparisons to maximize the reward difference between two responses.
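As a rough illustration of the difference between the two objectives, the sketch below contrasts a regression loss with a Bradley-Terry preference loss in PyTorch. The function names and tensors are placeholders, not Nvidia's implementation.

```python
import torch
import torch.nn.functional as F

def regression_loss(predicted_score, human_rating):
    """SteerLM-style regression: match the annotators' rating for a single response."""
    return F.mse_loss(predicted_score, human_rating)

def bradley_terry_loss(reward_chosen, reward_rejected):
    """Bradley-Terry: maximize the reward gap between preferred and rejected answers."""
    return -F.logsigmoid(reward_chosen - reward_rejected).mean()

# Dummy values to show the two objectives in action:
print(regression_loss(torch.tensor([3.6]), torch.tensor([4.0])))          # rating objective
print(bradley_terry_loss(torch.tensor([1.2]), torch.tensor([0.3])))       # preference objective
```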
The researchers found that combining both approaches yielded the best results. They first trained a SteerLM regression model using only helpfulness ratings. This model then served as the starting point for a scaled Bradley-Terry model, which also considered the strength of preferences between responses.
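One plausible reading of a "scaled" Bradley-Terry loss is to weight each pairwise term by the annotated preference strength, so that strongly preferred answers push the reward gap harder. The snippet below sketches that idea; it is an assumption for illustration, not necessarily Nvidia's exact formulation.

```python
import torch
import torch.nn.functional as F

def scaled_bradley_terry_loss(reward_chosen, reward_rejected, strength):
    # Weight the pairwise log-likelihood by the annotated preference strength.
    return -(strength * F.logsigmoid(reward_chosen - reward_rejected)).mean()
```

Starting this training from the regression model's weights, as described above, then amounts to continuing to train the same network, but with a pairwise objective instead of per-response ratings.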
To align the language model with the learned reward signal, Nvidia used the REINFORCE algorithm. Unlike the commonly used PPO (Proximal Policy Optimization), REINFORCE estimates the value of an action more stably and without bias, according to the team.
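To give a sense of how REINFORCE optimizes a policy against a reward signal, here is a toy PyTorch sketch in which a two-option "policy" stands in for the language model and fixed dummy scores stand in for the reward model. It illustrates the general update rule only, not Nvidia's actual training setup.

```python
import torch

# Toy "policy": a single logit parameter over two possible responses.
logits = torch.zeros(2, requires_grad=True)
optimizer = torch.optim.SGD([logits], lr=0.1)

def reinforce_step(rewards, baseline):
    # Sample a response and compute its log-probability under the policy.
    probs = torch.softmax(logits, dim=-1)
    action = torch.multinomial(probs, 1).item()
    log_prob = torch.log_softmax(logits, dim=-1)[action]
    # REINFORCE: increase the log-probability of responses scoring above the baseline.
    loss = -(rewards[action] - baseline) * log_prob
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

# Dummy reward-model scores; over many steps the better-scored response wins out.
rewards = torch.tensor([0.2, 0.9])
for _ in range(100):
    reinforce_step(rewards, baseline=rewards.mean())
```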
Improved helpfulness and longer responses
The final Llama-3.1-Nemotron-70B-Instruct model achieved first place in several benchmarks: Arena Hard, AlpacaEval 2 LC, and MT-Bench (judged by GPT-4-Turbo). It outperformed top models like GPT-4o and Claude 3.5 Sonnet. In Arena Hard, it scored 85.0, well ahead of the starting model Llama-3.1-70B-Instruct at 55.7.
The new model also produces longer responses, averaging 2,200 characters compared to about 1,800 for other models.
Nemotron passes the strawberry test
The improvements are evident in specific applications. For example, Llama-3.1-Nemotron-70B-Instruct can correctly answer the question “How many r in strawberry?” by going through the letters one by one and counting the “r”s. The original model and commercial competitors often gave the wrong answer to this question.
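For readers who want to reproduce the strawberry prompt themselves, the sketch below shows one way to query the model with Hugging Face transformers. The repository name is assumed to be "nvidia/Llama-3.1-Nemotron-70B-Instruct-HF" (check the hub listing), and running a 70B model locally requires substantial GPU memory.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "nvidia/Llama-3.1-Nemotron-70B-Instruct-HF"  # assumed HF repo name; verify on the hub
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", torch_dtype="auto")

# Ask the question from the article using the model's chat template.
messages = [{"role": "user", "content": "How many r in strawberry?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256)
# Decode only the newly generated tokens (the model's answer).
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```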
Nvidia emphasizes that the new model demonstrates techniques for improving helpfulness in general applications. However, it has not been optimized for specialized domains like mathematics.
The Llama-3.1-Nemotron-70B-Instruct model can be tried for free on HuggingChat and on Nvidia's website.