Machine Learning Times
How to fine-tune: Focus on effective datasets


Originally published in ai.meta.com/blog, August 7, 2024.

This is the third blog post in a series about adapting open source large language models (LLMs). In this post, we explore some rules of thumb for curating a good training dataset.

In Part 1, we took a look at prevalent approaches for adapting language models to domain data.
In Part 2, we discussed how to determine if fine-tuning is the right approach for your use case.

Introduction
Fine-tuning LLMs is a mix of art and science, with best practices in the field still emerging. In this blog post, we’ll highlight design variables for fine-tuning and give directional guidance on best practices we’ve seen so far to fine-tune models with resource constraints. We recommend using the information below as a starting point to strategize your fine-tuning experiments.

Full fine-tuning vs. parameter-efficient fine-tuning (PEFT)
Both full fine-tuning and PEFT have shown improvements in downstream performance when applied to new domains, in both academic and practical settings. Choosing between them comes down to the compute available (in GPU hours and GPU memory), performance on tasks other than the target downstream task (the learning-forgetting tradeoff), and human annotation costs.
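To make the compute side of this tradeoff concrete, here is a minimal sketch comparing trainable parameter counts for full fine-tuning versus a LoRA-style low-rank adapter, one common PEFT method. The layer size and rank are hypothetical numbers chosen for illustration, not values from the post.

```python
# Sketch: trainable parameters for full fine-tuning vs. a LoRA-style adapter.
# All dimensions below are illustrative assumptions, not from the article.

def full_ft_params(d_in: int, d_out: int) -> int:
    # Full fine-tuning updates the entire weight matrix plus the bias.
    return d_in * d_out + d_out

def lora_params(d_in: int, d_out: int, rank: int) -> int:
    # LoRA freezes the base weight W and trains two low-rank factors,
    # A (d_in x r) and B (r x d_out); the effective weight is W + A @ B.
    return d_in * rank + rank * d_out

d = 4096  # hidden size of a hypothetical transformer layer
full = full_ft_params(d, d)
lora = lora_params(d, d, rank=8)
print(f"full: {full:,}  lora: {lora:,}  ratio: {lora / full:.4f}")
```

For this single square layer, the adapter trains roughly 0.4% of the parameters that full fine-tuning would, which is why PEFT fits on far less GPU memory; the price is less capacity to move the model away from its base behavior.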

To continue reading this article, click here.
