Few-shot prompt templates can produce pretty good results, so why go to the trouble of spending expensive GPU hours fine-tuning an LLM to accomplish essentially the same thing?
It basically comes down to data: how much do you have?
If the only data you have available is what you can personally type yourself, you probably don't have enough to fine-tune an LLM. In that case, few-shot prompt templates will give you the best results.
On the other hand, if you can assemble thousands of example prompt/response pairs, fine-tuning lets you "bake" all of that knowledge into your LLM so that every query benefits from all of it. With a few-shot template you can only pass a handful of those examples to the LLM on each query, so if you need to cover a wide vocabulary or many different conversation topics, fine-tuning is the way to go.
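The difference in how the data is used can be sketched in a few lines. This is a minimal illustration with made-up example pairs: the few-shot path can only squeeze `k` examples into each query's prompt, while the fine-tuning path writes every pair into a training file. The JSONL layout shown is an assumption mirroring common fine-tuning data formats; your provider's exact schema may differ.

```python
import json

# Hypothetical prompt/response pairs; in practice you would have thousands.
examples = [
    {"prompt": "Translate to French: cat", "response": "chat"},
    {"prompt": "Translate to French: dog", "response": "chien"},
    {"prompt": "Translate to French: bird", "response": "oiseau"},
]

def few_shot_prompt(examples, query, k=2):
    """Few-shot: only k examples fit into each query's context window."""
    shots = "\n\n".join(
        f"Q: {ex['prompt']}\nA: {ex['response']}" for ex in examples[:k]
    )
    return f"{shots}\n\nQ: {query}\nA:"

def to_finetune_jsonl(examples):
    """Fine-tuning: every pair is written to the training file (JSONL)."""
    return "\n".join(json.dumps(ex) for ex in examples)

# The few-shot prompt carries just 2 of the 3 pairs per query...
print(few_shot_prompt(examples, "Translate to French: horse"))
# ...while the fine-tuning file contains all of them.
print(to_finetune_jsonl(examples))
```

The trade-off is visible here even at toy scale: every pair included few-shot consumes context tokens on every single query, whereas fine-tuning pays the cost once, up front, during training.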