Distilling step-by-step: Outperforming larger language models with less training data and smaller model sizes

blog.research.google

Distilling step-by-step: Outperforming larger language models with less training data and smaller model sizes

blog.research.google

Chrüsimüsi@feddit.ch to

Learn Machine Learning@sh.itjust.works · 10 months ago

Large language models (LLMs) are data-efficient but their size makes them difficult to deploy in real-world scenarios.

“Distilling Step-by-Step” is a new method introduced by Google researchers that enables smaller models to outperform LLMs using less training data. This method extracts natural language rationales from LLMs, which provide intermediate reasoning steps, and uses these rationales to train smaller models more efficiently.

In experiments, the distilling step-by-step method consistently outperformed LLMs and standard training approaches, offering both reduced model size and reduced training data requirements.

You must log in or register to comment.

Chat

Learn Machine Learning@sh.itjust.works

learnmachinelearning@sh.itjust.works

Create a post

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !learnmachinelearning@sh.itjust.works

Welcome! This is a place for people to learn more about machine learning techniques, discuss applications and ask questions.

Example questions:

“Should I use a deep neural network for my audio classification task?”
“I’m working with a small dataset, what can I do to make my model generalize well?”
“Is there a library available that implements function X in language Y?”
“I want to learn more about the math behind machine learning technique A, where should I start?”

Please do:

Be kind to new people
Post guides and tutorials that you find helpful
Link to open/free sources instead of paywalled when possible

Please don’t:

Post news articles / memes (there are other machine learning/AI communities for this)

Other communities in this area:

Similar subreddits: r/MLquestions, r/askmachinelearning, r/learnmachinelearning

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

1 user / day
1 user / week
1 user / month
11 users / 6 months
20 local subscribers
484 subscribers
59 Posts
24 Comments
Modlog