Fine-tuning Large Language Models (LLMs) | w/ Example Code



Want to learn more? I’m launching a 6-week live BootCamp for AI Builders. Learn more: …

36 Comments

  1. UPDATE: Someone pointed out that the fine-tuned model here is overfitting, so I created an improved example that uses transfer learning: https://youtu.be/4QHg8Ix8WWQ

    👉More on LLMs: https://www.youtube.com/playlist?list=PLz-ep5RbHosU2hnz5ejezwaYpdMutMVB0

    References
    [1] Deeplearning.ai Finetuning Large Language Models Short Course: https://www.deeplearning.ai/short-courses/finetuning-large-language-models/
    [2] arXiv:2005.14165 [cs.CL] (GPT-3 Paper)
    [3] arXiv:2303.18223 [cs.CL] (Survey of LLMs)
    [4] arXiv:2203.02155 [cs.CL] (InstructGPT paper)
    [5] PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware: https://huggingface.co/blog/peft
    [6] arXiv:2106.09685 [cs.CL] (LoRA paper)
    [7] Original dataset source — Andrew L. Maas, Raymond E. Daly, Peter T. Pham, Dan Huang, Andrew Y. Ng, and Christopher Potts. 2011. Learning Word Vectors for Sentiment Analysis. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pages 142–150, Portland, Oregon, USA. Association for Computational Linguistics.

  2. Thank you so much for this content, it's so helpful and clear!

    If I don't use LoRA when fine-tuning and instead just specify `Trainer(model=AutoModelForSequenceClassification.from_pretrained(model_checkpoint, num_labels=2))` (which is what Hugging Face does in most of its documentation), what type of parameter training is that performing? Retraining all of the parameters?
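
    Without a PEFT wrapper, `Trainer` does update every parameter (full fine-tuning), unless you freeze layers yourself. A rough, self-contained sketch of why LoRA is so much cheaper, counting trainable parameters for a single weight matrix (the sizes here are illustrative, not taken from any specific model):

    ```python
    d, k, r = 768, 768, 8          # weight matrix is d x k; LoRA rank r

    full_ft = d * k                # full fine-tuning: every entry is trainable
    lora = d * r + r * k           # LoRA: only the two low-rank factors train

    print(full_ft)                 # 589824
    print(lora)                    # 12288, ~2% of the full count
    ```

    Scaled up across all the weight matrices of a billion-parameter model, that ratio is why LoRA fits on consumer GPUs.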

  3. Thank you, this is a nice video and you give a clear explanation. I tried to do this with a GPT-Neo model (EleutherAI/gpt-neo-1.3B), but during training the training loss always shows no log values and the validation loss is always NaN (with BERT or DistilBERT it runs perfectly). Do you have any suggestions or reading resources to fix this?

  4. Hey, it was good, but did your model also take a lot of time to fine-tune when you plugged everything into the Trainer class? For me it's so slow that I lost track of how long it's been taking; 10 epochs are training at roughly 0.05 it/s.

  5. Why is the truncation side set to the left? What is the difference between choosing "right" or "left"? I'm asking in a general sense, not just for the example shown here. Can anyone provide some intuition or good reading references?
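
    In Hugging Face tokenizers this is controlled by the tokenizer's `truncation_side` attribute. Intuitively, "right" keeps the beginning of the text and drops the end, while "left" keeps the end and drops the beginning — useful when the most recent tokens matter most (e.g. the tail of a long prompt, or a review's concluding sentence). A toy sketch of the two behaviors on a plain token list, with no tokenizer dependency:

    ```python
    tokens = ["t0", "t1", "t2", "t3", "t4", "t5"]
    max_length = 4

    right_truncated = tokens[:max_length]    # truncation_side="right": drop the end
    left_truncated = tokens[-max_length:]    # truncation_side="left": drop the start

    print(right_truncated)   # ['t0', 't1', 't2', 't3']
    print(left_truncated)    # ['t2', 't3', 't4', 't5']
    ```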

  6. I needed to know how parameter-efficient fine-tuning works to fine-tune a voice encoder for an emotion detection task. This video helped me a lot. I used LoRA for it. Thanks ❤
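
    For anyone curious what "using LoRA" means numerically: the frozen pretrained weight W gets a trainable low-rank update B·A, and B is initialized to zero so the adapted model starts out identical to the pretrained one. A minimal NumPy sketch of the forward pass (shapes and initialization follow the LoRA paper [6]; this is an illustration, not the `peft` implementation):

    ```python
    import numpy as np

    d, k, r = 6, 4, 2                   # weight is d x k, adapter rank r
    rng = np.random.default_rng(0)

    W = rng.standard_normal((d, k))     # frozen pretrained weight
    A = rng.standard_normal((r, k))     # trainable, Gaussian init
    B = np.zeros((d, r))                # trainable, zero init

    x = rng.standard_normal(k)
    h_pretrained = W @ x
    h_adapted = W @ x + B @ (A @ x)     # LoRA forward pass

    # Zero-initialized B makes the adapter a no-op before training begins:
    print(np.allclose(h_pretrained, h_adapted))   # True
    ```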

  7. Amazing video!!! I have one question: when I use your code to fine-tune the model with my own dataset, the dataset is so large that just reading it causes a memory error (not GPU memory). What should I do to avoid this issue? Can I read and fine-tune in small batches?
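
    When the raw dataset doesn't fit in RAM, the usual fix is to stream it instead of loading it all at once — with the Hugging Face `datasets` library, `load_dataset(..., streaming=True)` returns an iterable dataset that reads examples lazily. The underlying idea is just chunked iteration, sketched here in plain Python (a hypothetical helper, not part of any library):

    ```python
    from itertools import islice

    def batched(examples, batch_size):
        """Yield lists of up to batch_size items without materializing everything."""
        it = iter(examples)
        while True:
            batch = list(islice(it, batch_size))
            if not batch:
                return
            yield batch

    # Works over any lazy iterable, e.g. a generator reading a file line by line:
    stream = (f"example {i}" for i in range(5))
    for batch in batched(stream, 2):
        print(batch)
    # ['example 0', 'example 1']
    # ['example 2', 'example 3']
    # ['example 4']
    ```

    Because the source is a generator, only one batch is ever held in memory at a time.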

  8. I want to use an LLM for sentiment analysis and text classification, and I require a very specific JSON output structure for further processing and storage… would this be possible via fine-tuning? Could you (or anyone) point me in the right direction?
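
    Yes — one common approach is to fine-tune on prompt/completion pairs where every target completion is a serialized instance of your schema, then validate the model's output with `json.loads` at inference time. A minimal sketch of building such training examples (the schema and prompt format here are hypothetical; adapt them to your pipeline):

    ```python
    import json

    def make_example(text, sentiment, topics):
        # Hypothetical target schema: {"sentiment": ..., "topics": [...]}
        target = {"sentiment": sentiment, "topics": topics}
        return {
            "prompt": f"Classify the following text and answer in JSON.\nText: {text}\nJSON:",
            "completion": json.dumps(target),
        }

    ex = make_example("Great screen, terrible battery life.", "mixed", ["hardware"])
    parsed = json.loads(ex["completion"])      # targets are guaranteed-valid JSON
    print(parsed["sentiment"])                 # mixed
    ```

    Serializing the targets with `json.dumps` (rather than hand-writing strings) guarantees every training label is well-formed JSON, which is what teaches the model to stay inside the schema.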

  9. Thanks for these great videos. I really love the fact that you focus on building an intuitive understanding as opposed to throwing jargon around. Could you please start a LangChain series?
