Discussion about this post

User's avatar
Nathan Lambert's avatar

Not enough people talking about this obvious fact that most people shouldn't bother training models (I very cursoryily skimmed mostly the title). It was the TLDR of my talk on "post training for applications" https://www.youtube.com/watch?v=grpc-Wyy-Zg

Most people should just not do it!

Maria Sukhareva's avatar

Correct. When you are fine-tuning in the sense if you decide to change all weights, you are risking to break the model.

PEFT types of fine-tuning are much safer like Lora, it’s still fine-tuning but it changes a subset of weights and it works great.

I would not say RAG though is a way to bring new knowledge to LLMs.

Even in a RAG setup, the assembled prompt, user question plus retrieved passages is still just a query to the model. The model isn’t learning new information; it’s interpreting the concatenated text through its existing patterns.

If your domain is too far away from what a LLM saw, RAG won’t help, it will spiral in hallucinations

14 more comments...

No posts

Ready for more?