1 article in Training
Adapting pre-trained LLMs to specific tasks: instruction tuning, RLHF, LoRA, and QLoRA.