Hi community, you asked how much it costs to fine-tune (or align) a larger LLM such as Llama-3-70B. @abhi1thakur tweeted that on a single node (8x H100 GPUs) with Hugging Face's AutoTrain it took about 2.5 hours and cost about US$200 on NVIDIA DGX Cloud. That run used PEFT only, with no 4-bit quantization.

YAML config used: https://github.com/huggingface/autotrain-advanced/blob/main/configs/llm_finetuning/llama3-70b-orpo-v1.yml
How to install AutoTrain: https://github.com/huggingface/autotrain-advanced
See also the tweet: https://twitter.com/abhi1thakur/status/1786680791348506855
Detailed instructions for AutoTrain: https://huggingface.co/blog/train-dgx-cloud
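
If you want to sanity-check those numbers, here is a minimal sketch of the arithmetic. It uses only the approximate figures from the tweet (about US$200, about 2.5 hours, 8 GPUs on one node). The hourly rates it prints are implied by those figures, not official DGX Cloud pricing, and the variable names are just illustrative.

```python
# Approximate figures quoted in the post (not exact billing data).
total_cost_usd = 200.0    # reported total cost of the run
wall_clock_hours = 2.5    # reported training time
gpus_per_node = 8         # single node with 8x H100

# Rates implied by the figures above (derived, not a published price list).
node_hourly_rate = total_cost_usd / wall_clock_hours   # USD per node-hour
gpu_hourly_rate = node_hourly_rate / gpus_per_node     # USD per GPU-hour
gpu_hours = wall_clock_hours * gpus_per_node           # total GPU-hours used

print(f"implied node rate: ${node_hourly_rate:.2f}/hour")
print(f"implied GPU rate:  ${gpu_hourly_rate:.2f}/GPU-hour")
print(f"total GPU-hours:   {gpu_hours:.1f}")
```

So the run works out to roughly US$80 per node-hour, or about US$10 per H100-hour, over 20 GPU-hours. Swap in your own provider's rate and expected runtime to estimate a similar job.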