A new LLM quantization method, LoftQ (LoRA-Fine-Tuning-aware Quantization), by Georgia Tech and Microsoft, outperforms QLoRA.
A deep dive into the theory of the latest LLM quantization combined with Low-Rank Adaptation (LoRA) of high-precision weight tensors. LoftQ explained in simple terms.
All rights with authors:
https://arxiv.org/pdf/2310.08659.pdf
(please switch to the latest version, v3 in my case)
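The core idea in the paper is an alternating initialization: repeatedly quantize the residual W - AB and then take a rank-r SVD of W - Q to refresh the LoRA factors A, B. Here is a minimal NumPy sketch of that loop, assuming a simple per-tensor uniform min-max quantizer (the paper uses NF4-style quantization; `quantize`, `loftq_init`, and all parameters are illustrative, not the authors' implementation):

```python
import numpy as np

def quantize(w, bits=4):
    # Hypothetical uniform min-max quantizer (stand-in for NF4)
    lo, hi = w.min(), w.max()
    scale = (hi - lo) / (2**bits - 1) or 1.0
    return np.round((w - lo) / scale) * scale + lo

def loftq_init(W, rank=16, bits=4, steps=5):
    # Alternating optimization: quantize the residual, then
    # refit the low-rank factors via truncated SVD of W - Q.
    A = np.zeros((W.shape[0], rank))
    B = np.zeros((rank, W.shape[1]))
    for _ in range(steps):
        Q = quantize(W - A @ B, bits)            # quantize residual
        U, S, Vt = np.linalg.svd(W - Q, full_matrices=False)
        A = U[:, :rank] * S[:rank]               # rank-r factors of W - Q
        B = Vt[:rank]
    return Q, A, B

# Usage: Q + A @ B approximates W better than quantizing W alone,
# giving LoRA fine-tuning a better starting point.
W = np.random.randn(64, 64)
Q, A, B = loftq_init(W)
```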
#ai
#quantization
#memory