ViewTube

Skip

Recommended videos

code_your_own_AI

14:15

New LLM-Quantization LoftQ outperforms QLoRA

4,398 views

6 months ago

Pinecone

43:31

What is a vector database? Why are they critical infrastructure for #ai #applications?

10,877 views

6 months ago

code_your_own_AI

30:50

New Discovery: Retrieval Heads for Long Context

1,716 views

1 day ago

Andrej Karpathy

59:48

[1hr Talk] Intro to Large Language Models

1,816,293 views

5 months ago

After RAG, Vector & GPT Store: NEW AI Breakthrough UNFOLDS

6,382 views

257

code_your_own_AI

32.4K subscribers

Sat, 18 Nov 2023 00:00:00 GMT

Tags

artificial intelligence

AI models

LLM

VLM

VLA

Multi-modal model

explanatory video

RAG

multi-AI

multi-agent

Fine-tune

Pre-train

RLHF

Retrieval Augmented Generation (RAG), Retrieval Augmented Language Models (RALM), and Vector Stores are a thing of the past. A NEW AI Breakthrough UNFOLDS in this explanatory video on the latest insight in AI. The Next Evolutionary step promises to be amazing. And a beautiful solution to all our current shortcomings (RAG, Vector Store) and AI problems. Supported by an amazing compute optimization for CUDA Kernels, Tensor parallelism, Unified Paging (unified memory pool for LoRA Adapter weight tensors and KV cache) and minimize latency when batching different LoRA Adapters w/ NEW S-LoRA (by Stanford Univ, UC Berkeley, ..). Literature: S-LoRA: Serving Thousands of Concurrent LoRA Adapters https://arxiv.org/abs/2311.03285 #future #challenge #ai

ViewTube

Recommended videos

After RAG, Vector & GPT Store: NEW AI Breakthrough UNFOLDS

32 Comments