I’ve been looking into self-hosting LLMs or Stable Diffusion models using something like LocalAI and/or Ollama, with LibreChat as the frontend.
Some questions to get a nice discussion going:
- Any of you have experience with this?
- What are your motivations?
- What are you using in terms of hardware?
- Considerations regarding energy efficiency and associated costs?
- What about renting a GPU? Privacy implications?
Is that 128GB of VRAM? Because normal RAM doesn’t matter unless you want to run the model on the CPU, which is much slower.
That’s 128GB of RAM; the GPU has 24GB of VRAM. Ollama has gotten pretty smart with resource allocation: smaller models fit solely in VRAM, but I can still run larger models by spilling over into RAM.
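For anyone wondering what "fits", here's a rough back-of-envelope sketch. The bytes-per-weight figures are approximations I'm assuming for common quantizations, and real usage is higher once you add KV cache, context length, and runtime overhead, so treat it as a sanity check rather than a guarantee:

```python
# Rough estimate: do a model's weights fit in VRAM, or will they
# spill over into system RAM (running partly on CPU, much slower)?
# Bytes-per-weight values are approximations, not exact figures.

BYTES_PER_WEIGHT = {
    "fp16": 2.0,
    "q8_0": 1.0,
    "q4": 0.55,  # ~4.5 bits/weight on average for 4-bit quants (approx.)
}

def fits_in_vram(params_billions: float, quant: str,
                 vram_gb: float = 24, overhead_gb: float = 2) -> bool:
    """True if the quantized weights plus a flat overhead fit in VRAM."""
    weights_gb = params_billions * 1e9 * BYTES_PER_WEIGHT[quant] / 1024**3
    return weights_gb + overhead_gb <= vram_gb

print(fits_in_vram(7, "q4"))    # 7B at ~4-bit: comfortably in 24GB VRAM
print(fits_in_vram(70, "q4"))   # 70B at ~4-bit: spills into system RAM
```

On my setup that matches what I see in practice: 7B-class models stay fully on the GPU, while 70B-class ones get split between VRAM and RAM.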