Neat! I'm already using openwebui/ollama with a 7900 xtx but the STT and TTS par...

dankwizard · 2025-05-06T01:09:34 1746493774

I've given up trying to run LLMs locally on AMD.

lhl · 2025-05-06T04:37:12 1746506232

Basically anything based on llama.cpp (Vulkan backend) should work out of the box without much fuss (LM Studio, Ollama, etc.).

The HIP backend can give a big prefill speed boost on some architectures (high-end RDNA3, for example). For everything else, I keep notes here: https://llm-tracker.info/howto/AMD-GPUs