
The GGML file format hasn't been a thing for some time, and GGUF (its successor) has features such as "importance matrix" quantization, which is all about quantizing adaptively. Then there's all the stuff that Unsloth does, e.g.: https://unsloth.ai/blog/dynamic-v2
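
To make "quantizing adaptively" concrete, here is a rough toy sketch of the general idea behind importance-weighted quantization: pick the scale for a block of weights by minimizing the rounding error weighted by a per-weight importance score, instead of treating every weight equally. This is not llama.cpp's actual code; the function names, the 4-bit range, and the simple grid search over scales are all assumptions made up for illustration.

```python
def quantize(weights, scale, bits=4):
    """Round each weight to the nearest signed integer level and clamp."""
    qmax = 2 ** (bits - 1) - 1
    qmin = -qmax - 1
    return [max(qmin, min(qmax, round(w / scale))) for w in weights]


def weighted_error(weights, importance, scale, bits=4):
    """Importance-weighted squared reconstruction error for a given scale."""
    q = quantize(weights, scale, bits)
    return sum(i * (w - scale * v) ** 2
               for w, i, v in zip(weights, importance, q))


def best_scale(weights, importance, bits=4, steps=64):
    """Grid-search a scale around the plain absmax scale (hypothetical helper).

    Candidates run from 0.5x to 1.5x the absmax scale; the one with the
    lowest importance-weighted error wins.
    """
    qmax = 2 ** (bits - 1) - 1
    base = max(abs(w) for w in weights) / qmax
    if base == 0:
        return 1.0  # all-zero block: any positive scale reconstructs it exactly
    candidates = [base * (0.5 + k / steps) for k in range(steps + 1)]
    return min(candidates,
               key=lambda s: weighted_error(weights, importance, s, bits))


# Toy data: the importance scores stand in for activation statistics
# that would be collected from calibration text.
weights = [0.1, -0.2, 0.33, 1.0]
importance = [100.0, 1.0, 1.0, 1.0]
scale = best_scale(weights, importance)
```

With a large importance on the first weight, the search is free to pick a scale that represents that weight more precisely at the cost of the others, which is the "adaptive" part.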

