Importance Matrix calculation by ikawrakow · Pull Request #4861 · ggerganov/llama.cpp

TL;DR: This PR adds the ability to compute an "importance matrix" to llama.cpp; the matrix can later be used for model quantization. The resulting matrix is much simpler and smaller than what is com...
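
As a rough illustration of the general idea only: the summary above does not spell out the computation, so this sketch assumes that an importance value is the mean squared activation of each input column, collected while running calibration data through the model, and that quantization then prefers candidates with a lower importance-weighted squared error. The function names are hypothetical and are not llama.cpp APIs.

```python
def importance_from_activations(activations):
    """Assumed definition: mean of squared activations per input column.

    `activations` is a list of rows, one row per calibration token,
    each row holding one activation value per input column.
    """
    n_rows = len(activations)
    n_cols = len(activations[0])
    sums = [0.0] * n_cols
    for row in activations:
        for j, a in enumerate(row):
            sums[j] += a * a
    return [s / n_rows for s in sums]


def weighted_error(weights, quantized, importance):
    """Importance-weighted squared error between original and quantized weights."""
    return sum(
        imp * (w - q) ** 2
        for w, q, imp in zip(weights, quantized, importance)
    )


# Two calibration tokens, two input columns (made-up numbers).
importance = importance_from_activations([[1.0, 2.0], [3.0, 0.0]])

# Compare two hypothetical quantized versions of the same weight row.
row = [1.0, 2.0]
err_a = weighted_error(row, [0.5, 2.0], importance)  # error on the important column
err_b = weighted_error(row, [1.0, 2.5], importance)  # error on the less important column
```

Under these assumptions, an error of the same size costs more on a column that sees large activations, so the second candidate would be preferred.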