We’re not breaking ground on AI innovation (in fact, we’re using an old, “deprecated” file format from a whole six months ago)
The ggml format isn’t “deprecated” it’s completely dead. In those 6 months we’ve also seen 2-4x speedups on some systems, not to mention improved accuracy via kquants. I don’t know why they would build out a new extension with such an ancient dependency.
The ggml format isn’t “deprecated” it’s completely dead. In those 6 months we’ve also seen 2-4x speedups on some systems, not to mention improved accuracy via kquants. I don’t know why they would build out a new extension with such an ancient dependency.