What is the purpose of this API? Do I need to use it when running a quantized GGUF model? Thanks
What is the purpose of this API? Do I need to use it when running a quantized GGUF model? Thanks