-
Notifications
You must be signed in to change notification settings - Fork 15.9k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Send reasoning content back to the model across turns via the reasoning_content API field
examples
server
#21036
opened Mar 26, 2026 by
ServeurpersoCom
Loading…
rpc : proper handling of data pointers to CPU buffers
ggml
changes relating to the ggml tensor library for machine learning
#21030
opened Mar 26, 2026 by
rgerganov
Loading…
vulkan: add FA dequant for q4_1, q5_0, q5_1, iq4_nl
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#21029
opened Mar 26, 2026 by
mkoker
Loading…
fix: mtmd "v.patch_embd" quant and unsupported im2col ops on Metal for deepseek-ocr
python
python script changes
#21027
opened Mar 26, 2026 by
sfallah
Loading…
WebUI: Replace illegal nested button elements
examples
server
#21026
opened Mar 26, 2026 by
bluemoehre
Loading…
Vulkan Q4_0 Repack PoC
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
hexagon: support for IQ4_NL and MXFP4
ggml
changes relating to the ggml tensor library for machine learning
Hexagon
#21018
opened Mar 26, 2026 by
njsyw1997
Loading…
grammar: increase MAX_REPETITION_THRESHOLD + make it configurable via envvar
testing
Everything test related
#21003
opened Mar 25, 2026 by
pwilkin
Loading…
ci: introduce audits for self-hosted runners
devops
improvements to build systems and github actions
webui: Improve Chat Messages initial scroll + auto-scroll logic + add lazy loading with transitions to content blocks
examples
server
#20999
opened Mar 25, 2026 by
allozaur
Loading…
CUDA: Add Flash Attention Support for Head Dimension 512
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
#20998
opened Mar 25, 2026 by
anavp-nvidia
Loading…
llama-model-loader: use pinned memory for tensor overrides
#20978
opened Mar 25, 2026 by
am17an
Loading…
Install libraries into LLAMA_LIB_INSTALL_DIR
#20966
opened Mar 25, 2026 by
WhyNotHugo
Loading…
2 tasks done
Previous Next
ProTip!
no:milestone will show everything without a milestone.