Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

vulkan: Extend rope fusions to allow mrope ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#18264 opened Dec 21, 2025 by jeffbolznv Loading…
ggml-cuda : refactor repetitive switch case statements in mmf ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#18260 opened Dec 21, 2025 by Aadeshveer Loading…
Add Gemma3n multimodal support with MobileNetV5 vision encoder examples model Model specific python python script changes
#18256 opened Dec 21, 2025 by simrnsingh Loading…
New quantization type: Q3_HIFI Apple Metal https://en.wikipedia.org/wiki/Metal_(API) documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs python python script changes SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related Vulkan Issues specific to the Vulkan backend
#18246 opened Dec 21, 2025 by geoffmunn Loading…
ggml rpc : Add missing check for rpc buffer type ggml changes relating to the ggml tensor library for machine learning
#18242 opened Dec 21, 2025 by struct Loading…
ggml-cpu: parallelize tensor repacking with OpenMP ggml changes relating to the ggml tensor library for machine learning
#18239 opened Dec 21, 2025 by pestopoppa Loading…
webui: Fix the header backdrop blur examples server
#18230 opened Dec 20, 2025 by ImadSaddik Loading…
ggml-metal: guard buffer map slicing Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#18225 opened Dec 20, 2025 by SzymonPrajs Loading…
ggml-metal: fix memset range and temp buffer leaks Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#18221 opened Dec 20, 2025 by SzymonPrajs Loading…
model: support nvidia/llama-embed-nemotron model Model specific python python script changes
#18220 opened Dec 20, 2025 by sfallah Draft
convert: rework ftype heuristics python python script changes
#18214 opened Dec 20, 2025 by taronaeo Loading…
ggml-metal: fix bf16/f16 matmul kernels Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#18210 opened Dec 20, 2025 by SzymonPrajs Loading…
Fix BLAS Compile Definitions ggml changes relating to the ggml tensor library for machine learning
#18205 opened Dec 19, 2025 by DaAwesomeP Loading…
HIP: Use mmq on MFMA devices for MUL_MAT_ID in cases where a lot of splits would be generated ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#18202 opened Dec 19, 2025 by IMbackK Loading…
llamafile: add rvv support for sgemm kernels ggml changes relating to the ggml tensor library for machine learning
#18199 opened Dec 19, 2025 by taimur-10x Loading…
cmake: Added more x86_64 CPU backends when building with GGML_CPU_ALL_VARIANTS=On ggml changes relating to the ggml tensor library for machine learning
#18186 opened Dec 18, 2025 by bberberov Draft
vulkan: Warptile tuning for Intel Xe2/Xe3 ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#18178 opened Dec 18, 2025 by virajwad Loading…
tool/ex/tests: consistently free ctx, then model examples testing Everything test related
#18168 opened Dec 18, 2025 by JohannesGaessler Loading…
ProTip! Follow long discussions with comments:>50.