llama.cpp

shahondin1624/llama.cpp

Fork 0

master

CI / build-cmake-pkg (push) Successful in 15m57s

Details

CI / android-arm64 (push) Failing after 15s

Details

CI / ubuntu-latest-rpc (push) Failing after 13s

Details

CI / ubuntu-latest-cuda (push) Failing after 9s

Details

Release / android-arm64 (push) Failing after 34s

Details

Server / server (default) (push) Failing after 13s

Details

Server / server (backend-sampling) (push) Failing after 17s

Details

Server (self-hosted) / server-metal (GPUx2, backend-sampling) (push) Has been cancelled

Details

CI (self-hosted) / Determine tag name (push) Has been cancelled

Details

CI (self-hosted) / ggml-ci-nvidia-webgpu (push) Has been cancelled

Details

CI / macOS-latest-arm64 (push) Has been cancelled

Details

CI / macOS-latest-x64 (push) Has been cancelled

Details

CI / macOS-latest-arm64-webgpu (push) Has been cancelled

Details

CI / ubuntu-cpu (arm64, ubuntu-24.04-arm) (push) Has been cancelled

Details

CI / ubuntu-cpu (ppc64le, ubuntu-24.04-ppc64le) (push) Has been cancelled

Details

CI / ubuntu-cpu (s390x, ubuntu-24.04-s390x) (push) Has been cancelled

Details

CI / ubuntu-cpu (x64, ubuntu-22.04) (push) Has been cancelled

Details

CI / ubuntu-24-vulkan (arm64, ubuntu-24.04-arm) (push) Has been cancelled

Details

CI / ubuntu-24-vulkan (x64, ubuntu-24.04) (push) Has been cancelled

Details

CI / ubuntu-24-webgpu (push) Has been cancelled

Details

CI / ubuntu-24-webgpu-wasm (push) Has been cancelled

Details

CI / ubuntu-22-hip (push) Has been cancelled

Details

CI / ubuntu-22-musa (push) Has been cancelled

Details

CI / windows-latest (arm64, llvm-arm64, -G "Ninja Multi-Config" -D CMAKE_TOOLCHAIN_FILE=cmake/arm64-windows-llvm.cmake -DGGML_NATIVE=OFF -DLLAMA_BUILD_SERVER=ON) (push) Has been cancelled

Details

CI / windows-latest (arm64, llvm-arm64-opencl-adreno, -G "Ninja Multi-Config" -D CMAKE_TOOLCHAIN_FILE=cmake/arm64-windows-llvm.cmake -DCMAKE_PREFIX_PATH="$env:RUNNER_TEMP/opencl-arm64-release" -DGGML_OPENCL=ON -DGGML_OPENCL_USE_ADRENO_KERNELS=ON) (push) Has been cancelled

Details

CI / windows-latest (x64, cpu-x64 (static), -G "Ninja Multi-Config" -D CMAKE_TOOLCHAIN_FILE=cmake/x64-windows-llvm.cmake -DGGML_NATIVE=OFF -DLLAMA_BUILD_SERVER=ON -DGGML_RPC=ON -DBUILD_SHARED_LIBS=OFF) (push) Has been cancelled

Details

CI / windows-latest (x64, openblas-x64, -G "Ninja Multi-Config" -D CMAKE_TOOLCHAIN_FILE=cmake/x64-windows-llvm.cmake -DGGML_NATIVE=OFF -DLLAMA_BUILD_SERVER=ON -DGGML_RPC=ON -DGGML_BACKEND_DL=ON -DGGML_CPU_ALL_VARIANTS=ON -DGGML_OPENMP=OFF -DGGML_BLAS=ON -DG… (push) Has been cancelled

Details

CI / windows-latest (x64, vulkan-x64, -DCMAKE_BUILD_TYPE=Release -DGGML_NATIVE=OFF -DLLAMA_BUILD_SERVER=ON -DGGML_RPC=ON -DGGML_BACKEND_DL=ON -DGGML_CPU_ALL_VARIANTS=ON -DGGML_VULKAN=ON) (push) Has been cancelled

Details

CI / windows-2022-cuda (12.4) (push) Has been cancelled

Details

CI / windows-latest-hip (push) Has been cancelled

Details

CI / ubuntu-cpu-riscv64-native (push) Has been cancelled

Details

CI / ggml-ci-x64-cpu-low-perf (push) Has been cancelled

Details

CI / ggml-ci-arm64-cpu-low-perf (push) Has been cancelled

Details

CI / ggml-ci-x64-cpu-high-perf (push) Has been cancelled

Details

CI / ggml-ci-arm64-cpu-high-perf (push) Has been cancelled

Details

CI / ggml-ci-arm64-cpu-high-perf-sve (push) Has been cancelled

Details

CI / ggml-ci-arm64-cpu-kleidiai (push) Has been cancelled

Details

CI / ggml-ci-arm64-cpu-kleidiai-graviton4 (push) Has been cancelled

Details

Code Style Checker / model-naming (push) Has been cancelled

Details

EditorConfig Checker / editorconfig (push) Has been cancelled

Details

HIP quality check / ubuntu-22-hip-quality-check (push) Has been cancelled

Details

Release / macOS-cpu (arm64, arm64, -DGGML_METAL_USE_BF16=ON -DGGML_METAL_EMBED_LIBRARY=ON, macos-14) (push) Has been cancelled

Details

Release / macOS-cpu (arm64, arm64-kleidiai, -DGGML_METAL_USE_BF16=ON -DGGML_METAL_EMBED_LIBRARY=ON -DGGML_CPU_KLEIDIAI=ON, macos-14) (push) Has been cancelled

Details

Release / macOS-cpu (x64, x64, -DGGML_METAL=OFF -DCMAKE_OSX_DEPLOYMENT_TARGET=13.3, macos-15-intel) (push) Has been cancelled

Details

Release / ubuntu-cpu (arm64, ubuntu-24.04-arm) (push) Has been cancelled

Details

Release / ubuntu-cpu (s390x, ubuntu-24.04-s390x) (push) Has been cancelled

Details

Release / ubuntu-cpu (x64, ubuntu-22.04) (push) Has been cancelled

Details

Release / ubuntu-vulkan (arm64, ubuntu-24.04-arm) (push) Has been cancelled

Details

Release / ubuntu-vulkan (x64, ubuntu-22.04) (push) Has been cancelled

Details

Release / ubuntu-24-openvino (push) Has been cancelled

Details

Release / windows-cpu (arm64) (push) Has been cancelled

Details

Release / windows-cpu (x64) (push) Has been cancelled

Details

Release / windows (arm64, opencl-adreno, -G "Ninja Multi-Config" -D CMAKE_TOOLCHAIN_FILE=cmake/arm64-windows-llvm.cmake -DCMAKE_PREFIX_PATH="$env:RUNNER_TEMP/opencl-arm64-release" -DGGML_OPENCL=ON -DGGML_OPENCL_USE_ADRENO_KERNELS=ON, ggml-opencl) (push) Has been cancelled

Details

Release / windows (x64, vulkan, -DGGML_VULKAN=ON, ggml-vulkan) (push) Has been cancelled

Details

Release / windows-cuda (12.4) (push) Has been cancelled

Details

Release / windows-cuda (13.1) (push) Has been cancelled

Details

Release / windows-sycl (push) Has been cancelled

Details

Release / ubuntu-24-sycl (fp16, ON) (push) Has been cancelled

Details

Release / ubuntu-24-sycl (fp32, OFF) (push) Has been cancelled

Details

Release / ubuntu-22-rocm (7.2.1, x64, gfx908;gfx90a;gfx942;gfx1030;gfx1100;gfx1101;gfx1102;gfx1151;gfx1150;gfx1200;gfx1201) (push) Has been cancelled

Details

Release / windows-hip (gfx1150;gfx1151;gfx1200;gfx1201;gfx1100;gfx1101;gfx1102;gfx1030;gfx1031;gfx1032, radeon) (push) Has been cancelled

Details

Release / ios-xcode-build (push) Has been cancelled

Details

Release / openEuler-cann (aarch64, Release, 310p, off) (push) Has been cancelled

Details

Release / openEuler-cann (aarch64, Release, 910b, on) (push) Has been cancelled

Details

Release / openEuler-cann (x86, Release, 310p, off) (push) Has been cancelled

Details

Release / openEuler-cann (x86, Release, 910b, on) (push) Has been cancelled

Details

Server (self-hosted) / server-metal (GPUx2) (push) Has been cancelled

Details

Server (self-hosted) / server-metal (GPUx1) (push) Has been cancelled

Details

Server (self-hosted) / server-metal (GPUx1, backend-sampling) (push) Has been cancelled

Details

Server (self-hosted) / server-kleidiai (CPUx1, kleidiai) (push) Has been cancelled

Details

Server / server-windows (push) Has been cancelled

Details

CI (self-hosted) / ggml-ci-nvidia-cuda (push) Has been cancelled

Details

CI (self-hosted) / ggml-ci-nvidia-vulkan-cm (push) Has been cancelled

Details

CI (self-hosted) / ggml-ci-nvidia-vulkan-cm2 (push) Has been cancelled

Details

CI (self-hosted) / ggml-ci-mac-metal (push) Has been cancelled

Details

CI (self-hosted) / ggml-ci-mac-webgpu (push) Has been cancelled

Details

CI (self-hosted) / ggml-ci-mac-vulkan (push) Has been cancelled

Details

CI (self-hosted) / ggml-ci-linux-intel-vulkan (push) Has been cancelled

Details

CI (self-hosted) / ggml-ci-win-intel-vulkan (push) Has been cancelled

Details

CI (self-hosted) / ggml-ci-intel-openvino-gpu-low-perf (push) Has been cancelled

Details

Release / release (push) Has been cancelled

Details

Release / ui-publish (push) Has been cancelled

Details

4c66df50ca · hip: fix HIP graph capture crash for FA quantized KV f16 dequant · Updated 2026-05-27 08:08:12 +02:00

llm-build-context a8796f9609 · llm : cleanup + comments · Updated 2023-11-01 19:08:02 +01:00 shahondin1624	7790 4	ZIP TAR.GZ
llm-reuse-constants 7420bef83e · wip wip wip · Updated 2023-11-01 07:51:43 +01:00 shahondin1624	7790 1	ZIP TAR.GZ
llama-refactor afb3929279 · Merge branch 'master' into llama-refactor · Updated 2023-10-31 19:35:31 +01:00 shahondin1624	7792 21	ZIP TAR.GZ
test-mmv 29fe516913 · wip · Updated 2023-10-31 17:36:37 +01:00 shahondin1624	7793 1	ZIP TAR.GZ
deploy dab42893c9 · scripts : working curl pipe · Updated 2023-10-31 16:03:56 +01:00 shahondin1624	7793 3	ZIP TAR.GZ
llama-refactor-norm 7923b70cb8 · llama : add llm_build_inp_embd helper · Updated 2023-10-31 15:43:08 +01:00 shahondin1624	7798 37	ZIP TAR.GZ
ggml-impl 4b3cb98d46 · ggml-impl : move extern "C" to start of file · Updated 2023-10-30 18:05:58 +01:00 shahondin1624	7794 7	ZIP TAR.GZ
lto bc28aaa8c2 · make : use -lfto=auto to avoid warnings and maintain perf · Updated 2023-10-30 15:00:53 +01:00 shahondin1624	7794 5	ZIP TAR.GZ
scratch 15267192c0 · llama : refactor tensor offloading as callback · Updated 2023-10-29 12:04:36 +01:00 shahondin1624	7798 15	ZIP TAR.GZ
ggml-quants 8a86b95e87 · quantize : --pure option for disabling k-quant mixtures · Updated 2023-10-28 22:37:03 +02:00 shahondin1624	7799 3	ZIP TAR.GZ
apply-3585 de7e0912b6 · convert : ignore tokens if their IDs are within [0, vocab_size) · Updated 2023-10-28 14:01:36 +02:00 shahondin1624	7802 1	ZIP TAR.GZ
sampling-greedy-with-probs bbfc62ac2f · sampling : temp == 0.0 -> no probs, temp < 0.0 -> probs · Updated 2023-10-28 13:04:57 +02:00 shahondin1624	7810 3	ZIP TAR.GZ
cuda-multi-gpu cd3e20fb50 · cuda : fix multi-gpu with tensor cores · Updated 2023-10-27 22:11:50 +02:00 shahondin1624	7809 3	ZIP TAR.GZ
cuda-quantum-batch 49af767fad · build : add compile option to force use of MMQ kernels · Updated 2023-10-27 12:21:04 +02:00 shahondin1624	7811 7	ZIP TAR.GZ
cuda-batched-gemm d798a17c34 · cuda : add TODO for calling cublas from kernel + using mem pool · Updated 2023-10-24 15:33:24 +02:00 shahondin1624	7825 10	ZIP TAR.GZ
cuda-batched-gemm-deq 6966474928 · cuda : play with faster Q4_0 dequantization · Updated 2023-10-24 09:29:40 +02:00 shahondin1624	7825 8	ZIP TAR.GZ
upd-issue-templates b9bb4cbe86 · Separate bug and enhancement template + no default title · Updated 2023-10-23 17:59:11 +02:00 shahondin1624	7825 1	ZIP TAR.GZ
server-rev c0f4d54870 · server : add comment about changing slot_state to bool · Updated 2023-10-22 21:24:39 +02:00 shahondin1624	7831 72	ZIP TAR.GZ
perf-study cb79f8a2d8 · llama : add SKIP_KQ_KQV option · Updated 2023-10-22 08:58:29 +02:00 shahondin1624	7831 3	ZIP TAR.GZ
sampling-refactor 56ba00b923 · sampling : hide prev behind API and apply #3661 · Updated 2023-10-20 17:53:27 +02:00 shahondin1624	7834 6	ZIP TAR.GZ

... 24 25 26 27 28 ...

Default Branch

Branches