
Conversation

@catap (Contributor) commented Dec 13, 2025

No description provided.

endif()
# see https://github.com/ggerganov/ggml/pull/682
add_definitions(-DGGML_MAX_NAME=128)

@wbruna (Contributor)

Why are you undoing f7f05fb?

@catap (Contributor, Author)

Because, as @leejet pointed out in ggml-org/ggml#682, it should be increased from the default of 64 to 128.

@wbruna (Contributor)

You are pointing to a ggml PR that was included in sd.cpp more than a year before f7f05fb.

I suggest you carefully read that commit and its associated PR. If you got a build error because of it, it's working as intended: it's preventing you from linking against a ggml library built with an incompatible value, which would appear to work but give you a broken binary.
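
For context, GGML_MAX_NAME sizes the fixed name[] buffer embedded in struct ggml_tensor, so the value is baked into ggml's ABI. A simplified sketch of the layout dependency (not the actual ggml declaration, and not the check f7f05fb added):

// Sketch only: ggml.h embeds the tensor name inline, roughly like this
// (most fields omitted):
struct ggml_tensor_sketch {
    // ... type, ne[], nb[], op fields ...
    char name[GGML_MAX_NAME]; // 64 by default, 128 with -DGGML_MAX_NAME=128
    // ... every field after `name` shifts if the application headers and
    //     the ggml library disagree on GGML_MAX_NAME ...
};
// Such a mismatch still links cleanly; only accesses past `name` land on
// the wrong bytes at runtime, which is exactly the "appears to work, but
// broken binary" failure mode described above.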

if (mask_pad > 0) {
    mask_in = ggml_pad(ctx, mask_in, 0, mask_pad, 0, 0);
}
mask_in = ggml_cast(ctx, mask_in, GGML_TYPE_F16);

@wbruna (Contributor)

And what's the reason for simply removing the padding here?

@catap (Contributor, Author)

Because it was simply removed from ggml :)

See: ggml-org/llama.cpp#17910

@wbruna (Contributor)

Ah, I see.

But you are assuming we'll only need to build against the new version once it gets updated. Most of the time that's inevitable, due to an incompatible API, but being able to alternate between versions is very useful for tracking down bugs introduced by the version change. And in this case, we can. I suggest doing something like this:

diff --git a/ggml_extend.hpp b/ggml_extend.hpp
index 92dd3b8..1d32f0e 100644
--- a/ggml_extend.hpp
+++ b/ggml_extend.hpp
@@ -1268,6 +1268,9 @@ __STATIC_INLINE__ struct ggml_tensor* ggml_ext_attention_ext(struct ggml_context
         }
 
         if (mask_in != nullptr) {
+            // the need for padding got removed in ggml 4767bda
+            // ensure we can still use the old version for now
+            #ifdef GGML_KQ_MASK_PAD
             int mask_pad = 0;
             if (mask_in->ne[1] % GGML_KQ_MASK_PAD != 0) {
                 mask_pad = GGML_PAD(L_q, GGML_KQ_MASK_PAD) - mask_in->ne[1];
@@ -1275,6 +1278,7 @@ __STATIC_INLINE__ struct ggml_tensor* ggml_ext_attention_ext(struct ggml_context
             if (mask_pad > 0) {
                 mask_in = ggml_pad(ctx, mask_in, 0, mask_pad, 0, 0);
             }
+            #endif
             mask_in = ggml_cast(ctx, mask_in, GGML_TYPE_F16);
         }
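
As a worked example of the padding arithmetic the #ifdef branch preserves (illustrative values only; GGML_KQ_MASK_PAD's real value comes from the pre-removal ggml headers):

// GGML_PAD as defined in ggml.h rounds x up to a multiple of n (n a power of two):
#define GGML_PAD(x, n) (((x) + (n) - 1) & ~((n) - 1))
// With a hypothetical L_q == mask_in->ne[1] == 77 and a pad multiple of 32:
static_assert(GGML_PAD(77, 32) == 96, "77 mask rows round up to 96");
// so mask_pad == 96 - 77 == 19 extra mask rows before the cast to F16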

@catap (Contributor, Author) commented Dec 13, 2025

@wbruna thanks for your feedback; I've incorporated everything.

@Green-Sky (Contributor)

Has anyone looked at the performance of flash attention yet?

@wbruna (Contributor) commented Dec 13, 2025

For me, SDXL 1024x1024 got slightly faster: something like 2% on Vulkan and 4% on ROCm (compared with the results from the ggml_ext_chunk PR).

@leejet merged commit 614f873 into leejet:master on Dec 13, 2025 (9 checks passed).
@catap deleted the ggml-sync branch on December 13, 2025 at 17:34.