sycl: unify unary kernels with a generic implementation and enable wide operator support #17213

shani-f · 2025-11-12T15:39:08Z

Summary

Adds a generic unary implementation for the SYCL backend, allowing many unary operators to share a single optimized execution path.
The implementation matches the behavior of the existing CPU unary kernels.

Changes

Added ggml_sycl_op_unary generic function
Updated unary dispatch in element_wise.cpp
Removed per-op SYCL kernels (ABS, SGN, NEG, STEP, etc.)
Updated documentation in:
- docs/ops.md
- docs/ops/SYCL.csv

Implementation

One templated kernel handles all unary ops
Supports 4-D tensors and non-contiguous views
Supports F16 and F32 data types
Uses dispatch_ggml_sycl_op_unary with parallel_for
Eliminates duplicated indexing logic across operators
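
For readers who have not seen the diff, the stride-based indexing described above can be sketched in plain C++. This is only an illustration, not the code from this PR: `View4D`, `OpAbs`, `OpNeg`, and `unary_generic` are hypothetical names, the loop stands in for the work-items a `parallel_for` would launch, and only a float-like `T` is exercised (F16 would need a half type).

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical stand-in for a tensor view: ne = elements per dimension,
// nb = stride per dimension in bytes (assumed layout, dimension 0 fastest).
struct View4D {
    int64_t ne[4];
    size_t  nb[4];
};

// Hypothetical op functors; the real backend may select ops differently.
struct OpAbs { template <typename T> T operator()(T x) const { return x < T(0) ? -x : x; } };
struct OpNeg { template <typename T> T operator()(T x) const { return -x; } };

// One templated routine shared by every unary op. Each flat index i plays
// the role of one work-item; here it is an ordinary sequential loop.
template <typename T, typename Op>
void unary_generic(const void * src, const View4D & sv,
                   void * dst, const View4D & dv, Op op) {
    for (int d = 0; d < 4; ++d) {
        assert(sv.ne[d] == dv.ne[d]); // source and destination shapes must match
    }
    const int64_t n = sv.ne[0] * sv.ne[1] * sv.ne[2] * sv.ne[3];
    for (int64_t i = 0; i < n; ++i) {
        // Decompose the flat index into 4-D coordinates.
        const size_t i0 = static_cast<size_t>(i % sv.ne[0]);
        const size_t i1 = static_cast<size_t>((i / sv.ne[0]) % sv.ne[1]);
        const size_t i2 = static_cast<size_t>((i / (sv.ne[0] * sv.ne[1])) % sv.ne[2]);
        const size_t i3 = static_cast<size_t>(i / (sv.ne[0] * sv.ne[1] * sv.ne[2]));
        // Byte offsets come from the strides, so non-contiguous views
        // (for example a transpose) need no special case.
        const size_t so = i0 * sv.nb[0] + i1 * sv.nb[1] + i2 * sv.nb[2] + i3 * sv.nb[3];
        const size_t dо = i0 * dv.nb[0] + i1 * dv.nb[1] + i2 * dv.nb[2] + i3 * dv.nb[3];
        const T x = *reinterpret_cast<const T *>(static_cast<const char *>(src) + so);
        *reinterpret_cast<T *>(static_cast<char *>(dst) + dо) = op(x);
    }
}
```

Because the op is a template parameter, adding another operator in this sketch means adding one functor while the indexing stays in a single place.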

Supported Ops

ABS
SGN
NEG
STEP
RELU
HARDSIGMOID
TANH
GELU
SILU
SIGMOID
HARDSWISH
GELU_QUICK
GELU_ERF
EXP
ELU
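
As a quick reference for what several of these ops compute per element, here is a sketch using the common textbook definitions. The function names are hypothetical, and the constants (the 3 and 6 in HARDSIGMOID, the 1.702 in GELU_QUICK) are assumptions that should be checked against the ggml CPU kernels rather than taken from this comment.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>

// Scalar reference formulas (assumed definitions, not copied from the PR).
inline float op_sgn(float x)         { return x > 0.0f ? 1.0f : (x < 0.0f ? -1.0f : 0.0f); }
inline float op_step(float x)        { return x > 0.0f ? 1.0f : 0.0f; }
inline float op_relu(float x)        { return std::max(x, 0.0f); }
inline float op_sigmoid(float x)     { return 1.0f / (1.0f + std::exp(-x)); }
inline float op_silu(float x)        { return x * op_sigmoid(x); }
inline float op_hardsigmoid(float x) { return std::min(1.0f, std::max(0.0f, (x + 3.0f) / 6.0f)); }
inline float op_hardswish(float x)   { return x * op_hardsigmoid(x); }
inline float op_elu(float x)         { return x > 0.0f ? x : std::expm1(x); }
// Exact GELU via the error function; 0.70710678 approximates 1/sqrt(2).
inline float op_gelu_erf(float x)    { return 0.5f * x * (1.0f + std::erf(x * 0.70710678f)); }
// Sigmoid-based GELU approximation with the commonly used 1.702 coefficient.
inline float op_gelu_quick(float x)  { return x * op_sigmoid(1.702f * x); }
```

Each function is a pure scalar map, which is what lets a single generic kernel apply any of them element by element.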

Testing

All supported unary ops pass test-backend-ops
Verified correctness on contiguous + non-contiguous tensors
Matches CPU results
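
To make "matches CPU results" concrete, a comparison of this kind can be sketched as a normalized mean squared error between a reference output and the backend output. This is an assumption about how such a check might look, not a copy of test-backend-ops: `nmse`, `matches_reference`, and the `1e-7` threshold are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Normalized mean squared error of `got` relative to the reference `ref`:
// sum((ref - got)^2) / sum(ref^2). Falls back to the raw squared error
// when the reference is all zeros, to avoid dividing by zero.
inline double nmse(const std::vector<float> & ref, const std::vector<float> & got) {
    assert(ref.size() == got.size());
    double err  = 0.0;
    double norm = 0.0;
    for (size_t i = 0; i < ref.size(); ++i) {
        const double d = static_cast<double>(ref[i]) - static_cast<double>(got[i]);
        err  += d * d;
        norm += static_cast<double>(ref[i]) * static_cast<double>(ref[i]);
    }
    return norm > 0.0 ? err / norm : err;
}

// Hypothetical pass/fail rule; the threshold is illustrative only.
inline bool matches_reference(const std::vector<float> & ref,
                              const std::vector<float> & got,
                              double max_nmse = 1e-7) {
    return nmse(ref, got) <= max_nmse;
}
```

A relative metric like this tolerates small floating-point differences between devices while still catching indexing mistakes, which usually produce large errors.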

Performance

Single optimized unary path for all ops
Reduced kernel count and maintenance complexity
Same SYCL scheduling style as existing ops

Compatibility

Works on OpenCL and Level Zero devices
No changes required for CPU fallback
Follows SYCL backend design conventions

shani-f · 2025-11-12T19:52:47Z

Hello @CISC @NeoZhangJianyu,
The ggml-ci-x64-cpu-low-perf test failed, but it’s CPU-only and unrelated to my SYCL changes.
What’s the next step toward merging?
Thanks!

NeoZhangJianyu

Good job!
Thank you!

NeoZhangJianyu · 2025-11-13T00:48:17Z

> Hello @CISC @NeoZhangJianyu, The ggml-ci-x64-cpu-low-perf test failed, but it’s CPU-only and unrelated to my SYCL changes. What’s the next step toward merging? Thanks!

We could merge directly after the conflicts are fixed.

NeoZhangJianyu · 2025-11-14T01:00:07Z

@shani-f
Could you fix the conflicts?

Thank you!

shani-f · 2025-11-14T12:28:25Z

Hello @NeoZhangJianyu,
Yesterday I resolved all conflicts and the branch was fully up to date.
The new conflicts came from the latest merge into master, and I’ve resolved those as well.
Thanks!

docs/ops/SYCL.csv

CISC · 2025-11-15T18:06:05Z

Regenerate the CSV; at least TOPK_MOE should no longer be there.

shani-f · 2025-11-15T18:42:21Z

Is everything correct and finalized now?

CISC · 2025-11-15T19:05:24Z

> Is everything correct and finalized now?

I really did mean that you should regenerate it; more ops in the CSV would be removed if you did.

shani-f · 2025-11-15T21:18:27Z

Hello @CISC,
I tried to regenerate the CSV, but I couldn’t find any mechanism in the current llama.cpp/ggml version that actually rebuilds SYCL.csv automatically.
If you want, I can reset both ops.md and SYCL.csv to their upstream versions and leave them as-is.

Let me know what you prefer.
Thanks!

CISC · 2025-11-15T21:42:53Z

> I tried to regenerate the CSV, but I couldn’t find any mechanism in the current llama.cpp/ggml version that actually rebuilds SYCL.csv automatically.

As explained in docs/ops.md, simply run `test-backend-ops support --output csv` from your build/bin folder.

llama.cpp/docs/ops.md, line 7 in 9ee5bdc:

1. Run `test-backend-ops support --output csv` with your backend name and redirect output to a csv file in `docs/ops/` (e.g., `docs/ops/CUDA.csv`)

shani-f · 2025-11-15T23:03:31Z

Hello @CISC,
I really hope it’s okay now.
Thank you so much for your help!

CISC · 2025-11-15T23:22:39Z

> Hello @CISC, I really hope it’s okay now. Thank you so much for your help!

Yes, perfect, thank you! :)

…ide operator support (ggml-org#17213)
* SYCL: add generic unary op implementation for multiple ops (ABS/SGN/…); unify non-contiguous access
* SYCL: update documentation and sycl.csv to reflect new unary op support
* update ops.md after syncing SYCL.csv changes
* Fix SYCL.csv merge conflict
* Update ops.md after fixing SYCL.csv conflicts
* Fix SYCL.csv tail after merge conflict and regenerate ops.md
* Fix line endings and final newline in SYCL.csv
* Remove TOPK_MOE entries from SYCL.csv as requested
* Update ops.md after removing TOPK_MOE from SYCL.csv
* Regenerated SYCL.csv and synced ops.md with upstream
* Update ops.md using create_ops_docs.py

shani-f added 3 commits November 11, 2025 16:51

SYCL: add generic unary op implementation for multiple ops (ABS/SGN/…); unify non-contiguous access

b23de74

SYCL: update documentation and sycl.csv to reflect new unary op support

371aaa2

Merge branch 'master' into feature/sycl-unary-generic

3cfa817

github-actions bot added the documentation, ggml, and SYCL labels Nov 12, 2025

DajanaV mentioned this pull request Nov 12, 2025

UPSTREAM PR #17213: sycl: unify unary kernels with a generic implementation and enable wide operator support auroralabs-loci/llama.cpp#180

Closed

update ops.md after syncing SYCL.csv changes

138d90b

NeoZhangJianyu approved these changes Nov 13, 2025

Resolve merge conflicts by regenerating ops docs

d6edac3

Merge remote-tracking branch 'upstream/master' into feature/sycl-unary-generic

b5a6f2f

CISC reviewed Nov 14, 2025

docs/ops/SYCL.csv (outdated, resolved)

Merge branch 'master' into feature/sycl-unary-generic

dfca19f

DajanaV mentioned this pull request Nov 15, 2025

UPSTREAM PR #17213: sycl: unify unary kernels with a generic implementation and enable wide operator support auroralabs-loci/llama.cpp#220

Open

shani-f added 2 commits November 15, 2025 19:19

Fix SYCL.csv merge conflict

baa8d59

Update ops.md after fixing SYCL.csv conflicts

29e44c9

CISC approved these changes Nov 15, 2025

CISC reviewed Nov 15, 2025

docs/ops/SYCL.csv (outdated, resolved)

shani-f added 2 commits November 15, 2025 19:42

Fix SYCL.csv tail after merge conflict and regenerate ops.md

d6d6701

Fix line endings and final newline in SYCL.csv

d512edf

shani-f added 2 commits November 15, 2025 20:22

Remove TOPK_MOE entries from SYCL.csv as requested

ed03c6a

Update ops.md after removing TOPK_MOE from SYCL.csv

9ee5bdc

shani-f added 2 commits November 16, 2025 00:49

Regenerated SYCL.csv and synced ops.md with upstream

1d9a831

Update ops.md using create_ops_docs.py

1ac8c68

CISC merged commit 72bd732 into ggml-org:master Nov 15, 2025
72 of 73 checks passed
