[RVV] add rvv f32 kernels for vcos,vexp,vlog,vsigmoid,vsin,vtanh by ken-unger · Pull Request #9926 · google/XNNPACK

ken-unger · 2026-04-09T05:07:58Z

Add rvv kernels for f32-vcos, f32-vexp, f32-vlog, f32-vsigmoid, f32-vsin, f32-vtanh.

Most of this is a simple translation of the simd versions to the rvv implementation.

Tested on qemu & bpi-f3.

Results on bpi-f32 running operator-unary-bench. Generally a ~10x improvement over the previous scalar version.

// Previous (scalar)
xnnpack_cosine_f32/N:3840/real_time                       250815 ns
xnnpack_cosine_f32/N:32640/real_time                     2133473 ns
xnnpack_exp_f32/N:3840/real_time                           99876 ns 
xnnpack_exp_f32/N:32640/real_time                         844337 ns
xnnpack_log_f32/N:3840/real_time                          214770 ns
xnnpack_log_f32/N:32640/real_time                        1821376 ns
xnnpack_sigmoid_f32/N:3840/real_time                      130601 ns
xnnpack_sigmoid_f32/N:32640/real_time                    1130085 ns
xnnpack_sine_f32/N:3840/real_time                         241881 ns
xnnpack_sine_f32/N:32640/real_time                       2051720 ns
xnnpack_tanh_f32/N:3840/real_time                         183325 ns
xnnpack_tanh_f32/N:32640/real_time                       1563040 ns

// New
xnnpack_cosine_f32/N:3840/real_time                        22647 ns
xnnpack_cosine_f32/N:32640/real_time                      192981 ns
xnnpack_exp_f32/N:3840/real_time                           18143 ns
xnnpack_exp_f32/N:32640/real_time                         154458 ns 
xnnpack_log_f32/N:3840/real_time                           21046 ns
xnnpack_log_f32/N:32640/real_time                         178911 ns
xnnpack_sigmoid_f32/N:3840/real_time                       20727 ns
xnnpack_sigmoid_f32/N:32640/real_time                     211814 ns
xnnpack_sine_f32/N:3840/real_time                          22310 ns
xnnpack_sine_f32/N:32640/real_time                        190303 ns
xnnpack_tanh_f32/N:3840/real_time                          19419 ns
xnnpack_tanh_f32/N:32640/real_time                        165337 ns

…f32-vtanh

ken-unger · 2026-04-09T05:15:20Z

src/f32-vcos/gen/f32-vcos-rvv-rational-5-4-div-u1v.c

+    vp = __riscv_vfmul(vx, vp, vl);
+
+    // Evaluate the denominator polynomial q.
+    vfloat32m1_t vq = __riscv_vfadd(__riscv_vfmul(vx2, vbeta_4, vl), vbeta_2, vl);


Unfortunately RVV doesn't have a vfmadd that allows for addition of a scalar, which would have been useful here and many other places in this PR. One can only add a vector, but then you waste vector registers. So, as a result I've kept this as vfadd/vfmul allowing this (and similarly other kernels) to get to LMUL=8.

ken-unger · 2026-04-09T05:17:48Z

src/f32-vlog/gen/f32-vlog-rvv-rational-3-3-div-u1v.c

+#include "src/xnnpack/common.h"
+#include "src/xnnpack/microparams.h"
+#include "src/xnnpack/vunary.h"
+#include "src/xnnpack/simd/f32-scalar.h" // xnn_f32_i32_t


I considered moving xnn_f32_i32_t somewhere common ... but wasn't sure the right place, so left it where it was.

ken-unger · 2026-04-09T05:20:10Z

@fbarchard and @dsharletg please review when you are able. Thank you.

add rvv kernels for f32-vcos,f32-vexp,f32-vlog,f32-vsigmoid,f32-vsin,…

1babb40

…f32-vtanh

ken-unger commented Apr 9, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RVV] add rvv f32 kernels for vcos,vexp,vlog,vsigmoid,vsin,vtanh#9926

[RVV] add rvv f32 kernels for vcos,vexp,vlog,vsigmoid,vsin,vtanh#9926
ken-unger wants to merge 1 commit intogoogle:masterfrom
ken-unger:unary-trig-rvv

ken-unger commented Apr 9, 2026

Uh oh!

ken-unger Apr 9, 2026

Uh oh!

ken-unger Apr 9, 2026

Uh oh!

ken-unger commented Apr 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ken-unger commented Apr 9, 2026

Uh oh!

ken-unger Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

ken-unger Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

ken-unger commented Apr 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant