This function, for example,
#define SIMDPP_ARCH_X86_AVX2
#include <simdpp/simd.h>
void sum(double* out, double const* lhs, double const* rhs) {
    using vec_t = simdpp::float64<1>;
    auto l = simdpp::load_u<vec_t>(lhs);
    auto r = simdpp::load_u<vec_t>(rhs);
    simdpp::store_u(out, l + r);
}
will load and write 4 doubles instead of a single one, which may result in an unexpected buffer overflow. Is this the intended behavior?
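If it is intended, one possible workaround on the caller's side is to stage the single value in padded local buffers, so the 4-wide access never touches memory past the caller's storage. The following is only a rough sketch of that idea in plain C++: `add4`, `sum_one`, and `kLanes` are hypothetical names, and `add4` is merely a stand-in for a 4-lane vector add, not part of the library.

```cpp
#include <array>
#include <cstddef>

// Assumption: the vector operation always touches 4 doubles, as observed above.
constexpr std::size_t kLanes = 4;

// Hypothetical stand-in for a 4-lane add; reads and writes kLanes doubles.
void add4(double* out, const double* lhs, const double* rhs) {
    for (std::size_t i = 0; i < kLanes; ++i) {
        out[i] = lhs[i] + rhs[i];
    }
}

// Adds exactly one double. Operands are copied into zero-initialized,
// 4-element local buffers, so the wide operation stays within bounds.
void sum_one(double* out, const double* lhs, const double* rhs) {
    std::array<double, kLanes> l{};
    std::array<double, kLanes> r{};
    std::array<double, kLanes> o{};
    l[0] = *lhs;
    r[0] = *rhs;
    add4(o.data(), l.data(), r.data());
    *out = o[0];  // only the first lane is written back
}
```

With this pattern only `out[0]` is modified, at the cost of extra copies through the local buffers.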