[Discussion] the bit unpacking performance #11000
zombee0
started this conversation in
Show and tell
Replies: 2 comments 8 replies
-
I just reran the benchmark, and arrow is only slightly faster (< 13%) in |
6 replies
-
Another thing I observed is that the input data size matters: the arrow algorithm may do well when the input is long enough; otherwise the bmi2 version would be faster |
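One way to check the input-size effect is to time the same unpacking routine over several input lengths and compare nanoseconds per value. The sketch below is only a portable scalar baseline, not the arrow or bmi2 kernel from this thread; `packScalar`, `unpackScalar`, and `nanosPerValue` are hypothetical names, and the LSB-first packing layout is an assumption.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: pack each value into `bitWidth` bits, LSB-first.
std::vector<uint8_t> packScalar(const std::vector<uint32_t>& values, unsigned bitWidth) {
  assert(bitWidth >= 1 && bitWidth <= 32);
  std::vector<uint8_t> packed((values.size() * bitWidth + 7) / 8, 0);
  size_t bitPos = 0;
  for (uint32_t v : values) {
    for (unsigned b = 0; b < bitWidth; ++b, ++bitPos) {
      if ((v >> b) & 1u) {
        packed[bitPos / 8] |= static_cast<uint8_t>(1u << (bitPos % 8));
      }
    }
  }
  return packed;
}

// Hypothetical scalar baseline: unpack `count` values of `bitWidth` bits each.
std::vector<uint32_t> unpackScalar(const std::vector<uint8_t>& packed, size_t count, unsigned bitWidth) {
  assert(bitWidth >= 1 && bitWidth <= 32);
  std::vector<uint32_t> out(count);
  const uint64_t mask = (1ULL << bitWidth) - 1;
  uint64_t buffer = 0;        // bits read from `packed` but not yet consumed
  unsigned bitsInBuffer = 0;
  size_t pos = 0;
  for (size_t i = 0; i < count; ++i) {
    while (bitsInBuffer < bitWidth) {  // refill one byte at a time
      buffer |= static_cast<uint64_t>(packed[pos++]) << bitsInBuffer;
      bitsInBuffer += 8;
    }
    out[i] = static_cast<uint32_t>(buffer & mask);
    buffer >>= bitWidth;
    bitsInBuffer -= bitWidth;
  }
  return out;
}

// Average nanoseconds per unpacked value for one input length (count > 0).
// The output allocation is included in the timing, so treat the number as
// a rough comparison across sizes, not an absolute kernel cost.
double nanosPerValue(size_t count, unsigned bitWidth, int repeats) {
  assert(count > 0 && repeats > 0);
  const uint64_t mask = (1ULL << bitWidth) - 1;
  std::vector<uint32_t> values(count);
  for (size_t i = 0; i < count; ++i) {
    values[i] = static_cast<uint32_t>(i & mask);
  }
  const std::vector<uint8_t> packed = packScalar(values, bitWidth);
  volatile uint64_t sink = 0;  // keeps the compiler from dropping the loop
  const auto start = std::chrono::steady_clock::now();
  for (int r = 0; r < repeats; ++r) {
    sink = sink + unpackScalar(packed, count, bitWidth)[count - 1];
  }
  const auto end = std::chrono::steady_clock::now();
  const double ns = std::chrono::duration<double, std::nano>(end - start).count();
  return ns / (static_cast<double>(count) * repeats);
}
```

Calling `nanosPerValue` for a range of counts (for example 64, 1024, and 1 << 20) with a fixed bit width, and then replacing `unpackScalar` with each kernel under test, would show where one implementation starts to overtake the other.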
2 replies
-
arrow-avx2.txt
arrow-none.txt
I like the work in #3000 and #2353, but I can't reproduce your results.
Two files are attached: arrow-none.txt is the result with the default compile configuration,
while arrow-avx2.txt is the result with AVX2 enabled for arrow.
For the uint16 and uint32 results, I found that arrow-avx2 performs better.
My test was run on an Intel(R) Xeon(R) Platinum 8269CY CPU @ 2.50GHz.
Would you be willing to help confirm this? @yingsu00 @Yuhta
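When comparing a default build against an AVX2-enabled one, it can help to confirm which instruction sets each benchmark binary was actually compiled with. A minimal sketch, assuming GCC or Clang, which predefine `__AVX2__` and `__BMI2__` when `-mavx2` and `-mbmi2` (or a matching `-march`) are in effect; `compiledSimdFlags` is a hypothetical helper, not part of either library:

```cpp
#include <string>

// Hypothetical helper: reports which of the relevant instruction sets
// this translation unit was compiled with, based on predefined macros.
std::string compiledSimdFlags() {
  std::string flags;
#if defined(__AVX2__)
  flags += "avx2 ";
#endif
#if defined(__BMI2__)
  flags += "bmi2 ";
#endif
  return flags.empty() ? std::string("none") : flags;
}
```

Printing this string at the top of each benchmark run makes it easier to match a result file to its build configuration. It reflects compile-time flags only, not what the CPU supports at run time.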