-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[PERFORMANCE QUESTION] Why is DashMap faster with less threads? #177
Comments
Maybe the use of Atomics on SCC? |
Hi @redactedontop, DashMap is based on hashbrown, which is very aggressively optimized for single-threaded workloads, whereas SCC focuses more on predictability and scalability.
If one needs good single-threaded performance and latency spikes don't quite matter, then DashMap is the way to go. If good read performance is essential and the cost of frequent heap allocation (per-entry heap allocation) is not a problem, papaya is the best choice.
|
I forgot to answer the main part of the question; its single-threaded performance is not as good as others because I didn't pay much attention to this :-( I'll try to optimize it when I have some time. |
Hiya! I'll gladly help you, as I enjoy this project a lot: give me a checklist and some days, and I'll see what I can do ;) Thanks, |
@redactedontop awesome! The first step of optimization is usually profiling. On macOS, the profiling tool (Instruments) bundled with XCode is pretty powerful, so once a benchmark scenario is prepared (vanilla conc-map-bench is too short - need a longer one to get meaningful results), it will be possible to figure out where to optimize in the code. |
Would it be selfish to ask you for a flamegraph (cargo flamegraph) of a bigger benchmark for me to analyse? I currently cannot make benchmarks (studying + homework + personal projects)...? |
And do you have discord (quicker answers)? |
I can do profiling, but not sure when I can - possibly early Mar? / no I don’t do discord. |
I dug into the issue and figured that thread_local in sdd causes the difference.
It's interesting that the cost of |
Substituting |
That's so weird!
…On Wed, Mar 5, 2025 at 2:22 PM Changgyoo Park ***@***.***> wrote:
Substituting thread_local! with #[thread_local] helps much, but that's
gated behind an unstable feature flag.
—
Reply to this email directly, view it on GitHub
<#177 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/BJEGBBS25LAICH47IVS6THT2S4CAZAVCNFSM6AAAAABXHKUKHWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDOMBRGA4DQMZUGE>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
[image: wvwwvwwv]*wvwwvwwv* left a comment
(wvwwvwwv/scalable-concurrent-containers#177)
<#177 (comment)>
Substituting thread_local! with #[thread_local] helps much, but that's
gated behind an unstable feature flag.
—
Reply to this email directly, view it on GitHub
<#177 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/BJEGBBS25LAICH47IVS6THT2S4CAZAVCNFSM6AAAAABXHKUKHWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDOMBRGA4DQMZUGE>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
… On Sat, Mar 8, 2025 at 11:00 PM Niggerian ***@***.***> wrote:
That's so weird!
On Wed, Mar 5, 2025 at 2:22 PM Changgyoo Park ***@***.***>
wrote:
> Substituting thread_local! with #[thread_local] helps much, but that's
> gated behind an unstable feature flag.
>
> —
> Reply to this email directly, view it on GitHub
> <#177 (comment)>,
> or unsubscribe
> <https://github.com/notifications/unsubscribe-auth/BJEGBBS25LAICH47IVS6THT2S4CAZAVCNFSM6AAAAABXHKUKHWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDOMBRGA4DQMZUGE>
> .
> You are receiving this because you were mentioned.Message ID:
> ***@***.***
> com>
> [image: wvwwvwwv]*wvwwvwwv* left a comment
> (wvwwvwwv/scalable-concurrent-containers#177)
> <#177 (comment)>
>
> Substituting thread_local! with #[thread_local] helps much, but that's
> gated behind an unstable feature flag.
>
> —
> Reply to this email directly, view it on GitHub
> <#177 (comment)>,
> or unsubscribe
> <https://github.com/notifications/unsubscribe-auth/BJEGBBS25LAICH47IVS6THT2S4CAZAVCNFSM6AAAAABXHKUKHWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDOMBRGA4DQMZUGE>
> .
> You are receiving this because you were mentioned.Message ID:
> ***@***.***
> com>
>
|
Try one of 2 things:
1. Wrap the thread local initialization with const { } (ex. replace const
SOMETHING: _ = _::new() with const SOMETHING: _ = const { _::new() }) when
possible
2. Make a nightly-only version of the crate for honestly much faster code
(would be my course of action).
…On Sat, Mar 8, 2025 at 11:01 PM Niggerian ***@***.***> wrote:
Maybe https://swatinem.de/blog/slow-thread-local/?
On Sat, Mar 8, 2025 at 11:00 PM Niggerian ***@***.***>
wrote:
> That's so weird!
>
> On Wed, Mar 5, 2025 at 2:22 PM Changgyoo Park ***@***.***>
> wrote:
>
>> Substituting thread_local! with #[thread_local] helps much, but that's
>> gated behind an unstable feature flag.
>>
>> —
>> Reply to this email directly, view it on GitHub
>> <#177 (comment)>,
>> or unsubscribe
>> <https://github.com/notifications/unsubscribe-auth/BJEGBBS25LAICH47IVS6THT2S4CAZAVCNFSM6AAAAABXHKUKHWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDOMBRGA4DQMZUGE>
>> .
>> You are receiving this because you were mentioned.Message ID:
>> ***@***.***
>> com>
>> [image: wvwwvwwv]*wvwwvwwv* left a comment
>> (wvwwvwwv/scalable-concurrent-containers#177)
>> <#177 (comment)>
>>
>> Substituting thread_local! with #[thread_local] helps much, but that's
>> gated behind an unstable feature flag.
>>
>> —
>> Reply to this email directly, view it on GitHub
>> <#177 (comment)>,
>> or unsubscribe
>> <https://github.com/notifications/unsubscribe-auth/BJEGBBS25LAICH47IVS6THT2S4CAZAVCNFSM6AAAAABXHKUKHWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDOMBRGA4DQMZUGE>
>> .
>> You are receiving this because you were mentioned.Message ID:
>> ***@***.***
>> com>
>>
>
|
Also, can you send the complete SVG of both sdd and scc?
…On Sat, Mar 8, 2025 at 11:05 PM Niggerian ***@***.***> wrote:
Try one of 2 things:
1. Wrap the thread local initialization with const { } (ex. replace const
SOMETHING: _ = _::new() with const SOMETHING: _ = const { _::new() }) when
possible
2. Make a nightly-only version of the crate for honestly much faster code
(would be my course of action).
On Sat, Mar 8, 2025 at 11:01 PM Niggerian ***@***.***>
wrote:
> Maybe https://swatinem.de/blog/slow-thread-local/?
>
> On Sat, Mar 8, 2025 at 11:00 PM Niggerian ***@***.***>
> wrote:
>
>> That's so weird!
>>
>> On Wed, Mar 5, 2025 at 2:22 PM Changgyoo Park ***@***.***>
>> wrote:
>>
>>> Substituting thread_local! with #[thread_local] helps much, but that's
>>> gated behind an unstable feature flag.
>>>
>>> —
>>> Reply to this email directly, view it on GitHub
>>> <#177 (comment)>,
>>> or unsubscribe
>>> <https://github.com/notifications/unsubscribe-auth/BJEGBBS25LAICH47IVS6THT2S4CAZAVCNFSM6AAAAABXHKUKHWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDOMBRGA4DQMZUGE>
>>> .
>>> You are receiving this because you were mentioned.Message ID:
>>> ***@***.***
>>> com>
>>> [image: wvwwvwwv]*wvwwvwwv* left a comment
>>> (wvwwvwwv/scalable-concurrent-containers#177)
>>> <#177 (comment)>
>>>
>>> Substituting thread_local! with #[thread_local] helps much, but that's
>>> gated behind an unstable feature flag.
>>>
>>> —
>>> Reply to this email directly, view it on GitHub
>>> <#177 (comment)>,
>>> or unsubscribe
>>> <https://github.com/notifications/unsubscribe-auth/BJEGBBS25LAICH47IVS6THT2S4CAZAVCNFSM6AAAAABXHKUKHWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDOMBRGA4DQMZUGE>
>>> .
>>> You are receiving this because you were mentioned.Message ID:
>>> ***@***.***
>>> com>
>>>
>>
|
Also, what about Atomics instead of sdd? You can prevent ABA in HashMaps with double-checks (checking not just the alue but the others). |
Thanks, @redactedontop. I substituted the thread-local variable's non-const assignment with a Unfortunately, use-after-free is the problem, not ABA. The delayed memory reclamation prevents use-after-free if the hash map has been resized, and relying on thread-local is currently the optimal way to prevent it.
|
How come? Can't you prevent UAF with Atomics? |
Not quite possible as far as I know since while a reader is in a loop of { read-pointer - do-something - validate }, anything can happen including deallocating the pointee. EBR/RCU/HTM/hazard-pointers etc prevent this from happening. |
Then why not use HTM when supported instead of a full thread_local? Also, have you thought about the nightly #[thread_local] version? |
Simple: there is no decent crate that enables easy use of HTM instructions, and ![thread_local] is still unstable. |
Fair. Have you thought about making a nightly version of the crate? |
Heya!
As shown on the graph below, DashMap is a lot faster (throughput; 10 mega operations faster than scc) for 1 thread (scc eventually surpasses it). I'm a complete noob in optimization, and this is a genuine (and hopefully not rude) question: what are the big differences in implementations that make SCC's HashMap slower short-term but scale better?
Thanks,
Alex.
The text was updated successfully, but these errors were encountered: