RFC 2: Add Fiber::ExecutionContext::MultiThreaded #15517


Conversation

@ysbaddaden ysbaddaden commented Feb 25, 2025

Introduces the LAST EC scheduler, one that runs in multiple threads, with work stealing, so any thread can resume any runnable fiber in the context (no more starving threads).

Unlike the ST scheduler, the MT scheduler needs to actively park threads since only one thread in the context can run the event loop (no parallel runs).

Having a single event loop for the whole context, instead of one per thread, avoids situations where fibers wait in an event loop but aren't processed because that particular thread happens to be busy, causing delays. With a single event loop, as soon as a thread is starving it can check the event loop and enqueue the runnable fibers, which can be resumed (and stolen) immediately.
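For illustration, usage could look roughly like this. This is a hypothetical sketch: the exact constructor signature, arguments, and the compile-time flags needed to enable execution contexts are assumptions and may differ from what gets merged.

```crystal
# Hypothetical usage sketch: the constructor signature and required preview
# flags are assumptions, not necessarily what ends up merged.
mt = Fiber::ExecutionContext::MultiThreaded.new("workers", 4)

done = Channel(Int32).new

8.times do |i|
  mt.spawn do
    # Runs on one of the context's threads; if that thread is busy, the
    # fiber can be stolen and resumed by another thread of the same context.
    done.send(i * i)
  end
end

8.times { puts done.receive }
```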

NOTE: we can start running the specs in this context, though they sometimes segfault. Maybe because some spec helpers used to assume that fibers wouldn't switch, or maybe because of issues in the stdlib for the same reason (libxml, for example).

Kept in draft until #15511 and #15513 are merged.

refs #15342

Introduces the second EC scheduler that runs in multiple threads. Uses
the thread-safe queues (Runnables, GlobalQueue).

Contrary to the ST scheduler, the MT scheduler needs to actively park
the thread in addition to waiting on the event loop, because only one
thread is allowed to run the event loop.
@ysbaddaden ysbaddaden force-pushed the feature/execution-context-multithreaded branch from f5e466e to 7bcff33 Compare March 25, 2025 09:23
@ysbaddaden ysbaddaden marked this pull request as ready for review March 25, 2025 09:24
Comment on lines 89 to 93
```crystal
private def start_schedulers(hijack)
  @size.end.times do |index|
    @schedulers << Scheduler.new(self, "#{@name}-#{index}")
  end
end
```
Member

question: Why do we initialize all schedulers from the start? We only start the minimal number of threads, and we might never actually need all the schedulers if the number of threads never reaches the maximum.
Could we initialize them lazily?

Contributor Author

To avoid parallelism issues? For example in relation to stealing: we can expect the schedulers to exist, whether or not they're used. Maybe we could initialize lazily; that will need some experimentation.

@straight-shoota straight-shoota (Member) Mar 28, 2025

Steal is bounded by @schedulers.size and the array should never shrink, so accessing it should be fine. The worst that can happen is missing out on stealing from a scheduler that was added just after we took the size, but that should not be an issue.

Either way, we should document why the code works like it does so we can be reminded the next time we look at it.
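To illustrate the point, here is a toy sketch of a steal loop that snapshots the scheduler count before iterating. The names and types are made up for the example; this is not the scheduler code from the PR.

```crystal
# Toy illustration only: the array of schedulers is append-only, so taking a
# snapshot of its size and only indexing below it is safe; a scheduler
# appended after the snapshot is simply skipped for this round.
class SchedulerStub
  getter queue = Deque(String).new
end

def steal_once?(schedulers : Array(SchedulerStub), myself : SchedulerStub) : String?
  count = schedulers.size           # snapshot; the array never shrinks
  offset = rand(count)
  count.times do |i|
    victim = schedulers[(offset + i) % count]
    next if victim.same?(myself)
    if fiber = victim.queue.shift?  # stand-in for the real steal operation
      return fiber
    end
  end
  nil
end
```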

Contributor Author

We'd only be mutating the array during #initialize and, after that, inside a mutex, so it's safe.

But we're accessing @schedulers outside of a mutex when stealing (for obvious reasons), so there's no telling how the compiler or a weakly-ordered CPU like ARM might reorder the writes when pushing to the array (the size could be updated before the value is written). For example:

  • T1: mutex.lock (ensures following writes can't happen before)
  • T1: schedulers.@size += 1 (happens first)
  • T2: execution_context.steal
  • T2: schedulers.last => 💥
  • T1: schedulers.@buffer[size] = scheduler (too late)
  • T1: mutex.unlock (ensures above writes did happen)

Yes, I'm paranoid, but if something can happen, it will happen. The above shouldn't happen on x86 unless the LLVM optimizers decide to reorder, but it will happen on ARM.

To be safe, we'd need an array-like object that uses an atomic or fence to update the size after the value has been set. It could be used for both @schedulers and @threads.
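For illustration, a minimal sketch of such an array-like object (hypothetical, not what this PR implements): the element is written before the size is published with release semantics, so a reader that loads the size with acquire semantics never sees an uninitialized slot.

```crystal
# Hypothetical sketch of an append-only, fixed-capacity list whose size is
# published with release/acquire ordering, so readers can iterate without
# taking the mutex. Not the actual implementation.
class AppendOnlyList(T)
  def initialize(@capacity : Int32)
    @buffer = Pointer(T).malloc(@capacity)
    @size = Atomic(Int32).new(0)
  end

  # Writers are expected to be serialized by an external mutex.
  def push(value : T) : Nil
    size = @size.get(:relaxed)
    raise IndexError.new("capacity exceeded") if size >= @capacity
    @buffer[size] = value          # write the slot first...
    @size.set(size + 1, :release)  # ...then publish the new size
  end

  # Safe to call without the mutex (e.g. from a stealing scheduler).
  def each(& : T ->) : Nil
    size = @size.get(:acquire)
    size.times { |i| yield @buffer[i] }
  end
end
```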

Contributor Author

Note: @threads isn't affected because calls to @threads.size are purely informational; we don't use it to iterate over or access a thread, and when we do that, we always hold the mutex.

@ysbaddaden ysbaddaden (Contributor Author)

I sanitized the range handling, fixed the exclusive range/capacity handling, and wrote a few specs to assert the behavior. I also dropped a number of unused args.

@ysbaddaden ysbaddaden (Contributor Author)

Now the fixed range normalization and capacity checks should be correct 🤞
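For context, the kind of normalization in question could look roughly like this (hypothetical standalone sketch; names, signature, and error messages are illustrative, not the PR's actual code):

```crystal
# Hypothetical illustration: turn a size given as an Int or a Range
# (possibly exclusive) into a {minimum, maximum} pair and validate it.
def normalize_size(size : Int32 | Range(Int32, Int32)) : {Int32, Int32}
  case size
  in Int32
    minimum, maximum = 1, size
  in Range(Int32, Int32)
    minimum = size.begin
    maximum = size.exclusive? ? size.end - 1 : size.end
  end
  raise ArgumentError.new("needs at least one scheduler") if maximum < 1
  raise ArgumentError.new("invalid range") if minimum > maximum
  {minimum, maximum}
end

normalize_size(4)     # => {1, 4}
normalize_size(2..8)  # => {2, 8}
normalize_size(2...8) # => {2, 7}
```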

@straight-shoota straight-shoota added this to the 1.16.0 milestone Mar 31, 2025
@ysbaddaden ysbaddaden moved this from Review to Approved in Multi-threading Mar 31, 2025
@straight-shoota straight-shoota merged commit b83439c into crystal-lang:master Apr 3, 2025
35 checks passed
@github-project-automation github-project-automation bot moved this from Approved to Done in Multi-threading Apr 3, 2025
@ysbaddaden ysbaddaden deleted the feature/execution-context-multithreaded branch April 3, 2025 10:37