De-duplicate edges in typeinfer instead of gf.c #58117
base: master
Conversation
Since the caller CodeInstance is always part of the identity that we are de-duplicating on, this makes the linear scan much faster than the one in `gf.c`.
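A minimal sketch of that shape (hypothetical helper names, not the actual Base/Compiler code): de-duplicating while storing the edges of a single caller means each membership check only covers that caller's own short edge list.

```julia
# Minimal sketch, hypothetical names only (not the actual compiler internals):
# de-duplicate one caller's edge list before registering each back-edge.
const BACKEDGES = IdDict{Any,Vector{Any}}()    # toy stand-in for per-callee backedge lists

register_backedge!(callee, caller) = push!(get!(BACKEDGES, callee, Any[]), caller)

function store_backedges_dedup!(caller_ci, edges)
    seen = IdDict{Any,Nothing}()               # identity-keyed "already stored?" set
    for callee in edges
        haskey(seen, callee) && continue       # duplicate edge for this caller: skip it
        seen[callee] = nothing
        register_backedge!(callee, caller_ci)
    end
end

store_backedges_dedup!(:caller_ci, Any[:f, :g, :f])   # :f gets registered only once
```

Because the caller is fixed for the whole loop, the duplicate check never has to walk a callee's full backedge list the way the scan in `gf.c` does.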
}
}
}
assert(!found && "duplicate back-edge registered");
#endif
// reuse an already cached instance of this type, if possible
// TODO: use jl_cache_type_(tt) like cache_method does, instead of this linear scan?
It's worth seeing if we can bypass this remaining linear scan in the MethodTable edge insertion.
If I disable it manually, the timing drops to 440 milliseconds.
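Not something this PR attempts, but as a point of comparison, the remaining scan could in principle become a hashed membership check keyed on the (signature, caller) pair. A rough sketch with made-up names and a toy flat pair list:

```julia
# Rough sketch, made-up names (not the gf.c data structures): keep a hashed
# index next to a flat list of (sig, caller) pairs so duplicate detection is O(1).
struct EdgeTable
    backedges::Vector{Any}                     # toy flat list of alternating sig, caller entries
    index::IdDict{Tuple{Any,Any},Nothing}      # hashed membership check for the same pairs
end
EdgeTable() = EdgeTable(Any[], IdDict{Tuple{Any,Any},Nothing}())

function add_edge!(t::EdgeTable, sig, caller)
    key = (sig, caller)
    haskey(t.index, key) && return false       # already present: skip the insertion entirely
    t.index[key] = nothing
    push!(t.backedges, sig)
    push!(t.backedges, caller)
    return true
end
```

The trade-off would be the extra memory (and, in the real runtime, GC rooting) for the index; whether that pays off presumably depends on how long these lists get in practice.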
…Table` These are restored in their entirety by staticdata.jl, so there's no need to serialize them. Dropping them has the additional advantage of making it unnecessary to de-duplicate edges in `gf.c`
On my system, this saves ~500 ms when loading CairoMakie (and all dependent packages)
72735ee to b8aa007
@@ -1997,6 +1997,9 @@ JL_DLLEXPORT void jl_method_instance_add_backedge(jl_method_instance_t *callee,
jl_gc_wb(callee, backedges);
}
else {
#ifndef JL_NDEBUG
This introduces a race condition into the code (because of GC, threads, etc.). Probably useful to know that this scan is costly on some benchmarks, though.
Can you clarify where the race condition is? Do you mean that inference may simultaneously try to `store_backedges` for the same caller CI from two different threads?
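For concreteness, this is the kind of interleaving being asked about, as a standalone toy example rather than the compiler's actual data structures: two tasks doing an unsynchronized scan-then-push on the same list.

```julia
# Toy check-then-act race (not the compiler's code): both tasks can observe
# "edge not present" before either one pushes, so a duplicate slips in; the
# concurrent push! on a shared Vector is itself also unsafe.
function try_add!(list, edge)
    if !any(x -> x === edge, list)   # scan...
        push!(list, edge)            # ...then insert; the two steps are not atomic together
    end
end

backedges = Any[]
t1 = Threads.@spawn try_add!(backedges, :callee)
t2 = Threads.@spawn try_add!(backedges, :callee)
wait(t1); wait(t2)
count(x -> x === :callee, backedges)   # may be 1 or 2, depending on interleaving
```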
Ah, I see: we might now be able to assume that each CI is unique, while in older versions the MIs were not expected to be unique.
Without this PR, my system spends ~1.07 seconds just running `store_backedges` when doing `using CairoMakie`. With this change, that drops to 0.641 seconds. That's still not fast enough for me, but we do call this function 236,092 times, so maybe it's understandable.
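For reference, a crude way to collect this kind of aggregate number (total wall time plus call count) from the Julia side; this is a hypothetical wrapper, not how the figures above were obtained:

```julia
# Crude aggregate timer (hypothetical; not how the numbers above were measured):
# wrap a call site to accumulate total wall time and call count over a workload.
const TOTAL_NS = Ref{UInt64}(0)
const NCALLS = Ref{Int}(0)

function timed(f, args...)
    t0 = time_ns()
    result = f(args...)
    TOTAL_NS[] += time_ns() - t0
    NCALLS[] += 1
    return result
end

# After the workload (e.g. a package load), report the totals:
report() = println("total ", TOTAL_NS[] / 1e9, " s over ", NCALLS[], " calls")
```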