This reverts commit 92d88da.
```cpp
if (not fwd and threadIdx.x == 0 and t == seq_lens[seq]) {
  prev_states[final_states[seq]] = 0.0;
  for (unsigned fs = final_state_offsets[seq]; fs < final_state_offsets[seq+1]; fs++) {
```
Does `fs < final_state_offsets[seq+1]` work for the last sequence? E.g., are we adding `num_final_states` to the end of the offsets, or are we expecting the user to do it?
Yes, so far I'm expecting the user to do it. I'm not sure it's the best way, though. An alternative would be to provide a `num_end_states` array, from which we compute the offsets, like it's done for the edges.

However, we will have to extend the logic for the edge-offset computation at some point anyway if we want to use a single shared automaton for all sequences.

My idea would be to have a `[B,2]` array of start and end points to read out from the edge tensor. The user wouldn't know about it and would just have the usual `fbw_loss` plus an additional `fbw_loss_shared_automaton` or something. I just think it would be the easiest way to reuse the code that @curufinwe wrote for the "shared automaton" case.
I would try to avoid relying on the user doing the right thing. Adding `len(prev_states)` to `final_state_offsets` if its length is equal to `num_seqs` seems like a simple fix to me.
```cpp
unsigned* d_start_states = Ndarray_DEV_DATA_uint32(start_states);
unsigned* d_end_states = Ndarray_DEV_DATA_uint32(end_states);
unsigned* d_end_state_offsets = Ndarray_DEV_DATA_uint32(end_state_offsets);
unsigned* d_seq_lens = reinterpret_cast<unsigned*>(Ndarray_DEV_DATA_int32(seq_lens));
```
Why can we remove the `reinterpret_cast` above but not here?
I'm not sure, but I think we should be able to handle it the way the others are handled.
```diff
-    state_offsets.data().get(), edge_offsets.data().get(), d_seq_lens,
-    d_from, d_to, d_weights, d_emission_idxs, d_start_states, d_end_states,
+    state_offsets.data().get(), d_edge_offsets.data().get(), d_seq_lens,
+    d_from, d_to, d_weights, d_emission_idxs, d_start_states, d_end_states, d_end_state_offsets,
```
Since the new parameter does not work for the v1 `baum_welch` implementation, should we add a check somewhere that `debug_options.explicit_merge` is not set when we have multiple end states?
Yes that's a good idea. Alternatively I could adjust the v2 accordingly.
Co-authored-by: michelwi <michelwi@users.noreply.github.com>
This is one step towards computing the full "denominator".
It's a draft so far because I'd appreciate some input on the design: currently I extended the FBW2 Op by two parameters, where we recover the "usual" behavior if we pass an empty tensor for the extra parameter `end_state_offsets`. Alternatives would be:

- a separate `fbw2_cuda` function: this creates a lot of code redundancy in my opinion
- overloading `fbw2_cuda` and trying to call the more general function from the more specific one
- `std::optional`, but I'm far from an expert on that

The same questions can be asked for the python entry point, e.g. if we would like to have a separate `multi_end_fbw2` function for the python interface.