Update "past_sequence_length + 1" shape dimension name to "past_sequence_length + sequence_length" #2088
When exporting an LLM to ONNX with `--task text-generation-with-past`, the attention_mask and output kv-cache shapes contain a dimension named "past_sequence_length + 1". This name assumes that the input token sequence length is 1, which is not always the case, e.g. when filling the kv-cache with the input prompt.
Fix
I updated the strings "past_sequence_length + 1" to "past_sequence_length + sequence_length" to correctly describe the case where the input sequence is longer than one token.
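For illustration, here is a minimal, self-contained sketch (not Optimum's actual export code; the `ToyAttention` module and file name are made up) showing how these symbolic dimension names are attached via `torch.onnx.export`'s `dynamic_axes`, and why the new kv-cache axis is `past_sequence_length + sequence_length` rather than `past_sequence_length + 1`:

```python
import torch


class ToyAttention(torch.nn.Module):
    """Toy block that appends the new keys to the past kv-cache."""

    def forward(self, hidden_states, past_key):
        # New cache length = past_sequence_length + sequence_length
        present_key = torch.cat([past_key, hidden_states], dim=1)
        return present_key


model = ToyAttention()
hidden_states = torch.randn(1, 4, 8)  # sequence_length = 4 (prompt prefill), not 1
past_key = torch.randn(1, 16, 8)      # past_sequence_length = 16

torch.onnx.export(
    model,
    (hidden_states, past_key),
    "toy_attention.onnx",
    input_names=["hidden_states", "past_key"],
    output_names=["present_key"],
    dynamic_axes={
        "hidden_states": {0: "batch_size", 1: "sequence_length"},
        "past_key": {0: "batch_size", 1: "past_sequence_length"},
        # Labelling this axis "past_sequence_length + 1" is only valid for
        # single-token decoding steps; the prefill pass appends
        # sequence_length new entries at once.
        "present_key": {0: "batch_size", 1: "past_sequence_length + sequence_length"},
    },
)
```

The names are purely symbolic labels in the exported graph, so the change does not affect runtime behavior; it only makes the reported shapes accurate for prompts longer than one token.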
@fxmarty, @echarlaix, @JingyaHuang, @michaelbenayoun