Skip to content

feat: add skip_tokenizer option for pre-tokenized input#50

Open
ChuanLi1101 wants to merge 1 commit intomainfrom
feature/skip-tokenizer-clean
Open

feat: add skip_tokenizer option for pre-tokenized input#50
ChuanLi1101 wants to merge 1 commit intomainfrom
feature/skip-tokenizer-clean

Conversation

@ChuanLi1101
Copy link
Collaborator

Add skip_tokenizer option to LLMEngine allowing users to pass pre-tokenized input directly without loading a tokenizer. Includes lightweight unit tests.

- Add skip_tokenizer parameter to LLMEngine.__init__
- When skip_tokenizer=True, users must provide bos_token_id and eos_token_id
- InputOutputProcessor now accepts list[int] input without tokenizer
- Raise clear errors for string input or stop_strings when tokenizer is None
- Add lightweight unit tests for skip_tokenizer functionality
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant