Add parameter to ignore end of text token #181

Merged
ggerganov merged 2 commits into ggml-org:master from the ignore-eos branch on Mar 19, 2023

Conversation

slaren
Member

@slaren slaren commented Mar 15, 2023

Adds the `--ignore-eos` switch, which prevents generation of the end-of-text (EOS) token. This can be useful to avoid unexpected terminations in interactive mode and to force the model to generate longer output.

This is implemented by setting the logits of the eos token to zero, which seems to work well enough, but I am not sure if there may be any unwanted side effects.
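
To illustrate the possible side effect mentioned above, here is a minimal Python sketch. It is not the code from this PR. The token id `EOS_ID`, the helper names `softmax` and `suppress_eos`, and the sample logits are all assumptions made up for the example. It compares the EOS probability after overwriting its logit with zero against overwriting it with negative infinity, under plain softmax sampling.

```python
import math

EOS_ID = 2  # hypothetical EOS token id, chosen only for this example


def softmax(logits):
    """Convert raw logits to probabilities (numerically stable form)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def suppress_eos(logits, eos_id=EOS_ID, value=0.0):
    """Return a copy of logits with the EOS entry overwritten by `value`."""
    out = list(logits)
    out[eos_id] = value
    return out


# Made-up logits in which EOS (index 2) is the most likely token.
logits = [1.5, 0.5, 4.0, 2.0]

original = softmax(logits)
zeroed = softmax(suppress_eos(logits))                    # logit set to 0
masked = softmax(suppress_eos(logits, value=-math.inf))   # logit set to -inf

print(f"EOS probability, original: {original[EOS_ID]:.4f}")
print(f"EOS probability, logit=0:  {zeroed[EOS_ID]:.4f}")
print(f"EOS probability, logit=-inf: {masked[EOS_ID]:.4f}")
```

In this toy setup, a zero logit makes EOS much less likely but not impossible, since its probability depends on how large the other logits are. A logit of negative infinity removes it entirely. Whether this matters in practice depends on the model's logit range and the sampler, which this sketch does not model.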

@gjmulder gjmulder added the enhancement New feature or request label Mar 17, 2023
@ggerganov ggerganov merged commit 50fae10 into ggml-org:master Mar 19, 2023
@slaren slaren deleted the ignore-eos branch March 19, 2023 18:45
3 participants