Support for LongLLaMA tensor format of _past_key_values #390

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Open

vackosar wants to merge 1 commit into guidance-ai:main from vackosar:patch-1

vackosar commented Sep 30, 2023

Generalize the line in TransformerSession that trims the cache to support LongLLaMA tensor layout that has tuple length of 6 instead of 2.


          support for LongLLaMA tensor format of _past_key_values

fd508d9

paulbkoch force-pushed the main branch from c291eda to c4531ea Compare

February 3, 2025 08:45

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet