🔥 Remove our exllama code because we use auto-gptq vendored kernels #59

tjohnson31415 · 2024-03-14T06:07:08Z

Motivation

We recently found that AutoGPTQ vendors its own versions of the exllama and exllamav2 kernels in autogptq_extension, and that these are installed with the library. Since we install AutoGPTQ after installing our own builds of the exllama kernels, the AutoGPTQ ones overwrite our copies. So it turns out that we don't need to vendor and compile our own exllama kernels.
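One way to sanity-check this in a built environment is to ask Python where the kernel extension modules resolve from. The sketch below is illustrative only: the module names `exllama_kernels` and `exllamav2_kernels` are an assumption about what the compiled extensions are called, and `locate_modules` is a hypothetical helper, not part of this repository or of AutoGPTQ.

```python
import importlib.util

# Assumed names of the compiled kernel extension modules; adjust if they differ.
KERNEL_MODULES = ("exllama_kernels", "exllamav2_kernels")


def locate_modules(names=KERNEL_MODULES):
    """Map each module name to the file it would load from, or None if absent."""
    locations = {}
    for name in names:
        try:
            # find_spec only locates the module; it does not import it.
            spec = importlib.util.find_spec(name)
        except (ImportError, ValueError):
            spec = None
        locations[name] = spec.origin if spec is not None else None
    return locations


if __name__ == "__main__":
    for name, origin in locate_modules().items():
        print(f"{name}: {origin or 'not installed'}")
```

If both names still resolve to a file after our own builds are removed from the image, the kernels are being provided by the AutoGPTQ install.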

Modifications

Removes the vendored copies of exllama kernels.

Result

There should be no functional changes other than faster build times and less code.

Signed-off-by: Travis Johnson <[email protected]>

maxdebayser

LGTM

joerunde

get it

tjohnson31415 marked this pull request as ready for review March 14, 2024 17:01

🔥 Remove custom exllama code, use auto-gptq vendored instead

9613e56

Signed-off-by: Travis Johnson <[email protected]>

tjohnson31415 force-pushed the autogptq-exllama branch from 216a59a to 9613e56 Compare March 14, 2024 17:02

maxdebayser reviewed Mar 14, 2024

tjohnson31415 changed the title ~~🔥 Remove custom exllama code, use auto-gptq vendored instead~~ 🔥 Remove our exllama code, we use auto-gptq vendored kernels Mar 14, 2024

tjohnson31415 changed the title ~~🔥 Remove our exllama code, we use auto-gptq vendored kernels~~ 🔥 Remove our exllama code because we use auto-gptq vendored kernels Mar 14, 2024

joerunde approved these changes Mar 14, 2024

joerunde merged commit 0cc4a2e into main Mar 14, 2024

tjohnson31415 deleted the autogptq-exllama branch March 14, 2024 19:22

njhill mentioned this pull request Mar 19, 2024

Incoporate Marlin for GPTQ checkpoints into tgis_native #51

Closed

Xaenalt pushed a commit to Xaenalt/text-generation-inference that referenced this pull request Aug 1, 2024

Merge pull request IBM#59 from IBM/main

20fa70e

[pull] main from IBM:main
