Big upgrades #62

joerunde · 2024-03-18T22:46:00Z

Motivation

We need to update a whole bunch of things that will cause output differences, and we want to bundle them up together.

Modifications

Updates:

pytorch
flash attention
autogptq
cuda

Result

Slight differences in outputs for some text generation prompts on many models, but our quality tests indicate no major drop in result quality.

Signed-off-by: Joe Runde <[email protected]>

Dockerfile

Signed-off-by: Joe Runde <[email protected]>

joerunde · 2024-03-19T22:44:07Z

python package list from this image:

$ pip3 list | grep -iE '(flash|torch|auto|cuda)'
DEPRECATION: Loading egg at /opt/tgis/lib/python3.11/site-packages/custom_kernels-0.0.0-py3.11-linux-x86_64.egg is deprecated. pip 24.3 will enforce this behaviour change. A possible replacement is to use pip for package installation.. Discussion can be found at https://github.com/pypa/pip/issues/12330
auto_gptq                 0.7.1
flash-attn                2.5.6
nvidia-cuda-cupti-cu12    12.1.105
nvidia-cuda-nvrtc-cu12    12.1.105
nvidia-cuda-runtime-cu12  12.1.105
torch                     2.2.1+cu121

looks like the versions I expect- running performance and integration tests to make sure nothing is totally borked

njhill

This is great, thanks @joerunde

Signed-off-by: declark1 <[email protected]>

Signed-off-by: Joe Runde <[email protected]>

njhill and others added 2 commits March 18, 2024 16:46

Bump to auto-gptq 0.7.1

dbd8beb

Signed-off-by: Joe Runde <[email protected]>

👷 swap to autogptq wheel

654d4f2

Signed-off-by: Joe Runde <[email protected]>

joerunde force-pushed the big-upgrades branch from 0408c5e to 654d4f2 Compare March 18, 2024 22:46

joerunde added 2 commits March 18, 2024 17:00

👷 update cuda to 12.4

0c947e4

Signed-off-by: Joe Runde <[email protected]>

👷 update torch and flash_attn

3faefb2

Signed-off-by: Joe Runde <[email protected]>

njhill reviewed Mar 19, 2024

View reviewed changes

Dockerfile Outdated Show resolved Hide resolved

njhill mentioned this pull request Mar 19, 2024

Incoporate Marlin for GPTQ checkpoints into tgis_native #51

Closed

joerunde added 3 commits March 19, 2024 09:38

👷 install cuda 12.1

639e516

Signed-off-by: Joe Runde <[email protected]>

👷 install torch, autogptq for cuda 12

4f41f0b

Signed-off-by: Joe Runde <[email protected]>

🐛 swap 12-4 with 12-1

faca247

Signed-off-by: Joe Runde <[email protected]>

joerunde marked this pull request as ready for review March 19, 2024 20:13

Merge branch 'main' into big-upgrades

fdff2ef

njhill approved these changes Mar 19, 2024

View reviewed changes

joerunde and others added 2 commits March 20, 2024 12:04

Merge branch 'main' into big-upgrades

8aa6700

Add source for onnxruntime-gpu cuda 12 support

9c0339b

Signed-off-by: declark1 <[email protected]>

declark1 force-pushed the big-upgrades branch from b671dd0 to 9c0339b Compare March 20, 2024 22:22

joerunde added 2 commits March 21, 2024 09:40

Merge branch 'main' into big-upgrades

4b229cf

🐛 put index url on pip command

a8299eb

Signed-off-by: Joe Runde <[email protected]>

joerunde merged commit dacfe50 into main Mar 21, 2024

joerunde deleted the big-upgrades branch March 21, 2024 20:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Big upgrades #62

Big upgrades #62

Uh oh!

joerunde commented Mar 18, 2024

Uh oh!

Uh oh!

joerunde commented Mar 19, 2024

Uh oh!

njhill left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Big upgrades #62

Big upgrades #62

Uh oh!

Conversation

joerunde commented Mar 18, 2024

Motivation

Modifications

Result

Uh oh!

Uh oh!

joerunde commented Mar 19, 2024

Uh oh!

njhill left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants