Skip to content

Conversation

oobabooga
Copy link

This adds a workflow for llama-cpp-python CUDA wheels with tensor cores support, and appends it to the Build Wheels for New Release with Prioritization workflow.

I have been having tremendous difficulty in compiling those wheels, as I get flooded with errors saying

API rate limit exceeded for installation ID

that cause part of the wheels to not be uploaded to GitHub. So I end up having to run the same workflow several times for each new release.

Including this additional workflow will increase the compilation time for your actions, so I understand if you want to not include this in the repository. In this case, I can keep maintaining those additional wheels myself.

PS: MacOS wheels have been building successfully for all versions, so they can be added back if you haven't already.

@Smartappli
Copy link

Please create a batch file, and integrate your script in build-wheels-full-release.yml. Adapt powershell scripts (ps1 files)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants