
WHLs for cuda 11.7, 11.8, and 12.0 for future Releases #62

Closed
1 task done
Qubitium opened this issue Jun 26, 2024 · 5 comments
@Qubitium
Contributor

Qubitium commented Jun 26, 2024

Currently the bitblas WHL support is too limited: only CUDA >= 12.1. I understand that building so many WHL/Python/Torch combos is a headache, but I think it may be worth it.

Please include support for all CUDA versions supported by Torch >= 2.0.0, which means adding 11.7, 11.8, and 12.0 to the WHL builds.

Reasons:

  1. Lots of GPU-poor academics are locked to institution-provided environments where drivers are often pinned to CUDA 11.7 or 11.8.
  2. Compiling bitblas pulls in large OS library dependencies, so a simple git clone + build is not possible even on Ubuntu without installing extra packages. If the environment is not Ubuntu, this becomes a real problem for users who have no clue about builds.
  3. Allow third parties to fully embed bitblas without raising their Torch/CUDA requirements. GPTQModel, for example, has integrated bitblas as a non-optional dependency right now, but has to add CUDA checks for package compatibility and redirect users to a source compile at runtime (see the sketch below).
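
To illustrate reason 3, here is a rough sketch of the kind of runtime guard a downstream package has to carry while the wheels require CUDA >= 12.1. The names (`MIN_BITBLAS_CUDA`, `check_bitblas_compat`) are illustrative, not an existing GPTQModel or bitblas API:

```python
# Hypothetical sketch of a runtime compatibility guard a downstream package
# carries today; MIN_BITBLAS_CUDA and check_bitblas_compat are illustrative
# names, not part of any real bitblas or GPTQModel API.
import torch
from packaging.version import Version

MIN_BITBLAS_CUDA = Version("12.1")  # minimum CUDA for the currently published wheels


def check_bitblas_compat() -> bool:
    """Return True if the prebuilt bitblas wheel can be used in this environment."""
    cuda = torch.version.cuda  # e.g. "11.8", or None for CPU-only torch builds
    if cuda is None or Version(cuda) < MIN_BITBLAS_CUDA:
        print(
            f"bitblas wheels require CUDA >= {MIN_BITBLAS_CUDA}, found {cuda}; "
            "falling back to building bitblas from source."
        )
        return False
    return True
```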

Tasks

@LeiWang1999
Contributor

LeiWang1999 commented Jun 26, 2024

Hi @Qubitium, thank you for your attention. Indeed, bitblas is not officially released yet. We are currently working on performance-related optimizations, and there are still many items on our roadmap, such as CI/CD integration and support for vLLM. We are committed to completing these tasks and releasing more WHL packages in our official release.

We expect to complete these tasks in approximately two weeks.

LeiWang1999 self-assigned this Jul 1, 2024
@tngh5004

We're waiting for this action. Thank you for your efforts :)

@LeiWang1999
Contributor

@tngh5004, thanks for your attention, we will arrange this item as soon as possible.

@LeiWang1999
Contributor

Thanks @tzj-fxz for the fix. We just released 0.0.1.dev13 on PyPI; the dependency on CUDA 12 has been removed. Please feel free to test with:

pip install bitblas==0.0.1.dev13

@Qubitium , @tngh5004
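
For anyone testing the new wheel on a pre-12.1 CUDA environment, a minimal post-install sanity check could look like the sketch below. It only assumes that bitblas exposes `__version__` and that torch reports its CUDA toolkit version:

```python
# Minimal post-install sanity check for the dev13 wheel; assumes only that
# bitblas exposes __version__ and that torch reports its CUDA toolkit version.
import bitblas
import torch

print("bitblas version:", bitblas.__version__)       # expect 0.0.1.dev13
print("torch CUDA version:", torch.version.cuda)     # e.g. 11.8
print("CUDA device available:", torch.cuda.is_available())
```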

@tngh5004

Thank you very much. I'm currently busy finalizing my thesis, but I'll experiment with it as soon as possible to see if it works.
