Golang, Python, Kotlin, Swift. I prefer strongly typed languages and I do not worship PEP.
@ModelCloudAi
- ModelCloud.ai
- Earth/Epoch 2.0
- https://modelcloud.ai
- @qubitium
Pinned
- ModelCloud/GPTQModel (Public): An easy-to-use LLM quantization and inference toolkit based on the GPTQ algorithm (weight-only quantization).
- sgl-project/sglang (Public): SGLang is yet another fast serving framework for large language models and vision language models.
- vllm-project/vllm (Public): A high-throughput and memory-efficient inference and serving engine for LLMs.
- AutoGPTQ/AutoGPTQ (Public): An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
- flashinfer-ai/flashinfer (Public): FlashInfer: Kernel Library for LLM Serving.
- Dao-AILab/flash-attention (Public): Fast and memory-efficient exact attention.