Project URL
https://pypi.org/project/dashinfer/
Does this project already exist?
New Limit
300
Which indexes
PyPI
About the project
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.
We open-sourced DashInfer in April 2024 and recently updated it to v2.1.0, which ships prebuilt GPU code and exceeds the current upload limit.
Project github repository:
https://github.com/modelscope/dash-infer
Reasons for the request
Our project contains many CUDA kernels, and these kernels (not data) must be compiled for multiple SM architectures. This results in a large shared library (.so) inside our package; the wheels are currently about 289 MB.
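To illustrate why multi-SM builds inflate the binary: each target architecture adds another embedded cubin to the fat binary, so the shared library grows roughly linearly with the number of SMs. A minimal sketch (the file name and architecture list are hypothetical, not DashInfer's actual build flags):

```shell
# Hypothetical example: compile one kernel file for several SM architectures.
# Every -gencode pair embeds an additional cubin into the fat binary,
# so the resulting .so grows with each architecture added.
nvcc -shared -Xcompiler -fPIC \
     -gencode arch=compute_70,code=sm_70 \
     -gencode arch=compute_80,code=sm_80 \
     -gencode arch=compute_86,code=sm_86 \
     -gencode arch=compute_90,code=sm_90 \
     kernels.cu -o libkernels.so
```

The architectures embedded in the resulting library can be listed with `cuobjdump --list-elf libkernels.so`.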
Code of Conduct
I agree to follow the PSF Code of Conduct