Release v1.8.1 · leafspark/AutoGGUF

AutoGGUF v1.8.1

Changelog:

AutoFP8 quantization classes and window (currently WIP)
Minimize/maximize buttons to title bar
API key authentication support for the local server
HuggingFace upload/download class
OpenAPI docs for endpoints
Added new showcase image
Replaced Flask with FastAPI and Uvicorn for improved performance
Moved functions out of AutoGGUF.py into utils.py and TaskListItem.py
Updated llama.cpp convert scripts
Improved LoRA conversion process:
- Allow specifying output path in arguments
- Removed shutil.move operation
- Increased max number of LoRA layers
Changed default port to 7001
Now binding to localhost (127.0.0.1) instead of 0.0.0.0
Upadted Spanish localizations
Updated setuptools requirement from ~=68.2.0 to ~=74.0.0
Updated .env.example with new configuration parameters
Fix web page not found error
Use of proper status in TaskListItem
Passing quant_threads and Logger parameters to TaskListItem
Improved window moving smoothness
Prevent moving window below taskbar
Optimized imports in various files
Remove aliased quant types

Provide feedback