v1.8.1

leafspark released this 05 Sep 01:04
· 115 commits to main since this release
45d0212

AutoGGUF v1.8.1

Changelog:

  • AutoFP8 quantization classes and window (currently WIP)
  • Added minimize/maximize buttons to title bar
  • API key authentication support for the local server
  • HuggingFace upload/download class
  • OpenAPI docs for endpoints
  • Added new showcase image
  • Replaced Flask with FastAPI and Uvicorn for improved performance
  • Moved functions out of AutoGGUF.py into utils.py and TaskListItem.py
  • Updated llama.cpp convert scripts
  • Improved LoRA conversion process:
    • Allow specifying output path in arguments
    • Removed shutil.move operation
    • Increased max number of LoRA layers
  • Changed default port to 7001
  • Now binding to localhost (127.0.0.1) instead of 0.0.0.0
  • Updated Spanish localizations
  • Updated setuptools requirement from ~=68.2.0 to ~=74.0.0
  • Updated .env.example with new configuration parameters
  • Fixed web page not found error
  • Now using proper status in TaskListItem
  • Now passing quant_threads and Logger parameters to TaskListItem
  • Improved window moving smoothness
  • Prevented window from being moved below the taskbar
  • Optimized imports in various files
  • Removed aliased quant types
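
The server-related entries above (API key authentication, default port 7001, binding to 127.0.0.1 instead of 0.0.0.0) can be illustrated with a minimal standard-library sketch. This is not AutoGGUF's code: the helper names `is_authorized` and `open_listener` are hypothetical, and the real server uses FastAPI and Uvicorn, whose configuration is not shown in these notes.

```python
import hmac
import socket

HOST = "127.0.0.1"  # loopback only, as stated in the changelog
PORT = 7001         # new default port, as stated in the changelog


def is_authorized(provided_key, expected_key):
    """Compare API keys in constant time (hypothetical helper, assumed design)."""
    if not provided_key or not expected_key:
        return False
    return hmac.compare_digest(provided_key.encode(), expected_key.encode())


def open_listener(host=HOST, port=PORT):
    """Open a TCP listening socket bound to the given address."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
    sock.bind((host, port))
    sock.listen()
    return sock
```

A socket bound to 127.0.0.1 accepts connections only from the same machine, whereas 0.0.0.0 listens on every network interface, so the new default keeps the local server off the network unless it is reconfigured.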