Spaces: ggml-org/gguf-my-repo

GGUF My Repo re-design

#187

by olegshulyakov - opened Aug 9

Migrate the Docker image to the official llama.cpp CUDA image.
Rewrite app.py in an object-oriented style to redesign method signatures.
Add llama-quantize options: --token-embedding-type, --leave-output-tensor, --output-tensor-type.
Make output options customizable: repo name and file name.
Allow uploading different quants to the same repository.
Update the imatrix training file to calibration_data_v5_rc.txt.
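
For reviewers, here is a rough sketch of how the new quantize options could be assembled into a llama-quantize call. This is not the code from app.py: the helper name build_quantize_command, its parameters, and the example file names are all hypothetical, and it assumes the flags are passed before the positional input, output, and type arguments.

```python
from typing import List, Optional


def build_quantize_command(
    input_path: str,
    output_path: str,
    quant_type: str,
    token_embedding_type: Optional[str] = None,
    output_tensor_type: Optional[str] = None,
    leave_output_tensor: bool = False,
    imatrix_path: Optional[str] = None,
) -> List[str]:
    """Assemble an argument list for llama-quantize (hypothetical helper)."""
    cmd = ["llama-quantize"]
    # Optional flags are only emitted when the caller sets them.
    if imatrix_path:
        cmd += ["--imatrix", imatrix_path]
    if leave_output_tensor:
        cmd.append("--leave-output-tensor")
    if token_embedding_type:
        cmd += ["--token-embedding-type", token_embedding_type]
    if output_tensor_type:
        cmd += ["--output-tensor-type", output_tensor_type]
    # Positional arguments come last: input model, output model, quant type.
    cmd += [input_path, output_path, quant_type]
    return cmd


# Example file names are placeholders, not files from this Space.
cmd = build_quantize_command(
    "model-f16.gguf",
    "model-Q4_K_M.gguf",
    "Q4_K_M",
    token_embedding_type="q8_0",
    leave_output_tensor=True,
)
print(" ".join(cmd))
```

Returning a list rather than a shell string means the result can be handed to subprocess.run without shell quoting concerns.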

olegshulyakov changed pull request status to open Aug 10

Ready to merge

This branch is ready to get merged automatically.