Closed
Description
Is your feature request related to a problem? Please describe.
Does this project support loading a "sharded" gguf model file? The llama.cpp project appears to have added tooling for splitting gguf files into pieces (more here). I was curious whether this project supports loading gguf files in that format, since I didn't see any mention of it in the documentation or issues.
If it is supported, could you point me to the documentation on this or provide a code example? If not, perhaps this feature could be added?
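For context, as far as I can tell the split tool writes shards named like `<prefix>-00001-of-00003.gguf`. Below is a minimal Python sketch of how a loader might work out the full set of shard names from the first file, assuming that naming pattern holds. The helper name `expected_shard_names` is hypothetical and is not part of this project or of llama.cpp.

```python
import re
from pathlib import Path

# Assumed shard naming pattern: <prefix>-00001-of-00003.gguf
SHARD_RE = re.compile(
    r"^(?P<prefix>.+)-(?P<index>\d{5})-of-(?P<total>\d{5})\.gguf$"
)


def expected_shard_names(first_shard: str) -> list[str]:
    """Return the file names of all shards implied by one shard's name.

    Hypothetical helper: if the name does not match the assumed pattern,
    the file is treated as a single, unsharded model.
    """
    name = Path(first_shard).name
    match = SHARD_RE.match(name)
    if match is None:
        return [name]
    total = int(match["total"])
    prefix = match["prefix"]
    # Shard indices are assumed to be 1-based and zero-padded to 5 digits.
    return [f"{prefix}-{i:05d}-of-{total:05d}.gguf" for i in range(1, total + 1)]
```

Ideally, pointing the loader at the first shard would be enough for it to locate and load the rest.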