To run, execute koboldcpp.exe or drag and drop your quantized ggml_model.bin file onto the .exe, and then connect with Kobold or Kobold Lite. If you're not on Windows, run the script KoboldCpp.py after compiling the libraries. Launching with no command-line arguments displays a GUI containing a subset of the configurable settings; generally you don't have to change much besides the Presets and GPU Layers. Weights are not included: you can use the official llama.cpp quantize.exe to generate them from your official weight files, or download them from other places such as TheBloke's Hugging Face page. You can also rebuild koboldcpp.exe yourself with the provided makefiles and scripts.
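The launch step described above can be sketched as a small helper that picks the right entry point for the platform. This is only an illustration under stated assumptions: `build_launch_command` is a hypothetical helper, the `--model` and `--gpulayers` flags are assumed to match your KoboldCpp build (check `--help` to confirm), and the script name is assumed to be lowercase `koboldcpp.py`, so adjust it if your checkout differs.

```python
import sys


def build_launch_command(model_path, gpu_layers=0, platform=sys.platform):
    """Return the argument list for starting KoboldCpp with a given model.

    Hypothetical helper: the flag names below are assumptions and should be
    verified against the --help output of your KoboldCpp build.
    """
    if platform.startswith("win"):
        # Windows: use the prebuilt wrapper executable.
        command = ["koboldcpp.exe"]
    else:
        # Other platforms: run the Python script after compiling the libraries.
        # The lowercase file name is an assumption about the checkout.
        command = ["python3", "koboldcpp.py"]

    # Assumed flag for the quantized model file.
    command += ["--model", model_path]

    # Only pass the GPU layer count when offloading is requested.
    if gpu_layers > 0:
        command += ["--gpulayers", str(gpu_layers)]

    return command


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch has no side effects.
    print(" ".join(build_launch_command("ggml_model.bin", gpu_layers=20)))
```

The resulting list can be handed to `subprocess.run` once the paths are correct for your machine. Running the program with no arguments at all opens the GUI instead, where Presets and GPU Layers can be set interactively.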