Commit Graph

169 Commits (3fe11fe24da40a8afac7b8d7372a487042d5ce34)

Author SHA1 Message Date
mudler a9cd6b3ca3 ci: Fix tag detection for 'latest' 2 years ago
mudler e786576b95 Update README 2 years ago
Ettore Di Giacinto d426571789 Merge pull request #16 from go-skynet/fix_arm 2 years ago
mudler a896a2b5ad Drop armv7 builds 2 years ago
Ettore Di Giacinto 8273cd5c04 Merge pull request #15 from go-skynet/docker-compose 2 years ago
mudler 16f1281d38 Minor workflow fixes 2 years ago
mudler 8042e9a2d6 Add docker-compose 2 years ago
mudler 624092cb99 Update README 2 years ago
mudler a422a883ac Minor rephrasing 2 years ago
mudler 7858a97254 Update README 2 years ago
mudler 5556aa46dd Small refinements and refactors 2 years ago
mudler eb4257f946 Add .gitignore 2 years ago
mudler ae30bd346d Reorganize repository layout 2 years ago
mudler 93d8977ba2 Return model list 2 years ago
mudler f43aeeb4a1 Add both API endpoints (completion, chat) 2 years ago
mudler c17dcc5e9d Allow to inject prompt as part of the call 2 years ago
mudler 4a932483e1 Small fixup to template loading 2 years ago
mudler b710147b95 Add mutex on same models (parallel isn't supported yet) 2 years ago
mudler ba70363330 Use template input 2 years ago
mudler 9fb581739b Allow to template model prompts inputs 2 years ago
mudler 48aca246e3 Drop unused interactive mode 2 years ago
mudler 12eee097b7 Make it compatible with openAI api, support multiple models 2 years ago
mudler b33d015b8c Use go-llama.cpp 2 years ago
Ettore Di Giacinto b7c0a108f5 Update README.md 2 years ago
Ettore Di Giacinto f694a89c28 Update README.md 2 years ago
Ettore Di Giacinto be682e6c2f Update README.md 2 years ago
mudler bf85a31f9e Don't set a default model path 2 years ago
Ettore Di Giacinto d69048e0b0 Update README.md 2 years ago
mudler 827f189163 Update README 2 years ago
mudler a23deb5ec7 Drop duplicate target 2 years ago
mudler 999676b106 Add gpt4all instructions 2 years ago
mudler c61b023bc8 Drop fat images, will document how to consume models 2 years ago
mudler 650a22aef1 Add compatibility to gpt4all models 2 years ago
mudler 17b1724f7c Update llama-go 2 years ago
mudler e860e62036 Add mutex, build only lite images 2 years ago
Ettore Di Giacinto 1f45ff8cd6 Update README.md 2 years ago
mudler abee34f60a Cleanup leftover 2 years ago
mudler dbc70dc13c Add a simple web-page as index of the API for helping with inference testing 2 years ago
mudler 55142065eb Update README with building instructions 2 years ago
mudler d83d2293b5 Update version in kubernetes deployment 2 years ago
mudler 467ce5a7aa Update models download instructions, update images 2 years ago
mudler 4c9c5ce4ce Update README on instruction on how to prompt with the API 2 years ago
mudler 6394d85ca2 Lower conversion parallelism 2 years ago
mudler 2b6a5aef5f Lower earthly parallelism 2 years ago
mudler d191ecb9fe Disable release pipeline 2 years ago
mudler e14e1b0a77 Update README 2 years ago
mudler bffaf2aa42 Build images without model 2 years ago
mudler d98d1fe55e Use models from model repository 2 years ago
mudler 0785cb6b0b Update README with 13B and 30B model instructions 2 years ago
mudler f88d5ad829 Update MODEL_URL 2 years ago