mudler
|
a4191f6895
|
tests: adapt to new added backends
|
2 years ago |
mudler
|
660a67cb25
|
tests: enable whisper
|
2 years ago |
Ettore Di Giacinto
|
8250391e49
|
Add support for gptneox/replit (#238)
|
2 years ago |
Ettore Di Giacinto
|
fd1df4e971
|
whisper: add tests and allow to set upload size (#237)
|
2 years ago |
Ettore Di Giacinto
|
4413defca5
|
feat: add starcoder (#236)
|
2 years ago |
Ettore Di Giacinto
|
85f0f8227d
|
refactor: drop code dups (#234)
|
2 years ago |
Ettore Di Giacinto
|
59e3c02002
|
make use of new bindings for gpt4all (#232)
|
2 years ago |
Matthew Campbell
|
032dee256f
|
Keep whisper models in memory (#233)
|
2 years ago |
Matthew Campbell
|
6b5e2b2bf5
|
Upload transcription API wasn't reading the data from the post (#229)
|
2 years ago |
Ettore Di Giacinto
|
11675932ac
|
feat: add dolly/redpajama/bloomz models support (#214)
|
2 years ago |
Ettore Di Giacinto
|
f8ee20991c
|
feat: add bert.cpp embeddings (#222)
|
2 years ago |
Ettore Di Giacinto
|
9f426578cf
|
feat: add transcript endpoint (#211)
|
2 years ago |
Ettore Di Giacinto
|
89dfa0f5fc
|
feat: add experimental support for embeddings as arrays (#207)
|
2 years ago |
Dave
|
07ec2e441d
|
mini fix - OpenAI documentation url (#200)
|
2 years ago |
mudler
|
8c8cf38d4d
|
tests: use 1 core
|
2 years ago |
mudler
|
009ee47fe2
|
Don't allow 0 as thread count
|
2 years ago |
mudler
|
ec2adc2c03
|
tests: use 3 cores
|
2 years ago |
mudler
|
e62ee2bc06
|
fix: remove trailing 0s from embeddings
This happens when no max_tokens are set, so by default go-llama
allocates more space for the slice and padding happens.
|
2 years ago |
mudler
|
b49721cdd1
|
fix: respect config from file for backends settings
|
2 years ago |
mudler
|
64c0a7967f
|
fix: pass prediction options when using the model
|
2 years ago |
mudler
|
e96eadab40
|
feat: support deprecated embeddings API
|
2 years ago |
mudler
|
e73283121b
|
feat: support arrays for prompt and input
Signed-off-by: mudler <mudler@mocaccino.org>
|
2 years ago |
mudler
|
857d13e8d6
|
debug: wire up go-fiber debugger
|
2 years ago |
Ettore Di Giacinto
|
961cf29217
|
feat: expose mirostat to config (#193)
|
2 years ago |
Ettore Di Giacinto
|
c839b334eb
|
feat: add embeddings for go-llama.cpp backend (#190)
|
2 years ago |
Ettore Di Giacinto
|
714bfcd45b
|
fix: missing returning error and free callback stream (#187)
|
2 years ago |
Ettore Di Giacinto
|
fdf75c6d0e
|
rwkv fixes and examples (#185)
|
2 years ago |
Ettore Di Giacinto
|
c974dad799
|
Return usage in the API responses (#166)
|
2 years ago |
Ettore Di Giacinto
|
67992a7d99
|
feat: support slices or strings in the prompt completion endpoint (#162)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2 years ago |
Ettore Di Giacinto
|
751b7eca62
|
feat: add rwkv support (#158)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2 years ago |
Ettore Di Giacinto
|
1ae7150810
|
feat: allow to specify default backend for model (#156)
Signed-off-by: mudler <mudler@c3os.io>
|
2 years ago |
Ettore Di Giacinto
|
70caf9bf8c
|
feat: support stopwords both string and arrays (#154)
|
2 years ago |
Dave
|
0b226ac027
|
Stop parameter of OpenAIRequest changed to String Array (#153)
|
2 years ago |
Ettore Di Giacinto
|
220d6fd59b
|
feat: add stream events (#152)
|
2 years ago |
Ettore Di Giacinto
|
156e15a4fa
|
Bump llama.cpp, downgrade gpt4all-j (#149)
|
2 years ago |
Ettore Di Giacinto
|
92452d46da
|
feat: add new gpt4all-j binding (#142)
|
2 years ago |
Ettore Di Giacinto
|
52f4d993c1
|
feat: add /edit endpoint (#119)
|
2 years ago |
Ettore Di Giacinto
|
c806eae0de
|
feat: config files and SSE (#83)
Signed-off-by: mudler <mudler@mocaccino.org>
Signed-off-by: Tyler Gillson <tyler.gillson@gmail.com>
Co-authored-by: Tyler Gillson <tyler.gillson@gmail.com>
|
2 years ago |
Ettore Di Giacinto
|
12d83a4184
|
feat: Return OpenAI errors and update docs (#80)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2 years ago |
Ettore Di Giacinto
|
1c872ec326
|
feat: add CI/tests (#58)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2 years ago |
Ettore Di Giacinto
|
79791438fe
|
Use the first available model if not specified (#55)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2 years ago |
Ettore Di Giacinto
|
5cba71de70
|
Add stopwords, debug mode, and other API enhancements (#54)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2 years ago |
Ettore Di Giacinto
|
f816dfae65
|
Add support for stablelm (#48)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2 years ago |
Ettore Di Giacinto
|
1c4fbaae20
|
Add support for cerebras (#45)
Signed-off-by: mudler <mudler@c3os.io>
|
2 years ago |
Ettore Di Giacinto
|
d517a54e28
|
Major API enhancements (#44)
|
2 years ago |
Ettore Di Giacinto
|
80f50e6ccd
|
Rename project to LocalAI (#35)
Signed-off-by: mudler <mudler@c3os.io>
|
2 years ago |
Ettore Di Giacinto
|
7fec26f5d3
|
Enhancements (#34)
Signed-off-by: mudler <mudler@c3os.io>
|
2 years ago |
Ettore Di Giacinto
|
0b330d90ad
|
feat: drop embedded webui (#27)
Signed-off-by: mudler <mudler@c3os.io>
|
2 years ago |
Ettore Di Giacinto
|
63601fabd1
|
feat: drop default model and llama-specific API (#26)
Signed-off-by: mudler <mudler@c3os.io>
|
2 years ago |
Ettore Di Giacinto
|
b062f3142b
|
feat: enhance API, expose more parameters (#24)
Signed-off-by: mudler <mudler@c3os.io>
|
2 years ago |