Commit f3670aa

add pyyaml dependency
1 parent b333a06 commit f3670aa

5 files changed: +17 -6 lines changed

.gitattributes

+5
@@ -0,0 +1,5 @@
+# this file helps control behavior of "git archive", which also used by github release
+ut_data/ export-ignore
+
+version.txt export-subst
+

.gitignore

+1
@@ -1,6 +1,7 @@
 /models
 *.swp
 /.idea
+.DS_Store
 /config.yml
 # Byte-compiled / optimized / DLL files
 __pycache__/

README.md

+8 -6
@@ -31,7 +31,9 @@ models:
 params:
 path: /absolute/path/to/your/7B/ggml-model-q4_0.bin
 EOF
-python -m python -m llama_api_server
+
+# start web server
+python -m llama_api_server
```
 
 ### Call with openai-python
@@ -49,13 +51,13 @@ openai api completions.create -e text-davinci-003 -p "hello?"
 
 #### Supported APIs
 - [X] Completions
-- [X] set `temperature`, `top\_p`, and `top\_k`
-- [X] set `max\_tokens`
+- [X] set `temperature`, `top_p`, and `top_k`
+- [X] set `max_tokens`
 - [ ] set `stop`
 - [ ] set `stream`
 - [ ] set `n`
-- [ ] set `presence\_penalty` and `frequency\_penalty`
-- [ ] set `logit\_bias`
+- [ ] set `presence_penalty` and `frequency_penalty`
+- [ ] set `logit_bias`
 - [X] Embeddings
 - [X] batch process
 - [ ] Chat
@@ -64,8 +66,8 @@ openai api completions.create -e text-davinci-003 -p "hello?"
 - [X] [llama.cpp](https://github.com/ggerganov/llama.cpp) via [llamacpp-python](https://github.com/thomasantony/llamacpp-python)
 
 #### Others
+- [X] Performance parameters like `n_batch` and `n_thread`
 - [ ] Documents
 - [ ] Token auth
 - [ ] Intergration tests
-- [ ] Performance parameters like `n_batch` and `n_thread`
 - [ ] A tool to download/prepare pretrain model

pyproject.toml

+1
@@ -28,6 +28,7 @@ dependencies = [
 "llamacpp>=0.1.11",
 "Flask>=2.0.0",
 "numpy",
+"pyyaml",
 ]
 
 [project.urls]
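
The new `pyyaml` dependency lines up with the YAML config the README writes to `config.yml`. As a rough sketch of how such a file could be read with `yaml.safe_load`: the nesting below, the model name `example-model`, and the helper `load_model_paths` are assumptions for illustration. Only the keys `models`, `params`, and `path` appear in the README excerpt, and this is not necessarily how `llama_api_server` loads its config.

```python
import yaml  # provided by the pyyaml package

# Hypothetical config text. Only `models`, `params`, and `path` come from the
# README excerpt; the nesting and the model name are assumptions.
CONFIG_TEXT = """
models:
  example-model:
    params:
      path: /absolute/path/to/your/7B/ggml-model-q4_0.bin
"""


def load_model_paths(text):
    """Map each model name to the `path` found under its `params` mapping."""
    # safe_load returns None for an empty document, so fall back to {}.
    config = yaml.safe_load(text) or {}
    return {
        name: entry["params"]["path"]
        for name, entry in config.get("models", {}).items()
    }


paths = load_model_paths(CONFIG_TEXT)
print(paths)
```

`yaml.safe_load` is used rather than `yaml.load` so that a config file cannot construct arbitrary Python objects.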

version.txt

+2
@@ -0,0 +1,2 @@
+$Format:%h %d$
+
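
Because `.gitattributes` marks `version.txt` as `export-subst`, `git archive` expands the `$Format:%h %d$` placeholder into the abbreviated commit hash followed by the ref decoration. In a plain checkout the placeholder stays as written. A minimal sketch of how a consumer might tell the two cases apart is below; the function name `parse_version_txt` and the sample tag `v0.1.0` are hypothetical and not part of this commit.

```python
def parse_version_txt(text):
    """Return (short_hash, refs), or None if the placeholder was not expanded."""
    text = text.strip()
    if text.startswith("$Format:"):
        # Not produced by "git archive": the placeholder is still literal.
        return None
    # "%h %d" expands to something like "f3670aa  (HEAD -> main, tag: v0.1.0)".
    short_hash, _, decoration = text.partition(" ")
    refs = [ref.strip() for ref in decoration.strip("() ").split(",") if ref.strip()]
    return short_hash, refs


# Hypothetical archive output; the tag name is an assumption.
print(parse_version_txt("f3670aa  (HEAD -> main, tag: v0.1.0)\n"))
print(parse_version_txt("$Format:%h %d$\n"))
```

Returning `None` for the unexpanded placeholder lets a caller fall back to another version source when running from a git checkout instead of a release archive.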
