Add Kubernetes deployment instructions to README.md

2023-08-18 23:56:43 +07:00 · 3553e20a3d
parent ca6ec9b370
commit 3553e20a3d
1 changed file with 24 additions and 3 deletions
--- a/README.md
+++ b/README.md
@@ -36,11 +36,11 @@ https://github.com/getumbrel/llama-gpt/assets/10330103/5d1a76b8-ed03-4a51-90bd-1
 Running LlamaGPT on an [umbrelOS](https://umbrel.com) home server is one click. Simply install it from the [Umbrel App Store](https://apps.umbrel.com/app/llama-gpt).
 <!-- Todo: update badge link after launch  -->
 [![LlamaGPT on Umbrel App Store](https://apps.umbrel.com/app/llama-gpt/badge-light.svg)](https://apps.umbrel.com/app/llama-gpt)
-### Install LlamaGPT anywhere else
+---
 ### Install LlamaGPT anywhere else with Docker
 You can run LlamaGPT on any x86 or arm64 system. Make sure you have Docker installed.
@@ -67,6 +67,27 @@ To stop LlamaGPT, run:
 docker compose down
 ```
 ---
 ### Install LlamaGPT with Kubernetes
 First, make sure you have a running Kubernetes cluster and that `kubectl` is configured to interact with it.
 Then, clone this repo and `cd` into it.
 To deploy to Kubernetes, first create a namespace:
 ```bash
 kubectl create ns llama
 ```
 Then apply the Kustomize manifests in the `deploy/kubernetes` directory:
 ```bash
 kubectl apply -k deploy/kubernetes/. -n llama
 ```
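 Finally, expose the service using whatever method you normally use on your cluster, for example an Ingress, a `LoadBalancer` or `NodePort` Service, or `kubectl port-forward`.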
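 After applying the manifests, you may want to confirm that the workloads started and then reach the UI locally. The commands below are a minimal sketch, not part of the original instructions: the Service name `llama-gpt-ui` and port `3000` are assumptions, so check the output of `kubectl get svc` and substitute the values your cluster reports.
 ```bash
 # List the pods and services created in the llama namespace
 kubectl get pods -n llama
 kubectl get svc -n llama

 # Hypothetical Service name and port; replace with the values listed above
 SERVICE_NAME="llama-gpt-ui"
 kubectl port-forward -n llama "svc/${SERVICE_NAME}" 3000:3000
 ```
 Port forwarding is convenient for a quick local check. For longer-lived access, an Ingress or a `LoadBalancer` Service is usually a better fit.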
 ## Benchmarks
 We've tested LlamaGPT models on the following hardware with the default system prompt, and user prompt: "How does the universe expand?" at temperature 0 to guarantee deterministic results. Generation speed is averaged over the first 10 generations.