add CPU version of the deployment
parent: 2f7813c591
commit: 3bb1117361

README.md
@@ -38,7 +38,16 @@ A quick prototype to self-host [LibreChat](https://github.com/danny-avila/LibreChat)
6. Browse http://localhost:3080/
7. Create an admin account and start chatting!

The API along with the APIDoc will be available at http://localhost:8000/
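Once the stack is up, a quick smoke test from the terminal is simply to fetch that URL; the individual API routes are defined by the API service, which this commit does not touch:

```sh
# Fetch the APIDoc page; a successful response confirms the API container is up.
curl -i http://localhost:8000/
```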
### Steps for NO GPU (use CPU)

**Warning: This may be very slow depending on your CPU and may use a lot of RAM depending on the model.**

1. Make sure your drivers are up to date.
2. Clone the repo.
3. Copy the CPU compose spec to select it: `cp docker-compose.cpu.yml docker-compose.yml`
4. Run `docker compose up`. Wait a few minutes for the model to be downloaded and served (see the condensed sketch after this list).
5. Browse http://localhost:3080/
6. Create an admin account and start chatting!
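A minimal condensed version of steps 3 and 4, assuming the repo is already cloned and you are in its root directory:

```sh
# Select the CPU compose spec so `docker compose` picks it up by default.
cp docker-compose.cpu.yml docker-compose.yml

# Bring the stack up; the first run downloads the model, so it takes a while.
docker compose up

# Optional: in another terminal, list the models Ollama has pulled so far
# (/api/tags is Ollama's standard endpoint for locally available models).
curl http://localhost:11434/api/tags
```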
## Configuring additional models
docker-compose.cpu.yml

@@ -0,0 +1,17 @@
include:
  - docker-compose.base.yml

services:
  # Begin Ollama service
  ollama:
    image: ollama/ollama:0.1.23
    restart: unless-stopped
    entrypoint: /bootstrap.sh
    command: mistral
    env_file:
      - .env
    ports:
      - 11434:11434
    volumes:
      - ./ollama/bootstrap.sh:/bootstrap.sh:ro
      - ./ollama:/root/.ollama
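The service overrides the image entrypoint with `/bootstrap.sh` (mounted read-only from `ollama/bootstrap.sh` in the repo) and passes the model name via `command`, so the script receives `mistral` as `$1`. The script itself is not part of this diff; under that assumption, a minimal sketch of what such a bootstrap script typically looks like:

```sh
#!/bin/sh
# Hypothetical sketch of ollama/bootstrap.sh -- the real script is not shown
# in this commit. Start the Ollama server, pull the model named by the compose
# `command` argument, then stay attached to the server process.
set -e

ollama serve &     # serve the API on :11434 (published by the compose file)
sleep 5            # crude wait for the server to start accepting requests
ollama pull "$1"   # download the model into /root/.ollama (the ./ollama mount)
wait               # keep the container alive as long as the server runs
```

Because `./ollama` is bind-mounted at `/root/.ollama`, downloaded model weights persist across container restarts, which is why only the first `docker compose up` has to wait for the download.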