mirror of
https://github.com/webinstall/webi-installers.git
synced 2026-05-17 22:26:42 +00:00
128 lines
3.0 KiB
Markdown
128 lines
3.0 KiB
Markdown
---
|
|
title: ollama
|
|
homepage: https://github.com/jmorganca/ollama
|
|
tagline: |
|
|
ollama is a golang LLM server built for ease of use.
|
|
---
|
|
|
|
To update or switch versions, run `webi ollama@stable` (or `@v0.1.5`, etc).
|
|
|
|
## Cheat Sheet
|
|
|
|
> `ollama` is an LLM serving platform written in golang. It makes LLMs built on
|
|
> Llama standards easy to run with an API.
|
|
|
|
To get started quickly with the open source LLM Mistral-7b as an example is two
|
|
commands.
|
|
|
|
1. Open **TWO Terminals**
|
|
2. In the **first**, start the `ollama` server
|
|
```sh
|
|
OLLAMA_ORIGINS='*' OLLAMA_HOST=localhost:11434 ollama serve
|
|
```
|
|
3. In the **second**, run the `ollama` CLI (using the Mistral-7b model)
|
|
```sh
|
|
ollama pull mistral
|
|
ollama run mistral
|
|
```
|
|
|
|

|
|
|
|
## Table of Contents
|
|
|
|
- Files
|
|
- ChatGPT-style Web UI
|
|
- System Notes
|
|
- Models to Try
|
|
- As a Network API
|
|
|
|
### Files
|
|
|
|
These are the files / directories that are created and/or modified with this
|
|
install:
|
|
|
|
```text
|
|
~/.config/envman/PATH.env
|
|
~/.local/bin/ollama
|
|
~/.ollama/models/
|
|
```
|
|
|
|
### How to Use a ChatGPT-style Web Interface
|
|
|
|
There are [many Ollama UIs][ollama-uis] to choose from, but `ollama-webui` is
|
|
easy to start with (and can be built as a static page):
|
|
|
|

|
|
|
|
[ollama-uis]:
|
|
https://github.com/jmorganca/ollama/issues/875#issuecomment-1778045758
|
|
|
|
1. Install [`node`](../node/)
|
|
```sh
|
|
webi node@lts
|
|
source ~/.config/envman/PATH.env
|
|
```
|
|
2. Clone and enter the `ollama-webui` repo
|
|
```sh
|
|
git clone https://github.com/ollama-webui/ollama-webui.git ./ollama-webui/
|
|
pushd ./ollama-webui/
|
|
```
|
|
3. Install and start the project
|
|
```sh
|
|
cp -RPp ./example.env ./.env
|
|
npm clean-install
|
|
npm run dev
|
|
```
|
|
4. Enjoy!
|
|
- <http://localhost:5173/>
|
|
|
|
Note: Be sure to run `ollama` with CORS enabled:
|
|
|
|
```sh
|
|
OLLAMA_ORIGINS='*' OLLAMA_HOST=localhost:11434 ollama serve
|
|
```
|
|
|
|
## System Notes
|
|
|
|
You'll need a fairly modern computer. An Apple M1 Air works great.
|
|
|
|
- 8GB+ RAM
|
|
- 4GB+ Storage
|
|
- Models range between 3GB and 30GB+ \
|
|
(they can take a while to download, and _several_ seconds to initialize)
|
|
|
|
### How to Downloads Other Models
|
|
|
|
See the list at <https://ollama.ai/library>.
|
|
|
|
For example, we could try `sqlcoder`, or `orca-mini` (because it's small):
|
|
|
|
```sh
|
|
ollama pull sqlcoder
|
|
ollama run sqlcoder
|
|
```
|
|
|
|
```sh
|
|
ollama pull orca-mini
|
|
ollama run orca-mini
|
|
```
|
|
|
|
### How to Use as an API on a Network
|
|
|
|
If you'd like `ollama` to be accessible beyond `localhost` (`127.0.0.1`):
|
|
|
|
- set the host to `0.0.0.0`, which makes it accessible to _ALL_ networks
|
|
- you may wish to **limit origins**
|
|
|
|
```sh
|
|
# fully open to all
|
|
OLLAMA_ORIGINS='*' OLLAMA_HOST=0.0.0.0:11435 ollama serve
|
|
|
|
# restrict browsers (not APIs) to requests from https://example.com
|
|
OLLAMA_ORIGINS='https://example.com' OLLAMA_HOST=0.0.0.0:11435 ollama serve
|
|
```
|
|
|
|
See also:
|
|
|
|
- API Docs: <https://github.com/jmorganca/ollama/blob/main/docs/api.md>
|