
NeonAI LLM FastChat

Proxies API calls to FastChat.

Request Format

API requests should include history, a list of two-element [speaker, message] string pairs, and the current query.

Example Request:

{
 "history": [["user", "hello"], ["llm", "hi"]],
 "query": "how are you?"
}
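Requests are delivered over the MQ broker configured below (see Docker Configuration). Assuming a RabbitMQ-compatible broker, a minimal sketch of publishing a request with pika follows; the queue name fastchat_input and the connection details are illustrative assumptions and should be replaced with values from your own MQ configuration.

import json

import pika

# Assumed connection details; use the host, port, and credentials
# from your diana.yaml MQ configuration.
credentials = pika.PlainCredentials("neon_fastchat", "<password>")
params = pika.ConnectionParameters(host="localhost", port=5672,
                                   credentials=credentials)

request = {
    "history": [["user", "hello"], ["llm", "hi"]],
    "query": "how are you?",
}

with pika.BlockingConnection(params) as connection:
    channel = connection.channel()
    # "fastchat_input" is a hypothetical queue name for illustration;
    # the actual routing key depends on your deployment.
    channel.basic_publish(exchange="",
                          routing_key="fastchat_input",
                          body=json.dumps(request).encode("utf-8"))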

Response Format

Responses are returned as dictionaries containing the following:

  • response - String LLM response to the query
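On the consuming side, only the response key needs to be read from the decoded payload. A minimal sketch, assuming the response arrives as JSON-encoded bytes:

import json

def parse_llm_response(body: bytes) -> str:
    """Return the LLM's reply from a raw response payload."""
    payload = json.loads(body)
    return payload["response"]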

Docker Configuration

When running this as a docker container, the XDG_CONFIG_HOME environment variable is set to /config. A configuration file at /config/neon/diana.yaml is required and should look like:

MQ:
  port: <MQ Port>
  server: <MQ Hostname or IP>
  users:
    neon_llm_fastchat:
      password: <neon_fastchat user's password>
      user: neon_fastchat
LLM_FASTCHAT:
  context_depth: 3
  max_tokens: 256
  num_parallel_processes: 2
  num_threads_per_process: 4
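For reference, a minimal sketch of reading these values back with PyYAML, mirroring the XDG_CONFIG_HOME convention described above (the fallback path is an assumption for local testing):

import os

import yaml

# Inside the container XDG_CONFIG_HOME is /config, so the file resolves
# to /config/neon/diana.yaml; fall back to ~/.config when testing locally.
config_home = os.environ.get("XDG_CONFIG_HOME",
                             os.path.expanduser("~/.config"))
with open(os.path.join(config_home, "neon", "diana.yaml")) as f:
    config = yaml.safe_load(f)

llm_config = config["LLM_FASTCHAT"]
print(llm_config["context_depth"], llm_config["max_tokens"])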

For example, if your configuration resides in ~/.config:

export CONFIG_PATH="/home/${USER}/.config"
docker run -v ${CONFIG_PATH}:/config neon_llm_fastchat

Note: If connecting to a local MQ server, you may need to specify --network host.

GPU

System setup

# Nvidia Docker
sudo apt install curl
distribution=$(. /etc/os-release;echo $ID$VERSION_ID)
curl -s -L https://nvidia.github.io/nvidia-docker/gpgkey | sudo apt-key add -
curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.list | sudo tee /etc/apt/sources.list.d/nvidia-docker.list

sudo apt-get update && sudo apt-get install -y nvidia-container-toolkit
sudo systemctl restart docker

Run docker

export CONFIG_PATH="/home/${USER}/.config"
docker run --gpus all -v ${CONFIG_PATH}:/config neon_llm_fastchat
