Deploy an LLM on your local machine (Vicuna / GPU / Windows)



Vicuna has arrived: a fresh LLM that aims to deliver roughly 90% of ChatGPT's quality on your personal computer. Vicuna is a free model fine-tuned on a dataset of conversations shared by ChatGPT users, and its developers claim it can reach up to 90% of ChatGPT's capability. This guide walks through running the Vicuna model on your PC.


  1. Git
  2. An Nvidia GPU
  3. Windows OS
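Before installing anything, you can sanity-check the first two prerequisites from a terminal (a minimal sketch; the exact driver check may differ on your system):

```shell
# Quick sanity check for the prerequisites.
git --version   # should print a Git version string
# nvidia-smi ships with the NVIDIA driver; if it is missing, check your driver install.
nvidia-smi || echo "nvidia-smi not found - check your NVIDIA driver install"
```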


  1. Download the oobabooga one-click installer and extract it to a location of your choice.
    • a) Double-click the install.bat file and choose NVIDIA GPU when prompted; it will then install all of the requirements it needs to run.
    • b) Run download-model.bat. This lists all of the models you can download right now and have installed automatically for the web UI. Select option A (Facebook's OPT 6.7B), which takes around 12 GB of disk space.
    • c) In the models folder, run:
      git lfs install
      git clone
    • d) Go back to the root folder, where the start-webui.bat file is:
      • Right-click the start-webui.bat file and choose Edit with Notepad.
      • Find the line beginning with call python and append these two arguments: --wbits 4 --groupsize 128
    • e) Go back to the root folder, double-click the start-webui.bat file, and choose the Vicuna model.
  2. Download the Vicuna model by clicking on this link:
    • a) Click on Files and versions, click on the .safetensors file, then click download. This downloads the main 8 GB model file onto your computer.
    • b) Copy and paste the file into the vicuna-13b-GPTQ-4bit-128g folder.
  3. Run start-webui.bat, which will print a local URL; to open it, hold Ctrl and left-click the link. And there you go: a web UI for running your LLMs, absolutely free, on your own computer.
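For reference, after step 1d the edited call python line inside start-webui.bat might look like the sketch below. This assumes the script launches server.py (as the web UI's launcher does); any flags already present in your copy should be kept, with only the two quantization arguments appended:

```
rem Sketch of the edited launch line in start-webui.bat (existing flags may vary)
call python server.py --wbits 4 --groupsize 128
```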