If you're ready to tinker a bit i can recommend Ollama for the backend and Open web UI for the frontend. They can also both run on the same machine.
The advantage is that you can use your GPU to compute, which is a lot faster.
If you're ready to tinker a bit i can recommend Ollama for the backend and Open web UI for the frontend. They can also both run on the same machine.
The advantage is that you can use your GPU to compute, which is a lot faster.