llamafile v0.8.17
llamafiler has a new web UI which supports two modes of operation:
chatbot and raw completion. Its syntax highlighting is just as advanced
as the CLI chatbot. It looks much nicer than the old web ui. In a future
release, llamafiler will be folded into llamafile to replace the old server.
- 988c9ec Introduce raw completions web ui
- 241bf21 Introduce /v1/completions endpoint in new server
- 6d89f8f Add binary safety check to server
- d18ddf1 Add redo button to new web ui
- bc82424 Add settings modal to web ui
- bb917bd Add vision model support to new server
- 4c7b7d5 Implement data URI parser
- fb4b3e6 Fix JSON parser bug
- 9d6f89f Improve look and printability of new web ui
- 25b6910 Make chatbot ui more printer friendly
- 30518ca Respond to HTTP OPTIONS requests
- 41abfa3 Work around multiple image handling
- 35bc088 Make default system prompt configurable on web
- 28c8e22 Scale and decimate images as needed in browser
- 14713b5 Get basic chatbot web gui working in llamafiler
- ef08074 Start porting syntax highlighter to JavaScript
- fdfdb13 Port remaining highlighting code to javascript
The following improvements have been made to our terminal chatbot.
- 12c3761 Make CLI chatbot work better with base models
- e5c0921 Improve VT100 support
- 4b61791 Fix VT102 support
- d25c077 Introduce /upload and /forget commands to chatbot
- 880ebc7 Handle empty system prompt better in cli chatbot
General improvements to this project.
- f581c40 Fix futex prototype
- 54d3c72 Make LLaVA fast again
- 01b8d49 Remove n-gpu-layer limitation (#534)
- 566cdc1 Improve Gemma system prompt generation
- 46284fe Reduce attack surface of stb_image
- 9bb262b Log CUDA kernel vs. runtime versions
Syntax highlighting improvements for chatbot and web ui.
- d979a1c Add BNF syntax highlighting
- 4a8311a Add cmake syntax highlighting
- 40e92cf Add Ocaml syntax highlighting
- 0995343 Add more Clojure keywords
- 0068a37 Make D syntax highlighting better
- 0965a4b Make some markdown improvements
- 9b96502 Improve JS/HTML syntax highlighting
- c0622da Put more work into markdown rendering
- fa1c98f Improve markdown to html rendering
- 8915432 Further improve markdown to html
- d25fa3a Improve highlighting in new web ui
- f5a0bd4 Fix JS regex highlighting issue
- 2807ae6 Improve Ada syntax highlighting
- d30da30 Syntax highlight D properly
- 33a057e Improve Ruby some more
- 5b0fff1 Improve Ruby syntax highlighting
- 8413a21 Fix Ruby builtins in web gui
The latest cosmopolitan upgrade introduces a new more powerful syntax
for your .args files. They're now parsed more similarly to the shell,
with support for C style escaping in double-quoted strings. You can also
now add shell-style comments to .args files too. See tool/args/args2.c
in the cosmopolitan codebase for the definitive reference.
The following example of the new .args file syntax is provided:
# specify model
-m Qwen2.5-Coder-34B-Instruct.Q6_K.gguf
# prevent flags below from being changed
...
# specify system prompt
--system-prompt "\
you are a friendly ai assistant\n
your job is to be helpful and intelligent"
# hide some stuff from user interfaces
--nologo
--no-display-prompt
You can put .args files inside llamafile, llamafiler, and whisperfile
using the zipalign program.
The following screenshots are provided of the llamafiler web ui.