
If loading is slow, enable caching. Caching is enabled by default in most browsers.

CHATBOX


A standalone OpenAI-compatible chat workspace inspired by the llama.cpp server UI. Start from an endpoint, fetch models, tune reasoning and sampling, then chat in a dedicated page instead of inside the tools catalog.

Workspace · Endpoint-first chat

Model discovery, think controls, advanced sampling and optional speed metrics all live in this one tool page.


Standalone chat workspace

Endpoint-first setup, model discovery and advanced controls inspired by the llama.cpp server experience.

Pick an endpoint, then start

  1. Set the API root of an OpenAI-compatible endpoint.
  2. Fetch the model list from /models, or type the model id manually.
  3. Open Controls if you need think or sampling settings, then send your first message.
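A minimal sketch of step 2, model discovery: OpenAI-compatible servers expose `GET {apiRoot}/models` and wrap entries in a `data` array of `{ id }` objects. The function names below are illustrative, not the tool's actual code, and the network call itself is left as a comment.

```typescript
// Build the /models URL from an API root like "http://localhost:8080/v1",
// tolerating a trailing slash on the root.
function modelsUrl(apiRoot: string): string {
  return apiRoot.replace(/\/+$/, "") + "/models";
}

// OpenAI-compatible /models responses look like { data: [{ id: "..." }, ...] }.
function parseModelIds(body: { data?: Array<{ id: string }> }): string[] {
  return (body.data ?? []).map((m) => m.id);
}

// Usage (network call omitted):
//   const res = await fetch(modelsUrl(root));
//   const ids = parseModelIds(await res.json());
```

If the server does not implement /models, the parsed list is simply empty, which is why typing the model id manually remains available as a fallback.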
Optional speed metrics: prompt tokens, output tokens, prefill rate, decode rate, TTFT, and total time.
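The speed metrics above can be derived from three timestamps on a streamed response. This is a sketch under assumed field names, not the tool's actual code; it approximates prefill as everything before the first token arrives.

```typescript
interface StreamTimings {
  startMs: number;       // request sent
  firstTokenMs: number;  // first streamed token arrived
  doneMs: number;        // stream finished
  promptTokens: number;  // tokens in the prompt (prefill)
  outputTokens: number;  // tokens generated (decode)
}

interface SpeedMetrics {
  ttftMs: number;        // time to first token
  totalMs: number;       // end-to-end time
  prefillTokPerSec: number;
  decodeTokPerSec: number;
}

function computeMetrics(t: StreamTimings): SpeedMetrics {
  const ttftMs = t.firstTokenMs - t.startMs;
  const totalMs = t.doneMs - t.startMs;
  const decodeMs = t.doneMs - t.firstTokenMs;
  return {
    ttftMs,
    totalMs,
    // Prefill rate treats the whole TTFT window as prompt processing.
    prefillTokPerSec: ttftMs > 0 ? (t.promptTokens / ttftMs) * 1000 : 0,
    decodeTokPerSec: decodeMs > 0 ? (t.outputTokens / decodeMs) * 1000 : 0,
  };
}
```

Some servers (llama.cpp among them) also return their own timing data, which is more accurate than client-side timestamps when available.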

Controls

Reasoning, Sampling & Advanced

Reasoning

Think & Display

Reasoning visibility, think mode and streaming behavior.
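On the streaming side, OpenAI-compatible servers emit SSE lines of the form `data: {...}` terminated by `data: [DONE]`. A sketch of separating visible text from hidden "think" text per chunk is below; the `reasoning_content` delta field is a llama.cpp/DeepSeek-style assumption, and not every server uses it.

```typescript
interface DeltaChunk {
  choices?: Array<{ delta?: { content?: string; reasoning_content?: string } }>;
}

// Parse one SSE line into visible text and reasoning ("think") text.
// Returns null for non-data lines (comments, keep-alives) and for [DONE].
function splitDelta(line: string): { content: string; reasoning: string } | null {
  if (!line.startsWith("data: ")) return null;
  const payload = line.slice("data: ".length);
  if (payload === "[DONE]") return null;
  const chunk: DeltaChunk = JSON.parse(payload);
  const delta = chunk.choices?.[0]?.delta ?? {};
  return { content: delta.content ?? "", reasoning: delta.reasoning_content ?? "" };
}
```

Keeping the two channels separate is what lets a UI collapse, hide, or stream reasoning independently of the answer text.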

Sampling

Generation Controls

Temperature, limits and sampling knobs for the current model.
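A sketch of how sampling knobs might map onto an OpenAI-style request body. The field names (`temperature`, `top_p`, `max_tokens`, `stop`) are the standard OpenAI-compatible ones, but which of them a given server honors varies; the function name is illustrative.

```typescript
interface SamplingControls {
  temperature?: number;
  topP?: number;
  maxTokens?: number;
  stop?: string[];
}

// Map UI controls onto OpenAI-style request fields. Only knobs the user
// actually set are included, so server-side defaults still apply.
function samplingFields(c: SamplingControls): Record<string, unknown> {
  const body: Record<string, unknown> = {};
  if (c.temperature !== undefined) body.temperature = c.temperature;
  if (c.topP !== undefined) body.top_p = c.topP;
  if (c.maxTokens !== undefined) body.max_tokens = c.maxTokens;
  if (c.stop !== undefined) body.stop = c.stop;
  return body;
}
```

Omitting unset fields, rather than sending zeros or nulls, matters because many servers treat an explicit value as an override of their tuned defaults.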

Advanced

Extra Request Body

Pass vendor-specific fields without changing the core request builder.

This JSON is merged into the final request body after the built-in controls, so you can pass extra vendor-specific fields without editing the application code.
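The "merged after the built-in controls" behavior described above amounts to a shallow merge where the extra JSON wins on key conflicts. A minimal sketch, with illustrative names:

```typescript
// Shallow-merge the extra request body over the built-in fields.
// Spreading `extra` last means its keys override the built-in ones.
function mergeRequestBody(
  builtIn: Record<string, unknown>,
  extra: Record<string, unknown>,
): Record<string, unknown> {
  return { ...builtIn, ...extra };
}

// e.g. adding a vendor-specific knob while overriding temperature:
//   mergeRequestBody(
//     { model: "x", temperature: 0.7 },
//     { temperature: 0.2, repeat_penalty: 1.1 },
//   )
```

Because the merge is shallow, overriding a nested object (such as a vendor's options sub-object) replaces it wholesale rather than merging field by field.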