Skip to content

local/mistralrs

Uni-Xervo support

  • Provider ID: local/mistralrs
  • Feature flag: provider-mistralrs
  • Capabilities: embed, generate

Uni-Xervo provider options

  • isq (string)
  • force_cpu (boolean)
  • paged_attention (boolean)
  • max_num_seqs (integer > 0)
  • chat_template (string)
  • tokenizer_json (string)
  • embedding_dimensions (integer > 0, embed task only)
  • gguf_files (array of strings)

Authoritative Uni-Xervo option schema:

Available models

local/mistralrs delegates model support to the upstream mistral.rs engine.

Authoritative model/support references:

Model configuration references

Use mistral.rs docs for model-family and runtime behavior details.

Uni-Xervo generation API currently exposes:

  • max_tokens
  • temperature
  • top_p

Example catalog entry

{
  "alias": "generate/local",
  "task": "generate",
  "provider_id": "local/mistralrs",
  "model_id": "mistralai/Mistral-7B-Instruct-v0.2",
  "options": {
    "isq": "Q4K",
    "paged_attention": true,
    "max_num_seqs": 8
  }
}