Self-host Magistral on dedicated GPU clusters

Run Mistral's models on bare-metal Kubernetes in EU data centers. Your data never leaves your infrastructure.

Top GPU offerings

Updated 84 days ago

No model/quant candidates pass the quality filter.

Open-weight variant

Magistral Small 1.2

Modality

Text in Image Text out

Context

128k tokens

License

Recommended GPU

Runs on a single RTX 4090 or 32 GB Apple Silicon once quantized

Magistral Reasoning

ID: magistral-small-2509

PaperPlaneTilt a prompt or image to start.

Use Cases

Compare contract clauses vs policy baseline

Compare incoming contracts against your approved clause baseline before a reviewer opens the file. Magistral reasons across the packet, surfaces meaningful deviations with evidence, and produces a reviewer-ready memo instead of a raw extraction dump.

MSA draft

Security addendum

Clause deviation memo

Review recommendation

Keep clause libraries, fallback language, and reviewer notes versioned inside the same boundary so every comparison runs against the same legal baseline.

Contract Store

Policy Baseline Index

Magistral

Review Memo Writer

Deviation Memo

Redlines

Escalation

Workload fit

Not sure this model fits your use case?

The private LLM study maps 29 workloads across six patterns and shows where each model family fits.

Infrastructure

Looking at the GPU and deployment side?

GPU provider options, deployment architecture, and how we manage the serving layer on Kubernetes.

Self-host Magistral on dedicated GPU clusters

Compare contract clauses vs policy baseline

Compare invoice vs PO vs goods receipt

Create reviewer-ready exception packets with linked evidence