🇬🇧UK Sovereign AI

Frontier AI Models
on UK Infrastructure

Argyll Data Development provides secure, UK-hosted inference and hands-on AI enablement – helping organisations deploy Sovereign AI across internal workflows and customer-facing products while staying in control.

Your data stays in the UK. No US jurisdiction. No data retention. No surveillance. Complete operational control.
SambaNova-poweredSN40L RDU infrastructure for high-throughput inference.
API-firstDeveloper-friendly endpoints for agents and product teams.
UK operationsBuilt around UK-hosted infrastructure and support paths.
Model choiceMiniMax, DeepSeek, gpt-oss, and more open-weight options.
UK sovereign AI infrastructure

Frontier AI should not mean foreign jurisdiction.

Most AI infrastructure decisions force a tradeoff: capability on one side, operational control on the other. We're built for teams that need both.

We give UK enterprises a sovereign path to production AI: OpenAI-compatible API access to advanced models, UK-hosted infrastructure for sensitive data, and the control teams need to scale with confidence.

  • Stay in control – with no surprise model deprecations.
  • Keep sensitive data on British soil.
  • Avoid CLOUD Act exposure and foreign jurisdiction risk.

Built for UK data residency, GDPR, the UK Data Act, and sector-specific requirements across finance, healthcare, legal, government, and defence.

Purpose-built AI inference on SambaRack™ SN40Ls

Trusted by SambaNova as their official UK partner, we run your AI workloads on SambaRack™ SN40Ls purpose-built for production inference. Their dataflow architecture is designed to reduce unnecessary data movement by keeping more computation and intermediate data local to the chip, helping lower memory bottlenecks resulting in higher throughput, lower latency, and better energy efficiency than conventional GPU-based approaches for the workloads they are optimized to accelerate.

SambaNova chip
424.8tokens/sec measured for MiniMax M2.7 on SambaNova in the benchmark snapshot.
8.47sfirst-answer latency for MiniMax M2.7 in the same provider comparison.
205kcontext positioning for agent-heavy MiniMax M2.7 workloads.
SN40LSambaNova RDU rack class used as the infrastructure story.

Benchmark snapshot prepared May 27, 2026. Public speeds vary by prompt length, reasoning mode, batching, region, and provider load.

MiniMax M2.7 neural network

The Model

Powered today by MiniMax M2.7

The model powering Argyll's infrastructure today is MiniMax M2.7 – an open-weight frontier model built specifically for agentic workloads: autonomous multi-step tasks, tool use, and long-running agent loops, not just single-turn responses. Because it's open-weight, we can run M2.7 entirely on UK infrastructure, with no licensing tie to a US provider, ensuring your company gets powerful, agent-ready AI under your control, with data and computation fully within UK jurisdiction and no dependency on US-based model APIs.

MiniMax M2.7 is a 230B-parameter sparse mixture-of-experts (MoE) model. Independent analysis from Artificial Analysis confirms M2.7 delivers strong performance on real-world agentic tasks with reduced hallucinations, achieving a GDPval-AA Elo of 1,494.

What that means in production

  • Built for agents – multi-step tool use, autonomous task execution, and sustained agent loops, with a 97% skill-adherence rate on complex multi-step tasks
  • 56.2% on SWE-Pro – one of the toughest real-world software engineering benchmarks, covering bug troubleshooting, code security review, and log analysis
  • 78% on SWE-bench Verified – validating real-world software engineering capability across diverse repositories
  • 205k context window – entire contracts, codebases, or knowledge bases in a single call
  • High-throughput inference on SN40L – sustained performance for real-time agentic applications on SambaRack™ SN40Ls

As the open-weight landscape develops, Argyll's infrastructure will support more models. The sovereign guarantee stays the same; the roster grows.

How it works

One path from pilot to production.

Start with MiniMax on UK-hosted infrastructure, test through a familiar API, then scale into dedicated capacity, hands-on enablement, and sovereign production operations when you're ready.

01

Choose MiniMax

Start with MiniMax for fast agent execution, summarisation, reasoning, and secure business workflows.

02

Change the endpoint

Use OpenAI-compatible chat completion patterns so engineering teams can evaluate quickly without rebuilding existing workflows.

03

Scale with control

Move from testing to production with UK-hosted inference, reserved capacity, hands-on enablement, and enterprise deployment support.

Get started
curl -H "Authorization: Bearer $API_KEY" \
     -H "Content-Type: application/json" \
     -d '{
	"stream": true,
	"model": "MiniMax-M2.7",
	"messages": [
		{
			"role": "system",
			"content": "You are a helpful assistant"
		},
		{
			"role": "user",
			"content": "Hello!"
		}
	]
	}' \
     -X POST https://api.argylldev.ai/v1/chat/completions
from sambanova import SambaNova

client = SambaNova(
    api_key="",
    base_url="https://api.argylldev.ai/v1",
)

response = client.chat.completions.create(
    model="MiniMax-M2.7",
    messages=[{"role":"system","content":"You are a helpful assistant"},{"role":"user","content":"Hello!"}],
    temperature=0.1,
    top_p=0.1
)

print(response.choices[0].message.content)
import gradio as gr

gr.load("MiniMax-M2.7", accept_token=True).launch()

Pricing

Simple, scalable plans.

Start testing with free tokens, move into production on demand, or reserve capacity for high-volume workloads.

Free

Explore sovereign AI inference without a credit card.

  • 500,000 tokens
  • Access to MiniMax M2.7
  • No credit card required
  • Restricted rate limits
  • Community support
Start Free

Developer

Pay as you go for production inference that scales on demand.

  • Pay-per-token pricing
  • Preview model access
  • Standard rate limits
  • Standard support
Start Building

Enterprise

Scale production traffic with volume pricing, dedicated support, and capacity options.

  • Volume-based pricing
  • Access to all production models
  • Dedicated infrastructure options (including reserved capacity)
  • Custom rate limits
  • Priority support and SLA
Talk to Sales

What does sovereign AI inference cost you?

Example workloads

Input tokens £0.60 / M tokens
Output tokens £3.00 / M tokens
Monthly input tokens 100M
1M100M500M1B
Monthly output tokens 10M
1M100M500M1B
Monthly cost
£300
GBP per month
Annual cost
£3,600
GBP per year
Hosted in the UK
Invoiced in GBP
No US jurisdiction

Ready to build on UK-hosted AI infrastructure?

Start with free credits, validate the API, then move into production or reserve dedicated capacity when your workload is ready.