Public roadmap

Execution roadmap.

Built today, hardening now, and planned next. Srasta separates current capability from committed build direction, and every milestone is tied to a deployment/runtime profile we can actually validate.

Roadmap discipline

We do not pitch roadmap items as shipped capability.

The website and pitch deck now align on the same backbone: private inference, agent-registered install, admin operations, governance evidence, and hardware/runtime profiles that move from Apple Silicon developer installs to NVIDIA Linux enterprise pilots before larger HA promises.

Milestones

Where the platform evolves.

The public roadmap mirrors the investor deck: completed foundation, current install-plane hardening, then proof-gated pilots and enterprise expansion.

Jun 2026 Product baseline

Private inference, admin, audit feed, model policy, and security review packet.

Completed

Q3 2026 Install plane foundation

Srasta-Agent registration, Deployment Charter, catalog/license gates, and receipts.

In progress

Q4 2026 Proof-gated pilots

Prompt-to-audit handover, paid pilot package, and SOC 2 readiness path.

Pending

H1 2027 Ops backbone

Srasta-Agent convergence, support bundles, upgrade, rollback, backup, and restore.

Pending

H1-H2 2027 Enterprise profile

Kubernetes, HA/no-planned-downtime posture, and enterprise hardening.

Pending

H2 2027 Bespoke intelligence

Governed tools, MCP server SDLC, and agentic harness support.

Pending

Build detail

What each phase includes.

Available today Design baseline

Private AI foundation

Keep the public promise grounded while we finish the architecture: private inference, admin operations, governed gateway routing, audit visibility, and customer-controlled deployment profiles.

Private inference path for open-weight models on customer-controlled hardware
Gateway foundations for role-aware model access and auditable AI requests
Admin foundations for users, roles, licenses, model access, and runtime visibility
Governance foundations for prompt, model, memory, tool, policy, and admin events
Deployment profile design for developer local, single-node Linux, multi-host Linux, and Kubernetes
Host agent heartbeat, runtime-health, and controlled-action foundations

Building Q3 2026

Apple Silicon developer path

Deliver the simplest real path first: a developer on a Mac can install, run, test, and understand Srasta locally before we expand the enterprise deployment surface.

Apple Silicon local profile using host-native model runtime where practical
Local install plane with agent registration, prechecks, verify, reset, and logs
Developer proof gate: prompt enters Srasta and produces audit-visible evidence
Clear limits on what is local-only versus enterprise-ready
No SSH/SCP operating model after local agent registration
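
The developer proof gate in the list above can be read as a simple check: a prompt submitted under a known request ID must later appear as an audit event. The sketch below is a minimal illustration in Python, not Srasta's actual API. The names `AuditEvent`, `InMemoryAuditFeed`, `submit_prompt`, and `proof_gate` are hypothetical, and an in-memory list stands in for the real audit feed.

```python
from dataclasses import dataclass, field
import uuid


@dataclass(frozen=True)
class AuditEvent:
    # Hypothetical audit record; the field names are assumptions for illustration.
    request_id: str
    kind: str
    model: str


@dataclass
class InMemoryAuditFeed:
    # Stand-in for a real audit feed so the sketch runs locally.
    events: list = field(default_factory=list)

    def record(self, event: AuditEvent) -> None:
        self.events.append(event)

    def find(self, request_id: str) -> list:
        return [e for e in self.events if e.request_id == request_id]


def submit_prompt(feed: InMemoryAuditFeed, prompt: str, model: str = "local-model") -> str:
    """Simulate a prompt entering the platform and emit the audit event for it.

    The prompt text is not sent to any model here; only the audit side is modeled.
    """
    request_id = str(uuid.uuid4())
    feed.record(AuditEvent(request_id=request_id, kind="prompt", model=model))
    return request_id


def proof_gate(feed: InMemoryAuditFeed, request_id: str) -> bool:
    """Pass only if the request left audit-visible evidence in the feed."""
    return any(event.kind == "prompt" for event in feed.find(request_id))
```

In this sketch the gate fails closed: a request ID with no matching audit event does not pass, which is the property a handover check would want.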

Building Q4 2026

NVIDIA Linux enterprise pilot path

Turn the architecture into the first enterprise-grade deployment profile: NVIDIA Linux hardware, private inference, admin onboarding, and hard prompt-to-audit proof gates.

NVIDIA Linux single-node profile with vLLM-backed private inference
Agent registration as the primary install backbone after bootstrap
Hard gates for prompt-to-audit workflow proof before handover
Operator-visible evidence for runtime health, gateway routing, role checks, and audit events
Deployment Charter and Pilot Charter for design partners
Sanitized support package flow for debugging without exposing customer data or PII

Planned H1 2027

Multi-host Linux convergence

Expand from a single enterprise node into multi-host Linux deployments where Srasta agents report the actual state of each node and converge it toward the desired platform state.

Converge desired platform state through registered agents
Inventory, package, runtime, GPU, health, and policy posture reported to the install plane
NVIDIA GPU nodes serve inference while CPU nodes absorb stateful, admin, and observability workloads
Node-level action receipts and rollback evidence for operator review
Stronger release verification and artifact provenance for customer-controlled installs
Support workflow that scrubs customer data before optional diagnostic sharing
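
One way to picture agent-driven convergence with action receipts is a diff between desired and reported state. The sketch below is an assumption-laden illustration, not the Srasta install plane: `Action`, `plan_convergence`, and `apply_actions` are hypothetical names, and state is modeled as plain nested dictionaries keyed by node name.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Action:
    # Hypothetical change record: one setting on one node.
    node: str
    key: str
    current: object
    target: object


def plan_convergence(desired: dict, reported: dict) -> list:
    """Diff desired vs. reported state per node and return the actions needed."""
    actions = []
    for node, wanted in sorted(desired.items()):
        seen = reported.get(node, {})
        for key, target in sorted(wanted.items()):
            current = seen.get(key)
            if current != target:
                actions.append(Action(node, key, current, target))
    return actions


def apply_actions(reported: dict, actions: list) -> list:
    """Apply actions to the in-memory reported state and return one receipt each.

    Each receipt keeps the previous value, which is what a rollback would need.
    """
    receipts = []
    for action in actions:
        reported.setdefault(action.node, {})[action.key] = action.target
        receipts.append(
            {
                "node": action.node,
                "key": action.key,
                "from": action.current,
                "to": action.target,
            }
        )
    return receipts
```

Running the planner again after applying its actions should return an empty list, which is a convenient convergence check in a design like this.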

Planned H1-H2 2027

Kubernetes and HA operations

Harden multi-host and Kubernetes deployment paths toward no-planned-downtime operations. A formal five-nines SLA remains customer-dependent and validation-dependent.

Kubernetes operator and upgrade posture for enterprise deployment profiles
HA-oriented topology guidance for gateway, inference, admin, governance, memory, and audit layers
No-planned-downtime upgrade patterns where architecture and customer infrastructure allow it
Disaster recovery, restore drills, rollback drills, and proof artifacts
Formal SLA and five-nines commitments only after measured validation with paying customers
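
For context on why five-nines commitments wait for measured validation: 99.999% availability leaves roughly 5.26 minutes of total downtime in a 365-day year. The helper below is a small illustrative calculation; the function name `downtime_budget_minutes` is hypothetical and not part of any Srasta tooling.

```python
def downtime_budget_minutes(availability: float, days: float = 365.0) -> float:
    """Return the allowed downtime in minutes for an availability target.

    availability is a fraction, for example 0.99999 for five nines.
    """
    if not 0.0 <= availability <= 1.0:
        raise ValueError("availability must be between 0 and 1")
    minutes_in_period = days * 24 * 60
    return (1.0 - availability) * minutes_in_period
```

By the same arithmetic, a 99.9% target allows about 525.6 minutes (about 8.76 hours) per year, which shows how much tighter the operational budget becomes at each added nine.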

Planned H2 2027

Bespoke intelligence layer

Make Srasta customizable without losing governance: custom tools, MCP servers, agentic harnesses, and approval SDLC tied back to audit and policy.

Governed custom tool registry and approval lifecycle
MCP server onboarding, review, approval, execution policy, and audit trail
Agentic harness support for customer workflows that need planning, tool use, and operator guardrails
Evaluation views for prompt quality, memory behavior, tool behavior, and compliance-rule outcomes
Expansion from one governed workflow into a customer-specific private AI operating layer

Customer-driven Future

Specialized enterprise requirements

These are not day-one promises. We will scope and price them when a paying customer has the requirement and the environment to validate it.

Air-gapped environments and offline artifact logistics
Windows node support
Vast.ai or similar GPU-marketplace customer installs
Formal five-nines SLA backed by measured production evidence
Advanced multi-tenancy and sector-specific compliance automation
AMD, Intel, and specialized edge AI hardware, once there is customer demand and an environment to validate it

Design partner fit

Have a workflow that should shape the roadmap?

The best roadmap input is a real environment, a real governance boundary, and a measurable workflow. We capture requests in the same CRM-backed funnel as pilot and product-update interest.
