Private inference, admin, audit feed, model policy, and security review packet.
CompletedPublic roadmap
Execution roadmap.
Built today, hardening now, and planned next. Srasta separates current capability from committed build direction, and every milestone is tied to a deployment/runtime profile we can actually validate.
We do not pitch roadmap items as shipped capability.
The website and pitch deck now align on the same backbone: private inference, agent-registered install, admin operations, governance evidence, and hardware/runtime profiles that move from Apple Silicon developer installs to NVIDIA Linux enterprise pilots before larger HA promises.
Milestones
Where the platform evolves.
The public roadmap mirrors the investor deck: completed foundation, current install-plane hardening, then proof-gated pilots and enterprise expansion.
Srasta-Agent registration, Deployment Charter, catalog/license gates, and receipts.
In progressPrompt-to-audit handover, paid pilot package, and SOC 2 readiness path.
PendingSrasta-Agent convergence, support bundles, upgrade, rollback, backup, and restore.
PendingKubernetes, HA/no-planned-downtime posture, and enterprise hardening.
PendingGoverned tools, MCP server SDLC, and agentic harness support.
PendingBuild detail
What each phase includes.
Private AI foundation
Keep the public promise grounded while we finish the architecture: private inference, admin operations, governed gateway routing, audit visibility, and customer-controlled deployment profiles.
- Private inference path for open-weight models on customer-controlled hardware
- Gateway foundations for role-aware model access and auditable AI requests
- Admin foundations for users, roles, licenses, model access, and runtime visibility
- Governance foundations for prompt, model, memory, tool, policy, and admin events
- Deployment profile design for developer local, single-node Linux, multi-host Linux, and Kubernetes
- Host agent heartbeat, runtime-health, and controlled-action foundations
Apple Silicon developer path
Deliver the simplest real path first: a developer on a Mac can install, run, test, and understand Srasta locally before we expand the enterprise deployment surface.
- Apple Silicon local profile using host-native model runtime where practical
- Local install plane with agent registration, prechecks, verify, reset, and logs
- Developer proof gate: prompt enters Srasta and produces audit-visible evidence
- Clear limits on what is local-only versus enterprise-ready
- No SSH/SCP operating model after local agent registration
NVIDIA Linux enterprise pilot path
Turn the architecture into the first enterprise-grade deployment profile: NVIDIA Linux hardware, private inference, admin onboarding, and hard prompt-to-audit proof gates.
- NVIDIA Linux single-node profile with vLLM-backed private inference
- Agent registration as the primary install backbone after bootstrap
- Hard gates for prompt-to-audit workflow proof before handover
- Operator-visible evidence for runtime health, gateway routing, role checks, and audit events
- Deployment Charter and Pilot Charter for design partners
- Sanitized support package flow for debugging without exposing customer data or PII
Multi-host Linux convergence
Expand from a single enterprise node into multi-host Linux deployments where Srasta agents report truth and converge the desired platform state.
- Converge desired platform state through registered agents
- Inventory, package, runtime, GPU, health, and policy posture reported to the install plane
- NVIDIA GPU nodes serve inference while CPU nodes absorb stateful, admin, and observability workloads
- Node-level action receipts and rollback evidence for operator review
- Stronger release verification and artifact provenance for customer-controlled installs
- Support workflow that scrubs customer data before optional diagnostic sharing
Kubernetes and HA operations
Harden multi-host and Kubernetes deployment paths toward no-planned-downtime operations. Formal five-nines SLA remains customer and validation dependent.
- Kubernetes operator and upgrade posture for enterprise deployment profiles
- HA-oriented topology guidance for gateway, inference, admin, governance, memory, and audit layers
- No-planned-downtime upgrade patterns where architecture and customer infrastructure allow it
- Disaster recovery, restore drills, rollback drills, and proof artifacts
- Formal SLA and five-nines commitments only after measured validation with paying customers
Bespoke intelligence layer
Make Srasta customizable without losing governance: custom tools, MCP servers, agentic harnesses, and approval SDLC tied back to audit and policy.
- Governed custom tool registry and approval lifecycle
- MCP server onboarding, review, approval, execution policy, and audit trail
- Agentic harness support for customer workflows that need planning, tool use, and operator guardrails
- Evaluation views for prompt quality, memory behavior, tool behavior, and compliance-rule outcomes
- Expansion from one governed workflow into a customer-specific private AI operating layer
Specialized enterprise requirements
These are not day-one promises. We will scope and price them when a paying customer has the requirement and the environment to validate it.
- Air-gapped environments and offline artifact logistics
- Windows node support
- Vast.ai or similar GPU-marketplace customer installs
- Formal five-nines SLA backed by measured production evidence
- Advanced multi-tenancy and sector-specific compliance automation
- AMD, Intel, and specialized edge AI machines after demand and validation
Design partner fit
Have a workflow that should shape the roadmap?
The best roadmap input is a real environment, a real governance boundary, and a measurable workflow. We capture requests in the same CRM-backed funnel as pilot and product-update interest.