Installation Process
Srasta's installer turns infrastructure setup into a guided workflow: validate the machine, detect hardware, choose the right topology, configure inference and access, deploy the stack, and verify that the runtime is ready before operators start using it.
curl -fsSL https://get.srasta.ai | sh
This command starts the installer UI on port 9000. The wizard performs readiness checks before deployment begins.
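Because the installer UI binds port 9000, it can help to confirm that nothing else is already listening there before running the command. The following is a minimal sketch, assuming the `ss` tool from iproute2 is available on the host. The helper name `port_listed` is hypothetical and is not part of Srasta.

```sh
#!/bin/sh
# port_listed PORT: reads `ss -ltn`-style output on stdin and succeeds
# if any listening socket's local address ends in :PORT.
# (Hypothetical helper for illustration; not shipped with the installer.)
port_listed() {
    awk -v port="$1" '
        # Column 4 of `ss -ltn` is the local address, e.g. 0.0.0.0:9000 or [::]:9000.
        $1 == "LISTEN" {
            n = split($4, parts, ":")
            if (parts[n] == port) found = 1
        }
        END { exit found ? 0 : 1 }
    '
}

# Example use on a live host (left commented so the sketch has no side effects):
# if ss -ltn | port_listed 9000; then
#     echo "port 9000 is already in use; free it before starting the installer"
# fi
```

Parsing is kept separate from the `ss` call so the check can be exercised with captured output as well as on a live machine.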
Flow
The installer is a deterministic state machine, not a loose chat flow. Each screen captures one operator decision, records the result, and carries that context forward into validation, placement, deployment, and day-2 advisories.
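To make the "deterministic state machine" idea concrete, here is a small sketch in which each stage has exactly one successor and every visited stage is recorded. The stage names (`intent`, `topology`, `checks`, `configure`, `deploy`, `complete`) and both function names are assumptions chosen for illustration; the installer's real internal states are not documented here.

```sh
#!/bin/sh
# next_stage CURRENT: prints the single stage that follows CURRENT.
# Stage names are illustrative, not the installer's actual identifiers.
next_stage() {
    case "$1" in
        intent)    echo topology ;;
        topology)  echo checks ;;
        checks)    echo configure ;;
        configure) echo deploy ;;
        deploy)    echo complete ;;
        *)         echo "unknown stage: $1" >&2; return 1 ;;
    esac
}

# run_wizard: walks the stages in order and prints the recorded path.
# A real wizard would store the operator's decision at each step;
# this sketch records only the stage names.
run_wizard() {
    stage=intent
    log=""
    while [ "$stage" != complete ]; do
        log="$log $stage"
        stage=$(next_stage "$stage") || return 1
    done
    echo "${log# }"
}
```

Because the transition table is a plain `case` statement, the same input always yields the same path, which is the property that distinguishes this design from a free-form conversational flow.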
Before You Start
The installer can detect hardware, Docker readiness, ports, stale Srasta state, GPU availability, and service health. It still needs the operator to supply infrastructure access and business intent.
Host and runtime: Linux host, Docker Engine 24+, Docker Compose v2, at least 16 GB RAM, and at least 50 GB free disk.
License: Trial installs can validate the license after deployment. Production and HA intents should validate the license up front.
Inference: CPU-only for evaluation, local GPU for private inference, or external inference where policy allows it.
Access: LAN-only, public nginx/TLS, or Cloudflare Tunnel, depending on who needs to reach the installation.
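The host and runtime minimums above can be checked by hand before launching the wizard. The sketch below is one possible approach, not the installer's own preflight logic. It assumes a Linux host with `/proc/meminfo`, measures free disk on `/` (Docker's data directory may live elsewhere), and uses decimal gigabytes because the kernel reports slightly less memory than is physically installed. The function names are hypothetical.

```sh
#!/bin/sh
# meets_minimum ACTUAL REQUIRED: succeeds when ACTUAL >= REQUIRED (integers).
meets_minimum() { [ "$1" -ge "$2" ]; }

# version_major VERSION: prints the major component of 24.0.7 or v2.24.5.
version_major() {
    v=${1#v}            # drop an optional leading "v"
    echo "${v%%.*}"     # keep everything before the first dot
}

# check_host: gathers live values and reports each unmet requirement.
# Returns non-zero if any requirement from this guide is not met.
check_host() {
    status=0
    ram_gb=$(awk '/^MemTotal:/ { printf "%d", $2 / 1000000 }' /proc/meminfo)
    disk_gb=$(df -Pk / | awk 'NR == 2 { printf "%d", $4 / 1000000 }')
    engine=$(docker version --format '{{.Server.Version}}' 2>/dev/null)
    compose=$(docker compose version --short 2>/dev/null)

    meets_minimum "$ram_gb" 16 ||
        { echo "RAM: ${ram_gb} GB, need 16"; status=1; }
    meets_minimum "$disk_gb" 50 ||
        { echo "Disk: ${disk_gb} GB free on /, need 50"; status=1; }
    meets_minimum "$(version_major "${engine:-0}")" 24 ||
        { echo "Docker Engine: ${engine:-not found}, need 24+"; status=1; }
    meets_minimum "$(version_major "${compose:-0}")" 2 ||
        { echo "Docker Compose: ${compose:-not found}, need v2"; status=1; }
    return "$status"
}

# Example use on a live host:
# check_host && echo "host meets the documented minimums"
```

The installer performs its own readiness checks, so a script like this is only a quick way to catch an undersized machine before starting.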
Wizard Stages
Intent: Trial, dev, staging, production, or production HA + DR. This determines warnings, license gates, and topology fit.
Topology: Single machine, guided multi-host, or existing Kubernetes cluster. Invalid combinations are warned, not hidden.
Checks: Docker, disk, stale containers, port conflicts, GPU/VRAM, OS, architecture, SSH, and Kubernetes capability checks.
Configuration: Model selection, local or external inference, secrets provider, add-ons, identity, domain, and ingress choices.
Deployment: Preflight, image pulls, service placement, model preparation, health checks, smoke verification, and Admin URL handoff.
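The "warned, not hidden" behavior for intent and topology combinations could look something like the sketch below. The two rules shown are assumptions drawn from this page (HA implies more than one host, and production usually runs on multi-host or Kubernetes). The identifiers `production-ha`, `production`, and `single`, and the function name `topology_warning`, are hypothetical.

```sh
#!/bin/sh
# topology_warning INTENT TOPOLOGY: prints a warning for a questionable
# combination, prints nothing otherwise, and always returns 0 so the
# operator can still proceed (the choice is warned, not blocked).
topology_warning() {
    intent=$1
    topology=$2
    case "$intent:$topology" in
        production-ha:single)
            echo "warning: HA + DR cannot be provided by a single machine" ;;
        production:single)
            echo "warning: production usually runs on multi-host or Kubernetes" ;;
    esac
    return 0
}

# Example:
# topology_warning production-ha single
```

Returning success in every case keeps the decision with the operator while still surfacing the mismatch.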
Topologies
Single machine (fastest path): Best for trials, prototypes, demos, and one-team evaluation. Everything runs on one Linux host.
Guided multi-host (common production shape): Control plane on one host, with inference and supporting services placed across worker nodes. SSH bootstrap is handled by the installer.
Existing Kubernetes cluster (platform-team friendly): Uses kubectl and Helm against an existing cluster. The installer validates namespace, storage class, RBAC, nodes, GPU capacity, and ingress readiness.
After Install
The installer should not declare success just because containers started. Srasta treats deployment as complete only when operators have health visibility, access to Admin, a validated gateway, and a path to troubleshoot or reset safely.
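The difference between "containers started" and "runtime is ready" can be illustrated with a small retry loop that keeps probing until a health check passes or a retry budget runs out. This is a generic sketch, not the installer's verification code. The helper name `wait_until` is hypothetical, and the URL in the commented example is a placeholder, since this page does not document Srasta's health endpoints.

```sh
#!/bin/sh
# wait_until TRIES DELAY CMD...: runs CMD up to TRIES times, sleeping DELAY
# seconds between attempts. Succeeds as soon as CMD succeeds and fails
# once the attempts are exhausted.
wait_until() {
    tries=$1
    delay=$2
    shift 2
    while [ "$tries" -gt 0 ]; do
        if "$@"; then
            return 0
        fi
        tries=$((tries - 1))
        if [ "$tries" -gt 0 ]; then
            sleep "$delay"
        fi
    done
    return 1
}

# Example (placeholder URL, substitute the real health endpoint):
# wait_until 30 2 curl -fsS "http://localhost:PORT/HEALTH_PATH" >/dev/null \
#     || echo "service did not become healthy in time"
```

Any command can serve as the probe, so the same loop works for an HTTP health check, a gateway request, or a container health status query.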
FAQ
What does the Srasta installer do?
The Srasta installer runs a browser-based wizard that checks host readiness, detects hardware, recommends a deployment topology, configures inference and access, deploys services, and verifies health.
Can Srasta be installed on a single machine?
Yes. A single-node install is the fastest path for trials, prototypes, and team evaluations. Production deployments usually use multi-host or Kubernetes for better separation and availability.
Is Kubernetes required?
No. Srasta supports Docker Compose for single-node and guided multi-host deployments, plus Kubernetes/Helm for customers that already operate clusters.
Can Srasta run without a GPU?
Yes. CPU-only inference is supported for low-concurrency evaluation. GPU-backed local inference is recommended for serious enterprise workloads.
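For operators deciding between CPU-only and local GPU inference, a quick manual check for a usable GPU might look like the sketch below. It assumes NVIDIA hardware and relies on `nvidia-smi -L`, which lists detected GPUs one per line. Other vendors would need a different probe. The function name `inference_mode` and its output labels are hypothetical, and the installer's own GPU detection may work differently.

```sh
#!/bin/sh
# inference_mode: prints "local-gpu" when nvidia-smi is installed and lists
# at least one GPU, otherwise prints "cpu-only".
# (NVIDIA-only assumption; labels are illustrative.)
inference_mode() {
    if command -v nvidia-smi >/dev/null 2>&1 &&
       nvidia-smi -L 2>/dev/null | grep -q '^GPU '; then
        echo local-gpu
    else
        echo cpu-only
    fi
}

# Example:
# echo "suggested inference mode: $(inference_mode)"
```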
Start Small, Install Cleanly
Begin with one clear use case, confirm topology and access, then expand once the control plane, inference path, memory boundary, and operator workflow are verified.