Peer-to-Peer AI

Pistachio

Direct-to-provider AI inference.
No middleman. No broker tax.

Built for capex stability and the lowest cost AI inference available today.

Why We Built It

DRM3 is always looking for arbitrage on AI pricing.

We started with Venice DIEM when it was $80. Today it is $900. Price asymmetry in AI inference is real and constant. Morpheus is pound-for-pound the least expensive option right now. Roughly 30% less per DIEM equivalent than Venice today. But the Morpheus gateway routes through a third-party AI broker who takes a cut.

We built Pistachio to remove the broker entirely. True peer-to-peer: your machine connects directly to the AI provider. One less hop resolves latency and security concerns, and the broker tax disappears. The result: MOR staking for inference per DIEM costs roughly 60% less than Venice DIEM today.

Pistachio replaces the Morpheus consumer node. We rebuilt the Lumerin node for maximum savings and added user-friendly features: auto-reclaim and auto-staking to renew sessions without manual intervention. Then we added multi-tenant gateway key management to the inference API. One app replaces the consumer node and gives anyone the ability to be their own gateway provider, for themselves or others.

One command: pistachio serve

The Economics

MOR is collateral, not payment. Lock MOR to open an inference session. Your full stake returns when the session closes. Providers are paid by Morpheus protocol emissions, not from your deposit. Your only real cost is ETH gas (~$0.01 per cycle on Base).

Venice DIEM

Centralized, dollar-per-day pricing

baseline

Morpheus Gateway

Stake-based, but routed through a broker

~30% less

Pistachio (direct)

Peer-to-peer, no broker, no tax

~60% less

Morpheus DIEM equivalent is fluid (stake-based) compared to Venice's fixed dollar-per-day. Percentages reflect current pricing and may shift.

How It Works

Wallet = Identity

Every request is signed with your wallet's derived key using EIP-712. The network recovers your identity from the signature. No passwords, no tokens, no shared secrets. Cryptographic identity on every call.

NFT-Gated Access

Capacity is determined by Pistachio Pass NFTs on Base. Claim a free pass for 20 MOR. Upgrade to Bronze, Silver, Gold, or Diamond for higher limits. Larger stakes mean longer sessions, fewer transactions, lower gas.

Provenance on Every Response

Every inference result carries a signed receipt: the model version, the provider, the timestamp, the session. The same provenance protocol that powers the rest of DRM3. You can verify what model answered and who provided it.

OpenAI-Compatible API

Pistachio exposes a standard /v1/chat/completions endpoint. Any tool that works with OpenAI works with Pistachio. Point your existing code at localhost:19377 and go.

50+ Models

Every size class. Flash to frontier.

Small / Flash

LLaMA 3.2 3B, Qwen 3.5 9B, DeepSeek V4 Flash, GLM 4.7 Flash, Gemma 4 26B/31B. Fast inference for classification, extraction, tagging, and bulk work.

Mid / Large

LLaMA 3.3 70B, Mistral 3.1 24B, Qwen 3 Next 80B, GPT-OSS 120B, DeepSeek V4 Pro, Qwen 3 235B, Kimi K2.6, Hermes 3 405B. Quality reasoning and general-purpose.

Specialized

Qwen 3 Coder 480B (code generation), BGE-M3 (embeddings), TEE-secured models on Secret Network (hardware-verified inference). Most models have :web variants with search and tool capabilities.

New providers register models continuously. The marketplace is live and growing.

Server-Ready

Runs Headless on Any Linux Server

No GUI required. Run pistachio serve in a systemd unit or Docker container. It auto-restakes expiring sessions with zero manual intervention. Used in production on Fly.io and bare-metal Linux.

Parallel Request Capacity

Each staked session provides one concurrent request lane. Open multiple sessions for parallel inference. The built-in plan manager handles this automatically based on your staking template.

Minimal Capital Exposure

Sessions last minutes to hours, not days. MOR returns immediately on close. Use a dedicated low-balance hot wallet with only the MOR needed for staking. Price volatility risk is minimal at these durations.

Pistachio reads all data directly from Base-chain smart contracts via RPC. No intermediate cache, no API dependency, no third-party data layer. Ground truth is the blockchain, and Pistachio reads it in real time — sessions, marketplace bids, provider endpoints, balances, and NFT capacity.

For a standalone Morpheus block explorer, see Cashew.

Access Passes

Your NFT is your capacity tier.

Pistachio uses consumer passes (ERC-721 NFTs on Base) to determine how much inference capacity you get. Claim a free pass by staking 20 MOR. Upgrade to higher tiers for more capacity, longer sessions, and lower per-request gas costs. Passes are additive. Hold multiple to stack capacity.

Free

20 MOR

Bronze

+200 MOR

Silver

+1K MOR

Gold

+5K MOR

Diamond

+20K MOR

Claim your pass

Get Started

Install Pistachio in one line.

Mac - Desktop app + CLI

$ brew tap drm3labs/drm3 && brew install --cask pistachio-desktop

Linux / WSL2

$ curl -fsSL https://drm3.network/install | sh

macOS (Apple Silicon + Intel), Linux, and Windows via WSL2.

Essential cookies only. No tracking. Privacy