AI4Science User Manual — Physics World Model

⚛ Physics World Model NUMFOCUS · NONPROFIT

AI4Science · User Manual View source on GitHub ↗

# AI4Science User Manual *How to get your first PWM, run the six agents, and earn while you use them.* *(2026-06-10 · updated 2026-06-11: `ai4science update` one-word upgrade, full bordered TUI for all agents)* > **New here? Start with the step-by-step walkthrough → [/start](/start)** — it > takes you through the whole loop with one concrete example (the CASSI > principle): get your first PWM on the website, install and use AI4Science, then > earn PWM back with `/feedback`. This manual is the full reference. AI4Science is a Claude-Code-style agent for science. You pay for usage in **PWM** (the Physics World Model token) — not with API keys, not with a credit card. The LLMs (Claude, GPT, Gemini, …) are served by the founder provider accounts; your PWM balance on **physicsworldmodel.org** is debited a few *thousandths* of a PWM per turn. **The loop in one line:** > **Mine your first PWM on the website → log the CLI in with one browser > approval → use the agents → improve them (feedback, tools, solutions) → > PWM flows back to you automatically.** **Where the real earning is:** website mining pays **≈2 PWM at LLM-accept plus 0.5–10 PWM at founder promotion** per artifact — it bootstraps you. **Improving the agents is the primary track:** feedback instantly refunds (roughly) your next block of usage while an agent is young, and contributed tools/solutions earn weekly from 4,000,000 PWM of agent pools for as long as others use them — which can out-earn website mining by orders of magnitude (full math: [AGENT_IMPROVEMENT_EARNING_METHOD.md](https://github.com/integritynoble/AI4Science/blob/main/docs/AGENT_IMPROVEMENT_EARNING_METHOD.md)). Your wallet **private key is never needed and never asked for** — anywhere. If anything ever asks you for it, it is not us. (Why this is safe: [PWM_LEDGER_SAFETY_AND_ONCHAIN_SETTLEMENT_QA_2026-06-10.md](https://github.com/integritynoble/pwm/blob/main/pwm-team/doc/PWM_LEDGER_SAFETY_AND_ONCHAIN_SETTLEMENT_QA_2026-06-10.md) in the pwm repo.) --- ## Step 1 — Mine your first PWM on physicsworldmodel.org (the bootstrap) Everyone starts here — but note this is the *bootstrap*, not the main earning track (that's improving the agents, Steps 4–5). PWM is **earned by contributing physics**, on [https://physicsworldmodel.org](https://physicsworldmodel.org): 1. **Create an account** — sign in with your wallet (SIWE: your wallet signs a login challenge in your browser; the key never leaves it) or with email + password, then bind your wallet address on the account page. The address is where your earnings ultimately settle on-chain. 2. **Submit one of the four artifact types** (site forms, or the `/cli` browser terminal which includes free drafting assistance): - **Principle** — a physical law/identity with its falsifiable statement - **Digital Twin** — a forward model of a real instrument/setup - **Benchmark** — a task with data + metric on a digital twin - **Solution** — an algorithm that solves a benchmark 3. **Two-stage reward (Option 4, 2026-06-10).** When the automatic S1–S4 physics gates ACCEPT your submission you get a **bootstrap reward ≈ 2 PWM** instantly (from the protocol equation `reward = 0.0001 × (20,000 − already_paid)` — early accepts pay the most; max ~6 PWM/day). When a **founder promotes** it to mainnet you receive your full **A(t) emission share** (currently clamped to 0.5–10 PWM per artifact) — human eyes stand between plausible text and the real payout. **One accepted artifact funds hundreds of agent turns** — a typical turn costs 0.002–0.005 PWM, so the ~2 PWM bootstrap alone ≈ 400–1,000 turns, before the promotion payout. ## Step 2 — Connect the CLI (one browser approval, no keys) Install and log in: ```bash # one line, like Claude Code: curl -fsSL https://physicsworldmodel.org/install.sh | bash ai4science login # browser approval — like `claude login` ``` **Windows** (PowerShell — like Claude Code's Windows installer): ```powershell irm https://physicsworldmodel.org/install.ps1 | iex ``` (WSL users: use the Linux one-liner inside WSL. Equivalent manual install on any OS — no git required: `pip install --user "pwm-ai4science[claude] @ https://github.com/integritynoble/AI4Science/archive/refs/heads/main.zip"` — on Debian/Ubuntu add `--break-system-packages` (PEP 668); to UPGRADE an existing install add `--force-reinstall --no-cache-dir` (pip caches the GitHub zip by URL). The one-line installer above handles all of this automatically.) > **If your network blocks physicsworldmodel.org** (some institutional > networks, e.g. hospital/university filters, block newly registered > domains): everything also works from GitHub, which institutions rarely > block. > 1. **Install from GitHub directly:** > `curl -fsSL https://raw.githubusercontent.com/integritynoble/AI4Science/main/scripts/install.sh | bash` > (Windows: `irm https://raw.githubusercontent.com/integritynoble/AI4Science/main/scripts/install.ps1 | iex`) > 2. **Point the CLI at a mirror backend** for login/billing: > `ai4science login --base <mirror-url>` or `export PWM_BASE=<mirror-url>` > — every command honors it. (Ask in the community channel for the current > mirror, or request your IT department whitelist physicsworldmodel.org.) > 3. The manual itself is mirrored on GitHub: > `AI4Science/docs/AI4SCIENCE_USER_MANUAL.md`. `ai4science login` prints a short code and opens `https://physicsworldmodel.org/cli-auth?code=XXXX-XXXX`. Log in there (if you aren't already), check the code matches your terminal, click **Approve**. The CLI receives a **revocable API key** — never your password, never your wallet key — and stores it at `~/.config/ai4science/pwm_account.json` (permissions 0600). ```bash ai4science whoami # shows your account + bound wallet export AI4SCIENCE_PWM_GATE=1 # turn billing on (off by default) ``` To disconnect: `ai4science logout` (and revoke the key on your account page). **Upgrading — one word, like `claude update`:** ```bash ai4science update ``` It detects how you installed (installer venv / pipx / Debian system python) and runs the right pip itself — including `--break-system-packages` on Debian/Ubuntu PEP 668 hosts and `--force-reinstall --no-cache-dir` always (pip caches the GitHub zip by URL, so a naive upgrade silently reinstalls the stale build). It prints the new version when done. **Restart any running `ai4science chat` sessions afterwards** — a live process keeps the old code. (Builds older than 0.5.1 don't have `update` yet: upgrade once with `pip install --user --break-system-packages --force-reinstall --no-cache-dir "pwm-ai4science[claude] @ https://github.com/integritynoble/AI4Science/archive/refs/heads/main.zip"`, or just re-run the one-line installer — then it's `ai4science update` forever.) ## Step 3 — Use the agents Six agents, one balance, one command: ```bash ai4science chat --mode <agent> ``` | Agent (`--mode`) | What it does | ~PWM/turn | |---|---|---| | `unified-LLM` | General assistant across Claude / GPT / Gemini (switch with `/model`) | ~0.002 | | `research` | Science research harness with read-only access to the PWM registry & solutions | ~0.005 | | `paper` | Peer-review panel (3 reviewers + area chair/editor) with **venue simulation — 39 journals + 12 conferences** (Nature family incl. Machine Intelligence, Science, Cell, NEJM, Lancet, TPAMI… plus the computational-imaging field: IEEE TCI, Optics Express/Letters, Applied Optics, BOE, SIAM Imaging, Inverse Problems, MedIA, MRM · CVPR, ECCV, NeurIPS, ICCP, ICIP, ISBI, COSI…) | ~0.003 | | `claude-code` | The **real Claude Code engine** (claude-agent-sdk: its own system prompt, todos, plan mode, CLAUDE.md memory) **+ PWM GPU providers** (dispatch jobs to the sub-GPU server — stock Claude Code can't) | ~0.002 | | `codex` | The **real OpenAI codex engine** (codex CLI: its prompts, AGENTS.md memory, apply_patch/shell) **+ PWM GPU providers** via MCP (full-trust mode) | ~0.0001 | | `computational-imaging` | CASSI/imaging specialist: forward checks, GPU dispatch, result evaluation | ~0.005 | Anthropic-backed turns are served by **Claude Fable 5** first (fallback chain: Fable 5 → Opus 4.8 → Sonnet 4.6 → GPT-5.5 → Gemini). Inside a chat, `/mode` switches agents and `/model` switches LLM brands. **The terminal experience (all six agents):** the REPL has Claude Code's feel — ↑/↓ command history, the pulsing coral working star while the agent thinks, `⏺ Bash(...)` tool lines with `⎿` result summaries, clean exits (`/exit`, Ctrl-D, Ctrl-C twice). For the **full Anthropic-style bordered TUI**, set one environment variable: ```bash AI4SCIENCE_TUI=full ai4science chat --mode claude-code # full-screen app: # output pane + coral bordered input box ╭─ ai4science · claude-code ─╮ # + persistent status bar (✶ working… · model · cwd) AI4SCIENCE_TUI=1 ai4science chat --mode research # bordered box only ai4science chat --mode paper # plain line-REPL (default) ``` > **Real engines (2026-06-10):** `claude-code` and `codex` run the **genuine > products** — the claude-agent-sdk and the codex CLI — so the experience > matches the originals exactly (their prompts, todos/AGENTS.md memory, plan > mode, session resume), plus the PWM layer on top. One codex caveat: its GPU > tools need full-trust mode (`--yes`, or `AI4SCIENCE_CODEX_GPU=1`) because of > an upstream codex limitation (openai/codex #24135). Either way, a **paid** > GPU dispatch always needs your separate, explicit confirmation — an > interactive `[y/N]` prompt, or `AI4SCIENCE_COMPUTE_AUTOCONFIRM=1` in > scripts. Both fall back to the native harness if the engine/login is absent. > **Venue-simulated peer review (the `paper` agent, 2026-06-10):** rehearse > your manuscript against a real venue's standards before submitting. Inside > `--mode paper`: > > ``` > use paper_review on mypaper.pdf with venue "nature" and depth "deep" > ``` > > - **51 venues:** 39 journals (Nature, Science, Cell, Nature Communications, > **Nature Machine Intelligence**, NEJM, The Lancet, TPAMI, Optica, PRX… and > the computational-imaging field: **IEEE TCI**, Optics Express/Letters, > Applied Optics, BOE, SIAM Imaging Sciences, Inverse Problems, MedIA, MRM) > + 12 conferences (CVPR, ECCV, ICCV, NeurIPS, ICML, ICLR, MICCAI, SIGGRAPH, > **ICCP, ICIP, ISBI, COSI**). Casual aliases work: "NMI", "TCI", "OE", > "PAMI", "nature comms"… > - **Depth:** `shallow` = 1 reviewer (quick take); `deep` = 3 reviewers > (novelty / technical soundness / clarity) **plus** the venue's decider — > a handling **Editor** for journals (accept / minor_revision / > major_revision / reject) or an **Area Chair** for conferences > (accept / borderline / reject). > - Every reviewer judges **by that venue's bar** (Nature: "major conceptual > advance of broad interest, ~8% acceptance"; CVPR: "strong baselines, SOTA > tables, ablations"). The review bundle (JSON + Markdown) records the venue. > - All 51 venues live-tested 2026-06-10 (51/51 pass). It simulates each > venue's standards and style — a pre-submission rehearsal, not a prediction > of real editorial outcomes. Every turn debits your ledger and is split **90% to the LLM provider / 10% to the mining pool** — all visible in your transaction history. ## Step 4 — Earn while you use: feedback sustains your usage **The sustenance loop.** Spend your PWM using an agent; when you've nearly run out, tell us what you found — each accepted `/feedback` pays an **instant PWM reward that refills a shrinking runway of further usage**: ``` /feedback the dispatch step was confusing — suggest showing the queue position ``` How it works, end to end: - **When it unlocks:** only when you have **actually used the agent** AND your **balance is nearly gone** (≤ ~3 of your own turns' worth). A healthy balance returns `balance_not_low`; never having used the agent returns `use_agent_first`; and you must burn each refill before the next one. Feedback is a lifeline for users who've spent their PWM — not bonus income on top of a full wallet. - **The reward — instant, automatic, no claim step:** `reward = runway_turns × your own average turn cost × decay`, where the runway **shrinks per feedback** (~19 turns, then 18, … floor 5), the per-turn cost is capped (anti-inflation), and `decay = 1/(1 + 0.1 × agent_total_usage)`. Early in an agent's life decay ≈ 1.0 — run out, feed back, and you're **refilled for ~19 more turns**. - **The decay is the design:** as the agent accumulates usage the multiplier falls toward 0 — late feedback pays a fraction of a turn and **won't sustain usage**. That's the signal to move to contributing or mining (Step 5). - Feedback is recorded against your wallet (governance can mark down spam), but it takes **no share of the weekly pool epochs** — the big `A_k` emission from the 4M PWM pools is reserved for **usage‑weighted contributions** (tools, solutions — Step 5). What good feedback looks like: a problem you actually hit, a confusing step, a missing capability, a concrete suggestion. > **In short: spend your PWM using an agent → when you're nearly out, feed > back what you found → your runway is refilled (~19 turns, then 18, …) → > repeat.** Early users ride this loop almost for free; later users' refills > taper below their costs — by design — and the real earning shifts to > contributed tools/solutions (or mining). ## Step 5 — Earning later, once feedback rewards have decayed As the agents accumulate usage, the feedback multiplier decays and rewards stop covering your turn costs. Keep earning by: 1. **Improve the agents with registered contributions** — THE main earning track. Contribute a tool, solution, or dataset to an agent (e.g. a CASSI solver to computational-imaging); every paid turn that *uses* it adds usage-weight, and you earn from that agent's pool **every week it keeps being used** — usage-weighted, not first-come, and it scales with the agent's success (same `A_k` formula as feedback; see [AGENT_IMPROVEMENT_EARNING_METHOD.md](https://github.com/integritynoble/AI4Science/blob/main/docs/AGENT_IMPROVEMENT_EARNING_METHOD.md)). 2. **Weekly wins** — benchmark competitions and weekly challenge rewards on the site. 3. **Mining on [physicsworldmodel.org](https://physicsworldmodel.org)** — more principles, digital twins, benchmarks, solutions (the bootstrap path never closes). 4. **Compute provision** — serve GPU jobs for the compute loop and earn the provider side of the 90/10 split. ## The agent-improvement earning method (the math) Everything in Steps 4–5 is one formula, applied per agent, per weekly epoch: ``` A_k = (M_pool − M(t)) × w_k / Σ(w_j over active contributions j) × λ ``` - **M_pool** — that agent's own pool: | Agent | Pool (PWM) | |---|---| | computational-imaging | **1,400,000** (largest) | | research | 600,000 | | codex | 520,000 | | claude-code | 520,000 | | paper | 480,000 | | unified-LLM | 480,000 | | **Total** | **4,000,000** | - **M(t)** — what that pool has already paid out (derived from the ledger, so it can't drift). - **λ = 0.05** — each week emits 5% of what *remains*: the pool never runs out, and **early epochs pay more** (unified-LLM epoch-1 budget 24,000 → epoch-10 ≈15,100 → epoch-50 ≈1,950). - **w_k** — your contribution's weight. This is where the two tracks differ: | Improvement type | w_k rule | |---|---| | **feedback** (`/feedback` in chat) | **Not part of A_k** — paid **instantly** at submission, refilling a shrinking runway: `runway_turns (19, 18, … floor 5) × your capped avg turn cost × 1/(1 + 0.1 × agent_usage)`. **Unlocks only when your balance is nearly exhausted** after real usage of that agent. | | **tool / solution / digital twin / benchmark** (registered, used by agents in paid turns) | **Usage-weighted:** Σ weight_units per *distinct non-author* user × quality (self-usage excluded; sybil-capped). Grows every week others keep using it. | - **Payout:** each epoch, every active contribution receives its A_k — **75% to you** (your wallet-bound account, automatically, no claim step), 25% to the treasury. **Worked example** (epoch 1, unified-LLM, fresh pool): your contributed tool is the only used contribution → `A_k = 0.05 × 480,000 = 24,000` → **you get 18,000 PWM** (75%). With competition — your tool w=30 vs. another author's solution w=10 → Σw=40 → you get `24,000 × 30/40 × 0.75 = 13,500`, the other author 4,500. (Feedback is *not* in this race — it is paid instantly and modestly at submission, Step 4.) Full implementation map (files, endpoints, config): [AGENT_IMPROVEMENT_EARNING_METHOD.md](https://github.com/integritynoble/AI4Science/blob/main/docs/AGENT_IMPROVEMENT_EARNING_METHOD.md). ## Your earnings → your wallet on-chain PWM accrues on your site ledger instantly (free, per-action). On the weekly settlement batch, the treasury sends **real PWM (ERC-20 on Base)** to your bound wallet address — receiving requires nothing from you, and every batch is publicly verifiable on BaseScan. Until settlement, your balance is visible at any time on your account page and via the API. ## Safety rules (the short version) - **Never share your wallet private key.** Not with ai4science, not with the website, not with anyone. Login is browser-approved; spending uses a revocable token; receiving needs no key at all. - The stored CLI token can only spend your *site ledger* balance on agent usage. Leaked? Revoke it on the account page — damage stops instantly. - Billing is **off by default**: nothing charges until you set `AI4SCIENCE_PWM_GATE=1` with a logged-in account. ## Troubleshooting | Symptom | Fix | |---|---| | `insufficient PWM (balance 0.000)` | Mine your first PWM (Step 1) — the gate refuses to run on an empty balance. | | `login expired / could not verify your PWM balance` | `ai4science login --pwm` again (or set `PWM_TOKEN` directly for scripts). | | Browser shows a different code than the terminal | **Deny it.** Someone else may be phishing an approval; restart `login --pwm`. | | A turn charged but the agent errored | Charges are per completed turn; transient provider errors are not billed. Check `/model` to switch brands. | | Want to stop billing immediately | `unset AI4SCIENCE_PWM_GATE` (and/or `ai4science logout`). | | `error: externally-managed-environment` when installing (Debian/Ubuntu) | Use the one-line installer, or add `--break-system-packages` to pip. After that, `ai4science update` handles it automatically. | | Upgraded but behavior didn't change | pip caches the GitHub zip by URL — use `ai4science update` (it forces a fresh download), then **restart** any running `ai4science chat` session. | | Bordered TUI doesn't appear | It's opt-in: set `AI4SCIENCE_TUI=full` (or `1` for box-only), needs a real terminal (TTY) and version ≥ 0.5.0 (`ai4science version`). | --- *Companion docs:* test process — [TEST_AGENTS_WITH_ONE_WALLET.md](https://github.com/integritynoble/AI4Science/blob/main/docs/TEST_AGENTS_WITH_ONE_WALLET.md) · agents status — [AGENTS_STATUS_AND_ONE_WALLET_TEST_2026-06-10.md](https://github.com/integritynoble/AI4Science/blob/main/docs/AGENTS_STATUS_AND_ONE_WALLET_TEST_2026-06-10.md) · live prod test — pwm repo `pwm-team/doc/PWM_AI4SCIENCE_LOGIN_PWM_LIVE_TEST_2026-06-10.md` · safety design — pwm repo `pwm-team/doc/PWM_LEDGER_SAFETY_AND_ONCHAIN_SETTLEMENT_QA_2026-06-10.md`.