# Edxperimental Labs Website Buildout Status

Generated: 2026-05-16T05:29:45.101Z

Public buildout tracker for Edxperimental Labs: what has shipped, what still needs real-world inputs, and which research lanes should keep expanding.

## Snapshot

| Metric | Count |
| --- | ---: |
| Finished items | 92 |
| Next work items | 13 |
| Research backlog items | 8 |

## Finished Categories

| Category | Count |
| --- | ---: |
| Research | 31 |
| Benchmarks | 30 |
| Studio | 12 |
| Company | 7 |
| Consulting | 5 |
| Content | 4 |
| Platform | 3 |

## Next High-Impact Work

- **Benchmarks / Replace:** Feed real provider, notebook, browser-agent, and coding-agent outputs through /api/benchmark-run-intake or the CSV template, then replace prototype benchmark scores and task trace packets with reviewer-signed run rows. Owner: Sanjay. Dependency: Real benchmark exports. Status: Waiting on input.
- **Company / Next:** Connect the first-party /api/newsletter capture route to the chosen mailing-list provider or CRM once Sanjay picks the provider. Owner: Saujas with Sanjay. Dependency: Provider/process decision. Status: Waiting on input.
- **Consulting / Next:** Connect /api/consulting-intake and the generated consulting collateral to signed proposal templates plus the final CRM once the sales process is finalized. Owner: Saujas with Sanjay. Dependency: Provider/process decision. Status: Waiting on input.
- **Company / Next:** Connect /api/careers-application and generated careers collateral to the final hiring inbox, CRM, or applicant tracker once the first candidate process is ready. Owner: Saujas with Sanjay. Dependency: Provider/process decision. Status: Waiting on input.
- **Company / Extend:** Add social links once Sanjay shares them, replacing the current contact placeholders. Owner: Sanjay. Dependency: Official social URLs. Status: Waiting on input.
- **Studio / Replace:** Replace captured Studio preview screenshots with product walkthrough videos and client-approved demo media once live demos mature. Owner: Sanjay. Dependency: Client approval. Status: Waiting on input.
- **Studio / Replace:** Replace generated Studio packets with richer live-demo media, screenshots, and client-approved examples as productized demos mature. Owner: Sanjay. Dependency: Client approval. Status: Waiting on input.
- **Consulting / Wire:** Wire the case-study demo-readiness rows into real product controls, sanitized walkthrough videos, and client-approved proof once those assets are available. Owner: Saujas with Sanjay. Dependency: Client approval. Status: Waiting on input.
- **Benchmarks / Replace:** Replace the synthetic scripts/generate-benchmark-results.mjs seed rows with real model/provider run outputs. Owner: Sanjay. Dependency: Internal buildout. Status: Can improve now.
- **Benchmarks / Wire:** Point the current /api/benchmark-run-intake, CSV/JSON trace importer, and replay-scaffold artifact files at real harness/notebook exports once the first real benchmark runs are available. Owner: Sanjay. Dependency: Real benchmark exports. Status: Waiting on input.
- **Benchmarks / Replace:** Replace synthetic screenshot placeholders in benchmark artifact bundles with real browser/app screenshots and provider response ids from benchmark harness runs. Owner: Sanjay. Dependency: Internal buildout. Status: Can improve now.
- **Consulting / Replace:** Replace provisional case-study metrics with Sanjay's final client-approved numbers and screenshots when available. Owner: Saujas with Sanjay. Dependency: Client approval. Status: Waiting on input.
- **Consulting / Replace:** Replace generated case-study evidence packets with final client-approved screenshots, raw artifacts, and signed-off metrics when Sanjay provides them. Owner: Saujas with Sanjay. Dependency: Client approval. Status: Waiting on input.

## Research Backlog

- **Research / Extend:** Extend the new open-weight inference economics article with measured latency/quality traces from actual Mistral, DeepSeek, Qwen, and hosted inference runs once API keys and benchmark harness outputs are available. Owner: Sanjay. Dependency: Real benchmark exports. Status: Waiting on input.
- **Research / Extend:** Keep expanding data/research-evidence-library.json beyond the current 31 generated reading packets, and add carefully selected short verbatim excerpts only where publication needs exact wording. Owner: Sanjay. Dependency: Internal buildout. Status: Can improve now.
- **Research / Next:** Keep monitoring pricing refresh parser drift as provider pages change, and update provider-specific selectors when diagnostics show low confidence or missing expected model labels. Owner: Sanjay. Dependency: Provider page drift. Status: Can improve now.
- **Benchmarks / Replace:** Replace the generated Indian workflow v0.1 dataset design with redacted source packets, gold answers, reviewer notes, and real model/provider run exports. Owner: Sanjay. Dependency: Internal buildout. Status: Can improve now.
- **Benchmarks / Replace:** Replace generated benchmark-control metadata with real harness metadata once actual runs exist: raw prompts, exact model/provider identifiers, scorer identity, trace artifacts, and run replay links. Owner: Sanjay. Dependency: Real benchmark exports. Status: Waiting on input.
- **Research / Extend:** Extend the generated agent benchmark literature map with measured Edxperimental benchmark traces and a combined Agentic Reliability Index formula once real model/agent runs exist. Owner: Sanjay. Dependency: Internal buildout. Status: Can improve now.
- **Research / Next:** Turn the generated mechanistic interpretability playbook modules into separate long-form article pages if Sanjay wants a full explainer series. Owner: Sanjay. Dependency: Editorial decision. Status: Waiting on input.
- **Research / Extend:** Extend the inference economics playbook with measured latency and throughput traces once real provider/GPU benchmark runs exist. Owner: Sanjay. Dependency: Real benchmark exports. Status: Waiting on input.