Nova Origins
Fixture-only crawler dry run

Crawler Dry-Run Lab

Operator preview for RSS and arXiv-style fixture parsing. This lab parses local files only and never fetches remote feeds.

Live fetching disabled. External API calls disabled. No RSS feed fetching, arXiv API calls, browser scraping, AI summarization, AI drafting, or publishing automation.

Review source monitor history
Review scheduler dry-runs

Dry-Run Policy

Dry-run enabled: yes

Live fetching: disabled

External APIs: disabled

Remote URLs rejected: yes

Fixture Roots

examples/fixtures
services/api/tests/fixtures

Supported Feed Types

rss
atom
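
The two supported feed types can be told apart by the root element of the fixture file. The following is a minimal sketch, not the lab's actual parser: the function name `detect_feed_type` is hypothetical, and it assumes the fixture is well-formed XML whose root is either an RSS 2.0 `rss` element or an Atom `feed` element in the standard Atom namespace.

```python
import xml.etree.ElementTree as ET

# Standard Atom namespace (RFC 4287); arXiv-style feeds use Atom.
ATOM_NS = "http://www.w3.org/2005/Atom"


def detect_feed_type(xml_text: str) -> str:
    """Return "rss" or "atom" based on the document's root element.

    Hypothetical helper for illustration only.
    """
    root = ET.fromstring(xml_text)
    if root.tag == "rss":
        return "rss"
    # ElementTree spells namespaced tags as "{namespace}localname".
    if root.tag == f"{{{ATOM_NS}}}feed":
        return "atom"
    raise ValueError(f"unsupported feed root element: {root.tag}")
```

Anything that is neither type raises an error instead of being guessed at, which fits a dry-run tool that should surface unexpected input to the operator.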

API base URL is configured. Frontend data mode is static-preview; static builds do not depend on a live backend.

Fixture List

Dry-Run Preview

Local repo fixtures only. Remote URL input is rejected by the API and CLI.
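
One way the "local fixtures only" rule could be enforced is sketched below. This is an assumption about the approach, not the lab's actual code: `resolve_fixture_path` and its `repo_root` parameter are hypothetical, and only the two fixture root paths come from this page.

```python
from pathlib import Path

# Fixture roots as listed on this page, relative to the repository root.
FIXTURE_ROOTS = ("examples/fixtures", "services/api/tests/fixtures")


def resolve_fixture_path(raw: str, repo_root: Path) -> Path:
    """Return the resolved fixture path, or raise ValueError if not allowed.

    Hypothetical helper: rejects URL-like input and any path that resolves
    outside the configured fixture roots.
    """
    # Reject anything that looks like a URL (http://, https://, file://, //host).
    if "://" in raw or raw.startswith("//"):
        raise ValueError("remote URLs are rejected in dry-run mode")

    # Resolve ".." segments so traversal cannot escape a fixture root.
    candidate = (repo_root / raw).resolve()
    for root in FIXTURE_ROOTS:
        root_path = (repo_root / root).resolve()
        try:
            candidate.relative_to(root_path)
        except ValueError:
            continue
        return candidate
    raise ValueError(f"path is outside the fixture roots: {raw}")
```

Resolving the path before comparing it means an input such as `examples/fixtures/../../secret.txt` is rejected even though it starts with an allowed prefix.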

Results

Parsed: 4
Accepted: 3
Rejected: 1
Duplicates: 0
Staged: 1
Skipped: 0

Fixture: JWST spectrum review flags water vapor candidate in sub-Neptune atmosphere

crawler_dry_run_candidate

Synthetic fixture result about atmosphere spectroscopy, water vapor, and follow-up needs.

Relevance: 10
Science: 10
Audience: 10
Archive-style candidate: verify provenance and scientific status before publication.

Reasons: Matched keyword: JWST, Matched keyword: atmosphere, Matched keyword: water vapor, Matched keyword: spectroscopy

Dedup: new_candidate / staged

Fixture: Launch logistics team completes general telescope engineering checklist

crawler_dry_run_candidate

Synthetic low-relevance fixture result about general telescope engineering and launch logistics.

Relevance: 0
Science: 0
Audience: 0
Low relevance candidate retained for operator review only.

Reasons: Low-relevance term: general telescope engineering, Low-relevance term: launch logistics

Dedup: not_checked_low_relevance / not staged
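
The "Reasons" lines in the two results above are consistent with simple case-insensitive term matching. The sketch below shows how such reasons could be produced; it is an illustration under assumptions, not the lab's scoring code. The function name `explain_relevance` is hypothetical, the term lists contain only the terms visible in these two results, and how the Relevance, Science, and Audience scores are computed is not shown on this page, so no scoring is attempted.

```python
from __future__ import annotations

# Terms taken from the "Reasons" lines above; the real lists are likely longer.
KEYWORDS = ("JWST", "atmosphere", "water vapor", "spectroscopy")
LOW_RELEVANCE_TERMS = ("general telescope engineering", "launch logistics")


def explain_relevance(text: str) -> list[str]:
    """Return human-readable reasons for each term found in the text.

    Hypothetical helper: plain case-insensitive substring matching.
    """
    lowered = text.lower()
    reasons = [
        f"Matched keyword: {term}"
        for term in KEYWORDS
        if term.lower() in lowered
    ]
    reasons += [
        f"Low-relevance term: {term}"
        for term in LOW_RELEVANCE_TERMS
        if term.lower() in lowered
    ]
    return reasons
```

Substring matching is deliberately naive here; a production matcher would likely need word boundaries or tokenization to avoid false hits inside longer words.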

Import Staging

`stage_to_import_batch` stores accepted source candidates in Sprint 9 import staging. It does not commit imports automatically.

Staging requires local API preview and database mode. Static builds keep this disabled.
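
The staging behavior described above (store accepted candidates, never commit automatically, disabled outside database mode) could look roughly like the sketch below. Only the name `stage_to_import_batch` comes from this page; its signature, the `ImportBatch` type, the `status` field, and the `"database"` mode string are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class ImportBatch:
    """Hypothetical staged batch; it starts uncommitted."""
    candidates: list
    committed: bool = False


def stage_to_import_batch(candidates: list, data_mode: str) -> ImportBatch:
    """Stage accepted candidates without committing them.

    Assumed signature: `data_mode` is "database" when staging is available,
    and anything else (for example "static-preview") disables staging.
    """
    if data_mode != "database":
        raise RuntimeError("staging is disabled outside database mode")
    # Only accepted candidates are staged; rejected ones are left out.
    accepted = [c for c in candidates if c.get("status") == "accepted"]
    return ImportBatch(candidates=accepted)
```

Keeping `committed` false on the returned batch leaves the commit as a separate, explicit operator step.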

Learning Core Preview

crawler_dry_run.executed
crawler_dry_run.staged_import_batch
import_batch.validated
import_batch.committed