Data Extraction
Extract structured page intelligence from DOM and workflow outputs.
What this guide covers
How to extract structured values from page DOM and workflow outputs, normalize them into JSON or markdown artifacts, and persist them with source provenance.
Mode selection
Use managed mode for open pages and extension mode when extraction requires an authenticated profile.
Recommended run sequence
- Navigate to the source page and capture a snapshot for stable references.
- Extract key values with DOM inspection commands or a workflow wrapper.
- Normalize outputs into JSON/markdown artifacts for downstream systems.
- Persist artifacts with source references for auditability.
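The normalize and persist steps above can be sketched as follows. This is a minimal illustration in Python, not part of opendevbrowser: the names `build_artifact` and `save_artifact`, the field names, and the artifact layout are all assumptions. The raw values are assumed to have been captured already, for example from DOM inspection command output.

```python
import json
from datetime import datetime, timezone


def build_artifact(url, raw_fields, expected_fields, refs=None):
    """Normalize raw extracted values into a JSON-serializable artifact.

    Hypothetical layout: every expected field gets an entry, so a missing
    value is recorded explicitly instead of being dropped.
    """
    refs = refs or {}
    fields = {}
    for name in expected_fields:
        value = raw_fields.get(name)
        if isinstance(value, str):
            # Treat whitespace-only strings as missing.
            value = value.strip() or None
        fields[name] = {
            "value": value,
            "ref": refs.get(name),      # snapshot ref or selector, if known
            "missing": value is None,   # explicit marker for absent data
        }
    return {
        "source_url": url,
        "extracted_at": datetime.now(timezone.utc).isoformat(),
        "fields": fields,
    }


def save_artifact(artifact, path):
    """Persist the artifact as stable, diff-friendly JSON."""
    with open(path, "w", encoding="utf-8") as fh:
        json.dump(artifact, fh, indent=2, sort_keys=True)
```

For instance, `build_artifact("https://example.com/item", {"title": "  Widget "}, ["title", "sku"], {"title": "e12"})` would yield a trimmed `title` tied to ref `e12` and an `sku` entry flagged as missing. The URL and ref here are placeholders.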
Verification checkpoints
- Extracted fields are tied to deterministic refs or documented selectors.
- Outputs include URL/source provenance and execution metadata.
- Missing fields are explicit and not silently dropped.
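These checkpoints could be automated with a small validator. The sketch below is an assumption-laden example in Python, not an opendevbrowser feature: it presumes a hypothetical artifact shape with top-level `source_url` and `extracted_at` keys and a `fields` mapping whose entries carry `value`, an optional `ref` or `selector`, and a `missing` flag.

```python
def check_artifact(artifact, expected_fields):
    """Return a list of checkpoint violations (empty list means all pass).

    The artifact layout is hypothetical; adapt the keys to your own format.
    """
    problems = []

    # Checkpoint: outputs include provenance and execution metadata.
    if not artifact.get("source_url"):
        problems.append("missing source_url")
    if not artifact.get("extracted_at"):
        problems.append("missing extracted_at")

    fields = artifact.get("fields", {})
    for name in expected_fields:
        entry = fields.get(name)
        if entry is None:
            # Checkpoint: missing fields must not be silently dropped.
            problems.append(f"{name}: silently dropped")
            continue
        if entry.get("value") is None:
            if not entry.get("missing"):
                problems.append(f"{name}: empty value not flagged as missing")
        elif not (entry.get("ref") or entry.get("selector")):
            # Checkpoint: values are tied to a ref or documented selector.
            problems.append(f"{name}: no ref or selector")
    return problems
```

Returning a list of messages rather than raising on the first failure lets a run report every violation at once, which is convenient when the result is stored alongside the artifact.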
Related surfaces
- dom-html, dom-text, dom-attr, dom-value
- research run, shopping run, product-video run
- artifacts cleanup
Command example
npx opendevbrowser dom-text --help