HTML Conversion
Convert HTML files to searchable Markdown
Convert HTML files and saved web pages to clean, searchable Markdown.
Usage
Options
| Option | Default | Description |
|---|---|---|
-o, --output <dir> | ./converted/web | Output directory |
-r, --recursive | false | Process subdirectories |
-v, --verbose | false | Enable verbose logging |
--dry-run | false | Preview without writing |
-t, --tags <tags...> | ["web"] | Additional tags |
Example
Output Format
What Gets Extracted
- Main content (article body)
- Title and metadata
- Images (referenced, not embedded)
- Links (preserved as Markdown)
What Gets Removed
- Navigation menus
- Advertisements
- Scripts and styles
- Cookie banners
- Footer boilerplate
Supported Formats
.html/.htmfiles- Saved web pages
- Browser "Save As" exports
- Google Takeout saved pages
Next Steps
- Batch Upload - Upload converted files