CLI Tools Overview
Convert files to searchable knowledge with command-line tools
textrawl includes CLI tools for converting various file formats to Markdown, then uploading them to your knowledge base.
Quick Start
Available Commands
mbox
Convert MBOX email archives (Gmail exports, Thunderbird backups).
html
Convert HTML files and web pages to searchable markdown.
upload
Upload converted files to Supabase with automatic embedding.
Unified Converter
The main entry point for all conversions:
| Command | Description |
|---|---|
mbox <file> | Convert MBOX email archive |
eml <path> | Convert EML file(s) or directory |
html <path> | Convert HTML file(s) or directory |
takeout <path> | Convert Google Takeout archive |
auto <path> | Auto-detect format and convert |
Common Options
All conversion commands support these options:
| Option | Default | Description |
|---|---|---|
-o, --output <dir> | ./converted | Output directory |
-v, --verbose | false | Enable verbose logging |
--dry-run | false | Preview without writing files |
-t, --tags <tags...> | [] | Additional tags to add |
Workflow
Output Format
Converted files are saved as Markdown with YAML frontmatter:
Frontmatter Fields
| Field | Description |
|---|---|
title | Document title |
source_type | email, web, youtube, calendar, contact |
source_hash | Hash for deduplication |
tags | Array of tags |
created_at | Original creation date |
converted_at | Conversion timestamp |
metadata | Format-specific metadata |
The source_hash prevents duplicate uploads when re-running the upload command.
Web UI
A drag-and-drop web interface for file conversion:
Features:
- Drag-and-drop file upload
- Real-time conversion progress
- Auto-upload option after conversion
- Supports MBOX, EML, ZIP (Takeout), HTML, PDF, DOCX, TXT, MD
Next Steps
- MBOX Conversion - Import email archives
- HTML Conversion - Convert web pages
- Batch Upload - Upload to knowledge base