🚀 Extract structured data from a wide range of document formats: PDFs, Word docs, emails, HTML, and more. This skill detects your file type automatically and pulls out text, tables, metadata, and document elements in a consistent, organized output. No manual formatting needed.

💡 Well suited to processing mixed-format document sets, parsing emails with attachments, converting PDFs to structured data, and building document pipelines. Native PDFs, scanned images, spreadsheets, and presentations are all handled through the same interface.

✨ Get element classification (titles, tables, lists), preserved document metadata, and OCR support for images, all through a single unified interface.

Data Extractor

Requirements