Flow Table Extractor - Distribution Package
For End Users
This Chrome extension extracts tables from websites and uploads them to Flow Immersive, with optional Claude AI smart parsing for complex tables.
โจ Key Features
- ๐ Automatically detects all tables on any webpage
- ๐ One-click extraction and upload to Flow
- ๐ค Optional Claude AI smart parsing for multi-row headers
- ๐ Secure credential storage (enter once, use everywhere)
- ๐ Dataset versioning (same title = new version)
๐ Quick Start (5 Minutes)
Step 1: Install Python Backend
# Extract the zip file, then:
cd flow-table-extractor/backend
pip install -r requirements.txt
python server.py
Keep this terminal window open while using the extension.
Step 2: Install Browser Extension
- Open Chrome and go to chrome://extensions/
- Enable "Developer mode" (toggle in top-right)
- Click "Load unpacked"
- Select the flow-table-extractorfolder
- Look for the green "FT" icon in your toolbar
Step 3: Configure Credentials (One Time!)
- Click the "FT" extension icon
- Click "Configure Flow Credentials"
- Enter your Flow username and password
- (Optional) Enter Claude API key for smart parsing
- Click "Save All Credentials"
- Done! Never need to enter again โ
Step 4: Extract Your First Table
- Go to any website with tables (try Wikipedia!)
- Click the "FT" extension icon
- Select a table from the dropdown
- Enter a dataset title
- Click "Extract & Upload to Flow"
- Check Flow to see your data!
๐ Security & Privacy
Where are credentials stored?
- Browser: Chrome's encrypted sync storage
- Backend: Local file (~/.flow_extractor_config.json) with restricted permissions
Is it secure?
- โ Chrome automatically encrypts stored credentials
- โ File permissions prevent other users from reading
- โ Communication is localhost-only or HTTPS
- โ No third-party access to your credentials
See SECURITY.md for detailed information.
๐ค Claude AI Smart Parsing (Optional)
For complex tables with multi-row headers (like Wikipedia population tables), enable Claude AI:
Setup:
- Get API key from https://console.anthropic.com/
- Enter it in "Configure Flow Credentials" (or set as environment variable)
- Check โ๏ธ "Use Claude AI for smart table parsing"
Cost: ~$0.01-0.03 per table (1-3 cents)
What it does:
- Intelligently combines multi-row headers
- Handles merged cells properly
- Creates clear column names
- Perfect for complex Wikipedia/financial tables
See CLAUDE_SETUP.md for details.
๐ Documentation
- START_HERE.md - Welcome guide
- QUICKSTART.md - 5-minute setup
- SECURITY.md - Credential storage explained
- CLAUDE_SETUP.md - Claude AI setup
- README.md - Complete documentation
- VISUAL_GUIDE.md - Step-by-step walkthrough
๐งช Test Files
- test-page.html - Simple tables for testing
- test-complex-tables.html - Complex multi-row header tables
๐ก Tips
- Descriptive titles: Use clear dataset names like "Q4 Sales 2024"
- Version control: Re-upload with same title creates new version
- Claude for complex: Only use Claude AI for multi-row header tables
- Keep backend running: Leave python server.pyrunning in background
๐ Troubleshooting
"Connection error: Failed to fetch" โ Backend isn't running. Start python server.py
"Flow credentials not configured" โ Click "Configure Flow Credentials" and save them
"No tables found on this page" โ Page doesn't have HTML <table> elements
Tables extracted incorrectly โ Try enabling "Claude AI smart parsing" for complex tables
More help: See README.md or check the terminal output
๐ Usage Examples
Wikipedia Data:
- Go to "List of countries by population"
- Enable Claude AI (multi-row headers!)
- Extract โ Upload to Flow
- Analyze in Flow!
Your Company Dashboard:
- Open dashboard with data tables
- Select table
- Upload to Flow for analysis
Financial Reports:
- Find tables on investor sites
- Enable Claude for grouped columns
- Extract clean data
๐ Updates
To update credentials:
- Click extension icon
- Click "Configure Flow Credentials"
- Update any field
- Save
To add/change Claude API key: Same as above - it's in the credentials form.
๐ฏ System Requirements
- Browser: Chrome, Edge, or Brave
- Python: 3.7+ (3.12 recommended)
- OS: Windows, Mac, or Linux
- Internet: Required for Flow uploads and Claude AI
๐ฆ What's Included
flow-table-extractor/
โโโ manifest.json          # Extension config
โโโ popup.html/.js         # Extension UI & logic
โโโ icons                  # Extension icons
โโโ backend/
โ   โโโ server.py          # Flask backend
โ   โโโ requirements.txt   # Python dependencies
โโโ test-*.html            # Test pages
โโโ *.md files             # Documentation
โ ๏ธ Important Notes
- Backend must be running: Start python server.pybefore using
- Credentials persist: Only need to enter once
- Claude is optional: Works great without it for simple tables
- Cost awareness: Claude AI has minimal cost (~$0.02/table)
- Privacy: Your credentials stay on your computer
๐ You're Ready!
- โ Backend running (python server.py)
- โ Extension installed
- โ Credentials configured
- โ Extract tables from anywhere!
Need help? See the documentation files or contact your IT administrator.
Enjoy smart table extraction! ๐
