Flow Table Extractor - Distribution Package

For End Users

This Chrome extension extracts tables from websites and uploads them to Flow Immersive, with optional Claude AI smart parsing for complex tables.

✨ Key Features

  • 🔍 Automatically detects all tables on any webpage
  • 📊 One-click extraction and upload to Flow
  • 🤖 Optional Claude AI smart parsing for multi-row headers
  • 🔐 Secure credential storage (enter once, use everywhere)
  • 🔄 Dataset versioning (same title = new version)

🚀 Quick Start (5 Minutes)

Step 1: Install Python Backend

# Extract the zip file, then:
cd flow-table-extractor/backend
pip install -r requirements.txt
python server.py

Keep this terminal window open while using the extension.
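
To confirm the backend is listening before you open the extension, a quick check along these lines can help. This is a minimal sketch, not part of the package: the helper name backend_is_up is hypothetical, and port 5000 is only an assumption (it is Flask's default). Use the address that python server.py prints in the terminal.

```python
import socket


def backend_is_up(host: str = "127.0.0.1", port: int = 5000, timeout: float = 1.0) -> bool:
    """Return True if something accepts TCP connections on host:port.

    The default port 5000 is an assumption (Flask's default); adjust it to
    whatever address the backend reports when it starts.
    """
    try:
        # create_connection raises OSError if nothing is listening or the
        # attempt times out.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

Calling backend_is_up() from a Python prompt returns False while the server is stopped, which corresponds to the "Failed to fetch" error in the extension.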

Step 2: Install Browser Extension

  1. Open Chrome and go to chrome://extensions/
  2. Enable "Developer mode" (toggle in top-right)
  3. Click "Load unpacked"
  4. Select the flow-table-extractor folder
  5. Look for the green "FT" icon in your toolbar

Step 3: Configure Credentials (One Time!)

  1. Click the "FT" extension icon
  2. Click "Configure Flow Credentials"
  3. Enter your Flow username and password
  4. (Optional) Enter Claude API key for smart parsing
  5. Click "Save All Credentials"
  6. Done! You won't need to enter them again ✓

Step 4: Extract Your First Table

  1. Go to any website with tables (try Wikipedia!)
  2. Click the "FT" extension icon
  3. Select a table from the dropdown
  4. Enter a dataset title
  5. Click "Extract & Upload to Flow"
  6. Check Flow to see your data!

🔐 Security & Privacy

Where are credentials stored?

  • Browser: Chrome's extension sync storage
  • Backend: Local file (~/.flow_extractor_config.json) with restricted permissions

Is it secure?

  • ✅ Chrome keeps extension storage isolated from web pages (note that Chrome's documentation states this storage area is not itself encrypted)
  • ✅ File permissions prevent other users from reading
  • ✅ Communication is localhost-only or HTTPS

See SECURITY.md for detailed information.
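
For readers curious what "restricted permissions" on the backend file can look like, here is a minimal sketch. It is not taken from server.py: the function names and the flow_username key are hypothetical, and the owner-only mode (0o600) is an assumption about what "restricted" means here. The permission bits apply on Mac and Linux; Windows handles file permissions differently.

```python
import json
import os
import stat
from pathlib import Path


def save_config(path: Path, config: dict) -> None:
    """Write config as JSON, readable and writable by the owner only."""
    # Create the file with mode 0o600 from the start, so there is no window
    # in which it exists with looser permissions.
    fd = os.open(path, os.O_WRONLY | os.O_CREAT | os.O_TRUNC, 0o600)
    with os.fdopen(fd, "w", encoding="utf-8") as f:
        json.dump(config, f)
    # The mode passed to os.open is ignored if the file already existed,
    # so tighten the permissions explicitly as well.
    os.chmod(path, stat.S_IRUSR | stat.S_IWUSR)


def load_config(path: Path) -> dict:
    """Read the JSON config back into a dict."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)
```

For example, save_config(Path.home() / ".flow_extractor_config.json", {"flow_username": "demo"}) would produce a file that ls -l shows as -rw-------.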

🤖 Claude AI Smart Parsing (Optional)

For complex tables with multi-row headers (like Wikipedia population tables), enable Claude AI:

Setup:

  1. Get an API key from https://console.anthropic.com/
  2. Enter it in "Configure Flow Credentials" (or set it as an environment variable)
  3. Check ☑️ "Use Claude AI for smart table parsing"

Cost: ~$0.01-0.03 per table (1-3 cents)
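
To budget for a batch of tables, the per-table range above scales linearly. A small sketch of the arithmetic, working in whole cents to avoid floating-point rounding (the function name is hypothetical, and the 1-3 cent range is the estimate quoted above, not a guaranteed price):

```python
from typing import Tuple


def estimate_cost_cents(num_tables: int, low_cents: int = 1, high_cents: int = 3) -> Tuple[int, int]:
    """Return the (low, high) estimated cost in cents for num_tables tables."""
    if num_tables < 0:
        raise ValueError("num_tables must be non-negative")
    return num_tables * low_cents, num_tables * high_cents


low, high = estimate_cost_cents(50)
print(f"50 tables: ${low / 100:.2f} to ${high / 100:.2f}")  # 50 tables: $0.50 to $1.50
```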

What it does:

  • Intelligently combines multi-row headers
  • Handles merged cells properly
  • Creates clear column names
  • Perfect for complex Wikipedia/financial tables

See CLAUDE_SETUP.md for details.
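
To make "combining multi-row headers" concrete, here is a rule-based sketch of the idea. It is only an illustration and not how the extension or Claude parses tables. It assumes a horizontally merged cell arrives as its text followed by empty strings, and that blanks in the last header row are real blanks; flatten_headers is a hypothetical name.

```python
from typing import List


def flatten_headers(header_rows: List[List[str]]) -> List[str]:
    """Combine several header rows into one list of column names."""
    if not header_rows:
        return []
    width = max(len(row) for row in header_rows)
    filled: List[List[str]] = []
    for r, row in enumerate(header_rows):
        cells = [c.strip() for c in row] + [""] * (width - len(row))
        is_last = r == len(header_rows) - 1
        if not is_last:
            for c in range(1, width):
                # Forward-fill a blank only while it sits under the same
                # parent header as its left neighbour (always true in row 0).
                same_parent = r == 0 or filled[r - 1][c] == filled[r - 1][c - 1]
                if not cells[c] and same_parent:
                    cells[c] = cells[c - 1]
        filled.append(cells)
    names: List[str] = []
    for c in range(width):
        parts: List[str] = []
        for r in range(len(filled)):
            text = filled[r][c]
            # Skip blanks and labels repeated directly beneath themselves.
            if text and (not parts or parts[-1] != text):
                parts.append(text)
        names.append(" ".join(parts))
    return names


rows = [
    ["Country", "Population", "", "Area"],
    ["", "2023", "2024", ""],
]
print(flatten_headers(rows))
# ['Country', 'Population 2023', 'Population 2024', 'Area']
```

Simple rules like these break down on irregular layouts, which is the situation the optional Claude parsing is meant for.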

📖 Documentation

  • START_HERE.md - Welcome guide
  • QUICKSTART.md - 5-minute setup
  • SECURITY.md - Credential storage explained
  • CLAUDE_SETUP.md - Claude AI setup
  • README.md - Complete documentation
  • VISUAL_GUIDE.md - Step-by-step walkthrough

🧪 Test Files

  • test-page.html - Simple tables for testing
  • test-complex-tables.html - Complex multi-row header tables

💡 Tips

  1. Descriptive titles: Use clear dataset names like "Q4 Sales 2024"
  2. Version control: Re-uploading with the same title creates a new version
  3. Claude for complex tables: Only enable Claude AI for tables with multi-row headers
  4. Keep the backend running: Leave python server.py running in the background

🐛 Troubleshooting

"Connection error: Failed to fetch" → The backend isn't running. Start it with python server.py

"Flow credentials not configured" → Click "Configure Flow Credentials" and save them

"No tables found on this page" → The page doesn't contain HTML <table> elements

Tables extracted incorrectly → Try enabling "Claude AI smart parsing" for complex tables

More help: See README.md or check the terminal output
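
If you hit "No tables found on this page", you can check whether a saved copy of the page contains any <table> elements at all. The sketch below uses only Python's standard library. It is a diagnostic aid, not the extension's own detection logic, and the names TableCounter and count_tables are hypothetical. Note that saved source can differ from what the browser shows when a page builds its tables with JavaScript.

```python
from html.parser import HTMLParser


class TableCounter(HTMLParser):
    """Count <table> start tags, including nested tables."""

    def __init__(self) -> None:
        super().__init__()
        self.count = 0

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names, so <TABLE> is counted too.
        if tag == "table":
            self.count += 1


def count_tables(html: str) -> int:
    """Return the number of <table> elements found in an HTML string."""
    parser = TableCounter()
    parser.feed(html)
    parser.close()
    return parser.count
```

A result of 0 for a page that visibly shows a grid of data usually means the grid is built from other elements such as <div>, which this kind of check (and, per the entry above, the extension) does not treat as a table.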

📊 Usage Examples

Wikipedia Data:

  1. Go to "List of countries by population"
  2. Enable Claude AI (multi-row headers!)
  3. Extract → Upload to Flow
  4. Analyze in Flow!

Your Company Dashboard:

  1. Open dashboard with data tables
  2. Select table
  3. Upload to Flow for analysis

Financial Reports:

  1. Find tables on investor sites
  2. Enable Claude for grouped columns
  3. Extract clean data

🔄 Updates

To update credentials:

  1. Click extension icon
  2. Click "Configure Flow Credentials"
  3. Update any field
  4. Save

To add or change the Claude API key: Same as above; it's in the credentials form.

🎯 System Requirements

  • Browser: Chrome, Edge, or Brave
  • Python: 3.7+ (3.12 recommended)
  • OS: Windows, Mac, or Linux
  • Internet: Required for Flow uploads and Claude AI
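
If you are unsure which Python the python command runs, a check like the following confirms it meets the 3.7 minimum listed above. This is a small sketch with a hypothetical helper name, not something shipped in the package.

```python
import sys

MIN_VERSION = (3, 7)  # minimum version listed in the requirements above


def python_is_supported(version=sys.version_info) -> bool:
    """Return True if the given (major, minor, ...) version is at least 3.7."""
    return tuple(version[:2]) >= MIN_VERSION


print("Python version OK" if python_is_supported() else "Python 3.7 or newer is required")
```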

📦 What's Included

flow-table-extractor/
├── manifest.json          # Extension config
├── popup.html/.js         # Extension UI & logic
├── icons                  # Extension icons
├── backend/
│   ├── server.py          # Flask backend
│   └── requirements.txt   # Python dependencies
├── test-*.html            # Test pages
└── *.md files             # Documentation
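
If "Load unpacked" or python server.py fails right after unzipping, it can help to confirm the key files from the listing above are where they should be. A minimal sketch, assuming popup.html/.js stands for two files, popup.html and popup.js; the names EXPECTED_FILES and missing_files are hypothetical.

```python
from pathlib import Path
from typing import List

# Paths relative to the extracted flow-table-extractor folder, taken from
# the listing above (popup.html/.js is assumed to mean two separate files).
EXPECTED_FILES = [
    "manifest.json",
    "popup.html",
    "popup.js",
    "backend/server.py",
    "backend/requirements.txt",
]


def missing_files(root: Path) -> List[str]:
    """Return the expected files that do not exist under root."""
    return [rel for rel in EXPECTED_FILES if not (root / rel).is_file()]
```

For example, missing_files(Path("flow-table-extractor")) returns an empty list when the package was extracted completely.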

⚠️ Important Notes

  1. Backend must be running: Start python server.py before using the extension
  2. Credentials persist: You only need to enter them once
  3. Claude is optional: The extension works well without it for simple tables
  4. Cost awareness: Claude AI has a small cost (~$0.01-0.03 per table)
  5. Privacy: Your credentials stay on your computer

🎉 You're Ready!

  1. ✓ Backend running (python server.py)
  2. ✓ Extension installed
  3. ✓ Credentials configured
  4. ✓ Extract tables from anywhere!

Need help? See the documentation files or contact your IT administrator.

Enjoy smart table extraction! 🚀