A powerful Chrome extension built with React + TypeScript + TailwindCSS for comprehensive Instagram data collection. Extract detailed post information including metadata, engagement metrics, and media links from Instagram profiles with advanced scraping modes and export capabilities.
- React 18 with TypeScript for UI components
- Vite for fast development and building
- TailwindCSS v4 for modern, responsive styling
- Gulp for Chrome extension packaging and bundling
- Chrome Extension Manifest v3 for modern extension architecture
- Yarn for dependency management
- Enhanced Data Collection: Extract comprehensive post data including:
- Post URLs (posts and reels)
- Author information and usernames
- Post captions with multi-line support
- Thumbnail image URLs
- Like counts with smart number parsing (K, M, B format)
- Comment counts
- Creation dates and timestamps
- View counts (for videos/reels)
- Post types (post/reel/story)
- Page type detection (main profile/reels tab)
- One-Click Scraping: Navigate to any Instagram profile and start scraping with a single click
- Background Operation: Scraping runs in the background, allowing you to continue browsing
- Dual Scraping Modes:
- Auto-Scroll Mode: Automatically scrolls to end of profile with continuous content loading
- Manual Scroll Mode: Scrapes as you manually scroll with debounced scroll detection
- Real-time Updates: Live progress tracking with immediate status updates
- Stop & Resume: Full control over the scraping process
- Smart End Detection: Enhanced algorithms to detect when profile end is reached
- Multiple Export Formats: Download data as JSON, CSV, or modern Excel (.xlsx)
- Default Format Selection: Set preferred export format with persistent storage
- Smart CSV Export: Proper handling of multi-line captions and special characters
- Modern XLSX Export: Full Excel compatibility with structured worksheets
- Comprehensive History: View all past scraping jobs with detailed metadata
- Data Preview: Preview collected data with thumbnail images before downloading
- Clear History: Manage storage with one-click history clearing
- Instant Results: History updates immediately after scraping completes
- Modern UI: Beautiful Instagram-inspired gradient design with backdrop blur effects
- Compact Design: 360px max-height popup with scrollable content
- Version Display: Current extension version shown in UI
- Responsive Design: Optimized for various screen sizes
- Smooth Animations: Enhanced user experience with fluid transitions
- Comprehensive Info Section:
- Privacy Policy: Transparent data handling and local storage explanation
- License Agreement: Clear terms of use and responsibility disclaimers
- GitHub Integration: Direct links to report issues and suggest features
- Developer Attribution: Credit and contact information
- Node.js (v16 or higher)
- Yarn package manager
-
Clone the repository:
git clone <repository-url> cd gramharvest
-
Install dependencies:
yarn install
-
Build the extension:
yarn build && yarn build:extension -
For development with hot reload:
yarn dev
yarn build- Build React components with Viteyarn build:extension- Package extension with Gulpyarn dev- Start development serveryarn lint- Run TypeScript linting
Since this extension is not on the Chrome Web Store, you need to load it manually in Developer Mode.
- Build the extension: Run
yarn build && yarn build:extensionto create thebuildfolder - Open Chrome Extensions: Navigate to
chrome://extensionsin Chrome - Enable Developer Mode: Toggle the Developer mode switch in the top-right corner
- Load Unpacked: Click "Load unpacked" and select the
buildfolder from this project - Verify Installation: The "GramHarvest" extension should appear in your extensions list
- Login to Instagram: Ensure you're logged into Instagram in your browser
- Navigate to Profile: Go to any Instagram profile page (e.g.,
https://www.instagram.com/instagram/) - Open Extension: Click the GramHarvest icon in your browser toolbar
- Configure Settings:
- Toggle Auto-scroll on/off to choose scraping mode
- Auto-scroll ON: Extension automatically scrolls to profile end
- Auto-scroll OFF: Manually scroll while extension scrapes continuously
- Set your preferred export format (JSON, CSV, Excel)
- Start Scraping: Click "Scrape Current Profile" button
- Monitor Progress: View real-time updates of collected posts count
- Stop if Needed: Use "Stop Scraping" button to halt the process
- View History: Click "View History" to see all past scraping jobs
- Preview Data: Use the "Preview" button to examine collected data
- Export Options: Choose from multiple export formats:
- JSON: Complete data with all metadata
- CSV: Spreadsheet-compatible format
- Excel: .xlsx format for Excel compatibility
- Clear History: Use "Clear History" to manage storage space
- Intelligent Scraping Modes:
- Auto-scroll: Continuous scrolling with content loading detection
- Manual scroll: Debounced scroll event handling for user-controlled scraping
- Background Operation: Close popup while scraping continues
- Real-time Updates: See progress updates without refreshing
- Comprehensive Data: Extract author, captions, engagement metrics, thumbnails, and timestamps
- Duplicate Prevention: Smart filtering to avoid collecting duplicate posts
- Multi-tab Support: Handle different Instagram page types (main profile, reels tab)
- Error Recovery: Robust error handling with graceful degradation
- No Data Collection: This extension does not collect any usage data or personal information
- Local Storage Only: All scraped data is stored locally in your browser
- No Remote Servers: No data is transmitted to external servers
- Open Source: Complete transparency - review the code yourself
- MIT License: Open source project - see LICENSE file for details
- User Responsibility: Users are solely responsible for data collection practices
- Ethical Use: Respect Instagram's terms of service and others' privacy
- No Liability: Developers are not responsible for misuse of this tool
- π Report Issues
- π‘ Suggest Features
- β Buy Me a Coffee
Developed with β€οΈ by sayedmahmoud266
This extension was coded with the help of Cascade, a powerful agentic AI coding assistant from Windsurf, powered by Google's Gemini Pro models & Anthropic's Claude models.




