ljhijonmfahplgbbacgcfnaihbjljhhb
Open-source AI browser agent — chat with pages, automate tasks, multi-provider LLM support. WebBrain is a free, open-source browser extension that brings AI agent capabilities to Chrome. Read pages, extract data, and automate web tasks — powered by your choice of LLM. The self-hostable alternative to proprietary browser AI plugins. Key Features: - Page Understanding: Reads and comprehends any web page — articles, docs, dashboards, forms - Full Browser Agent: Click, type, scroll, navigate, and interact with pages on your behalf - Data Extraction: Extract structured data from any page — tables, lists, links, forms - Multi-Provider LLM: Works with local LLMs (LM Studio, Ollama, llama.cpp) and leading cloud providers (OpenAI, Anthropic Claude, Gemini, OpenRouter, etc.). - Privacy First: Your data stays yours. Run with a local LLM for zero data leakage - Smart Context: Automatic context management prevents token overflow Interaction Modes: - Ask Mode: Read-only questions about current page - Act Mode: Full agent with browser control for automation * Onboarding Flow: New guided setup to help you configure API keys quickly. * Enhanced Capabilities: Improved social media downloads and PDF reading. * Expanded Support: New AI providers added for broader model compatibility.
Page Agent Ext
AI-powered browser automation assistant. Control web pages with natural language. Page Agent Ext — AI-Powered Browser Automation 🌟 What is Page Agent Ext? Page Agent Ext brings AI-powered automation to your browser. Built on the open-source Page Agent framework, it lets you control web pages across multiple tabs using natural language — no scripting required. - Natural Language Control — Command your browser in plain language, no code needed - Cross-Tab Automation — Seamlessly operate across multiple tabs and pages - Smart HTML Cleaning — Intelligently extracts and simplifies page structure for accurate AI understanding - Bring Your Own LLM — Use OpenAI, Anthropic, or any compatible API with full data control - Privacy-First — Zero data collection; all data flows directly to your chosen LLM provider - Open Source — MIT licensed, built on the Page Agent framework with full transparency - 🧪 Connect from your own in-page agents - 🧪 Connect from local MCP Page Agent Ext performs DOM analysis locally in your browser. When you initiate a task, sanitized page structure is sent to the LLM API you configure. Your data is never collected or stored by us. - Your API Key — Configure your own LLM API (OpenAI, Anthropic, etc.). Data goes directly to your provider - Test API — A free test endpoint is available for evaluation; we recommend your own key for regular use Terms of Use & Privacy: https://github.com/alibaba/page-agent/blob/main/docs/terms-and-privacy.md This project is MIT licensed. Review the code, verify privacy claims, or extend it for your needs: https://github.com/alibaba/page-agent
100xBot
AI-powered browser assistant with multi-model LLM support for web automation, data extraction, and intelligent workflows ### What if your browser could do your boring work for you? You know those tasks you dread? - Copying data from websites into spreadsheets - Filling out the same forms over and over - Checking 10 different sites for price changes - Manually updating your CRM after every call **What if you could just *say* what you want, and it happens?** Talk to your browser like you'd talk to an assistant: > *"Grab all the product names and prices from this page"* > *"Fill out this application with my info"* Done. You get an alert when something changes. ### What can you actually do with it? **Save hours on data entry:** - Copy anything from any website into a spreadsheet - Fill forms instantly with your saved information - Update your CRM, database, or tracker without typing **Automate the repetitive stuff:** - Post to multiple platforms at once - Send follow-up emails on schedule - Generate reports from multiple sources **Works with voice or text:** - Speak your commands or type them - Natural language—no special syntax to learn - Works exactly how you'd explain it to a human assistant | Other automation tools | 100xBot | |------------------------|---------| | Need a developer to set up | Works out of the box | | Break when websites change | Adapts automatically | | Require learning complex interfaces | Just talk to it | | Cost $100+/month for teams | Affordable for individuals | **If you can describe what you want done, 100xBot can do it.** - **Sales teams** — Automate lead research and CRM updates - **Recruiters** — Gather candidate info without copy-paste - **Marketing teams** — Manage campaigns across platforms - **E-commerce sellers** — Track competitor prices automatically - **Researchers** — Collect data from multiple sources fast - **Executive assistants** — Book travel, manage schedules, handle admin - **Small business owners** — Do the work of a team, solo - **Anyone** who's tired of repetitive browser tasks 1. Install 100xBot 2. Click the icon 3. Tell it what you want done **Your first automation is one sentence away.**
Chrome Sidekick
Your AI Sidekick for Chrome — Web browser AI agent that automates workflows, explains pages, and extracts data. Chrome Sidekick is an AI Sidebar Agent that can assist with any webpage. It's like having ChatGPT with you on every webpage that can see what you see and automate tasks for you. Save detailed instructions as Workflows to run any time. Voice control allows you to use Chrome hands-free. Ask your assistant questions about any website or easily extract data simply by asking. Connect your favorite apps over MCP. What's new in 0.0.35: - Choose your model - Connect MCP apps - Remote control Chrome from Cursor and Claude Desktop - Bring your own API Key (optional) - Light/dark mode What's new in 0.0.32: - Agent runs without stealing the active window focus - Agent can send status update messages and continue working What's new in 0.0.30: - Saved instructions -- save detailed instructions to replay tasks - Multi-window support - Control window size and position - Automatic page summarization - Memory improvements What's new in 0.0.29: - Agent Memory -- teach it how to do things and it remembers going forward - Ability to wait for something to happen on-screen and take action - Better visibility of focused input fields - Better handling of loading screens What's new in 0.0.26: - Higher accuracy and reliability - Ability to continue tasks in new tabs + ability to open/close tabs - Now uses vision + html to "see" the webpage - Stop button - Now shows the AI's mouse cursor - Various bug fixes
ChromePilot
The AI assistant that lives in your browser. Chat, talk, automate tasks, fill forms instantly, and work with your favorite web apps – no code required. WHAT YOU CAN DO • AI-Powered Browsing Automation: Let the AI agent navigate websites, fill forms, and complete tasks hands-free using chat or voice commands • AI Image Generation: Create stunning images from text prompts powered by Nano Banana 3 Pro – choose aspect ratios and resolutions up to 4K • File Search (RAG): Upload documents and ask questions – AI searches your files semantically to find relevant answers with citations • Voice Mode with 6 Natural Voices: Speak naturally in 25+ languages – get instant answers and control your browser with your voice • Voice Keyboard: Type with your voice anywhere on the web – instantly dictates text into inputs and forms using advanced AI • Smart AI Chatbot: Ask questions, get summaries, research topics, and handle complex tasks through natural conversation • Google Search Integration: Get direct answers powered by AI without switching tabs • Upload Documents & Screenshots: Attach PDFs, images, or capture the current page to discuss with AI • Custom Webhooks: Connect to n8n, Zapier, Make, or any custom webhook to trigger automation workflows directly from chat • Direct Database Connection: Connect your Neon, Supabase, PostgreSQL, or MySQL database to get AI-powered insights via secure, read-only SQL queries • Interactive Chart Generation: Instantly turn data from your database, web pages, or documents into beautiful, interactive bar, line, pie, and doughnut charts inline in the chat • Speechify – Listen to Any Webpage: Turn any article or webpage into audio with one click – AI-powered text-to-speech with word-level highlighting, adjustable playback speed (0.5x–2x) without pitch distortion, and auto-play across paragraphs • Connectors for Workflow Automation: Connect your favorite tools and automate workflows across web apps • Personal Context Memory: Add your own context for tailored, personalized responses WHY CHOOSE THIS EXTENSION ✅ Works on All Chromium Browsers – Chrome, Edge, Brave, Arc, and more ✅ Your Data, Your Control – All chat history syncs to your Google Drive, not our servers ✅ Voice + Chat Modes – Switch between typing and talking seamlessly ✅ No Account Required for Core Features – Just install and start using ✅ Privacy-First – No data stored on external servers 1. Install the extension from Chrome Web Store 2. Click the extension icon to open the AI sidebar 3. Start chatting or activate voice mode 4. Grant permissions when prompted for automation features 🔹 Research and summarize web content 🔹 Automate repetitive browser tasks 🔹 Generate AI images from text descriptions 🔹 Search your uploaded documents with AI (RAG) 🔹 Voice-controlled browsing for accessibility 🔹 Listen to articles and web pages hands-free with text-to-speech 🔹 Voice typing in any text box with high accuracy 🔹 Quick answers from Google Search with AI 🔹 Document analysis and Q&A 🔹 Hands-free navigation while multitasking 🔹 Trigger n8n/Zapier workflows from chat 🔹 Query your live database safely with AI for instant insights and analytics 🔹 Visualize data and metrics instantly with inline charts