Offline GPT: Offline (local) AI Chat Assistant

pnlfedmgfdalhhgjokchaoaablafalfa

Chat with AI using locally downloaded models (WebLLM) and the current page's context.

Offline AI Chat Assistant - LLM in your browser

🎯 KEY FEATURES
• 100% Offline AI - All processing happens locally on your device
• Multiple Models - Choose from Phi-3-mini (2.3GB), Llama-3.1-8B (4.9GB), or Qwen2.5-7B (4.3GB)
• Page Context Aware - Automatically reads the current webpage to answer questions about it
• Complete Privacy - No data is sent to external servers and no API keys are needed. After the one-time model download, no internet connection is needed either.

🔧 HOW IT WORKS
1. Select a model from the dropdown
2. The model downloads once and is cached permanently
3. Start chatting - the AI runs entirely in your browser using WebGPU
4. Ask questions about the current webpage or general topics

💡 USE CASES
• Summarize articles and documentation
• Answer questions about webpage content
• Code assistance and explanations
• General knowledge queries
• All without an internet connection (after initial setup)

⚙️ REQUIREMENTS
• Chrome 113+ (Stable, Beta, Dev, or Canary)
• WebGPU support (enabled by default)
• 2-7GB of storage for models
• 2-6GB of available RAM

🔒 PRIVACY
• All AI processing happens locally
• No external API calls (except the one-time model download)
• No user data collection or tracking
• Conversations are stored locally in your browser only

Models powered by WebLLM and MLC AI.
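
To make the model choice and the page-context feature more concrete, here is a minimal sketch in TypeScript. It is not the extension's actual code: the names `ModelOption`, `MODELS`, `modelsThatFit`, `buildMessages`, and the `maxContextChars` budget are all hypothetical and chosen for illustration. Only the model names and download sizes come from the listing above.

```typescript
// Hypothetical sketch; not taken from the extension's source.
type ChatMessage = { role: "system" | "user"; content: string };

interface ModelOption {
  name: string;
  downloadGB: number; // download sizes as stated in the listing
}

const MODELS: ModelOption[] = [
  { name: "Phi-3-mini", downloadGB: 2.3 },
  { name: "Qwen2.5-7B", downloadGB: 4.3 },
  { name: "Llama-3.1-8B", downloadGB: 4.9 },
];

// Return the models whose download fits in the given free storage (in GB).
function modelsThatFit(freeStorageGB: number): ModelOption[] {
  return MODELS.filter((m) => m.downloadGB <= freeStorageGB);
}

// Build a chat prompt that carries the current page's text as context.
// maxContextChars is an assumed budget, not a documented limit.
function buildMessages(
  pageText: string,
  question: string,
  maxContextChars: number = 4000,
): ChatMessage[] {
  // Trim whitespace, then cut the page text down to the budget.
  const context = pageText.trim().slice(0, maxContextChars);
  const system =
    context.length > 0
      ? `Answer using the page content below when relevant.\n\n${context}`
      : "Answer the user's question.";
  return [
    { role: "system", content: system },
    { role: "user", content: question },
  ];
}
```

For example, `modelsThatFit(4.5)` would leave Phi-3-mini and Qwen2.5-7B as options, and the array returned by `buildMessages(pageText, "Summarize this page")` has the role/content shape used by OpenAI-style chat APIs, which a locally running engine could plausibly consume. Truncating the page text is one simple way to keep the prompt within a model's context window; a real implementation might instead extract only the main article text.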
