LLMsTxt Generator

hkfhiobimmpeimihkebmpmppjlkofjie

Convert HTML pages to the llms.txt format (Markdown). LLMs read the generated llms.txt files to answer queries related to your pages. The extension scans your `sitemap.xml`, converts pages and live sites to LLM-optimized Markdown, and exports the result instantly. It generates a single llms.txt file and an llms-full.txt file for each page, including all the links, then automatically downloads all of the generated Markdown as a single .zip file.

- Parses your `sitemap.xml` and any nested sitemaps, following only valid `http(s)` URLs.
- Filters out non-HTTP links for focused scanning.
- Generates a single llms.txt file along with subpages for more detail.
- Zips all generated files and downloads them automatically as a single file once the scan is complete.
- Converts HTML pages into clean Markdown with **ATX-style headings** (`#`, `##`, …), fenced code blocks, and absolute URLs.
- Removes ``, ``, and `` tags.
- Resolves relative links and images to **full URLs** for seamless static `llms.txt` content generation.
- **Current Page Converter**
  - One-click "Convert Current Page" grabs the **rendered DOM** (supports SPA/React/Vue content).
  - Prepends `` as `# Heading` and `` as `> Blockquote`.
  - Ideal for ad-hoc page audits, AI training data extraction, and quick Markdown previews.
- Built-in **Embed** tab with snippets.
- Publish `llms-full.txt` files alongside your pages for easy LLM ingestion and SEO signals.
- Four tabs: **Generator**, **Current Page**, **Embed**, **About**.
- Real-time **progress bar** and **auto-scrolling log**.
- ⚠️ A user warning prevents accidental closure during scanning.
- **Copy to Clipboard** for instant Markdown transfer.
- 100% local conversion: no external servers, no tracking.
- Uses the Chrome MV3 Offscreen API for accurate DOM parsing.

1. **Auto-Detect** your sitemap URL (`https://your-site.com/sitemap.xml`) on secure pages.
2. **Offscreen Rendering** fetches pages in a hidden DOM, executing scripts for dynamic content.
3. **Clean & Normalize** HTML: strip unwanted nodes, normalize whitespace per text node.
4. **Convert to Markdown** with Turndown.
5. **Download or Copy** your domain's ZIP or current-page Markdown.

Please review the downloaded content before publishing it on your website. If you see something unwanted, please report it as an issue on the open-source repository: https://github.com/plainsignal/llmstxt

- **SEO & Content Marketing**: Ideal for content audits, static migrations, UTM tracking, and structured data extraction.
- **AI, LLM Workflows & RAG**: Prep training data, generate knowledge bases, accelerate AI-driven insights.
- **Developer Productivity**: Integrates with CI pipelines, GitHub Actions, and static site generators.
- **Flexibility & Extensibility**: Open-source under Apache 2.0. Visit https://github.com/plainsignal/llmstxt to contribute.

This is a community-driven extension. Open an issue if something is not working as expected.
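
The sitemap handling described above (following nested sitemaps, keeping only `http(s)` URLs, and resolving relative links to full URLs) can be illustrated with a small sketch. This is not the extension's own code, which runs in the browser; it is a Python approximation using only the standard library. The function names (`is_http_url`, `parse_sitemap`, `collect_pages`, `resolve_link`) and the `fetch` callable are hypothetical and exist only for this example.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urljoin, urlparse

# XML namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def is_http_url(url):
    """Return True only for absolute http(s) URLs."""
    parsed = urlparse(url)
    return parsed.scheme in ("http", "https") and bool(parsed.netloc)


def parse_sitemap(xml_text):
    """Split one sitemap document into (page URLs, nested sitemap URLs)."""
    root = ET.fromstring(xml_text)
    locs = [(el.text or "").strip() for el in root.iter(SITEMAP_NS + "loc")]
    locs = [url for url in locs if is_http_url(url)]  # drop mailto:, tel:, etc.
    if root.tag == SITEMAP_NS + "sitemapindex":
        return [], locs  # an index file lists other sitemaps, not pages
    return locs, []


def collect_pages(start_url, fetch, seen=None):
    """Walk a sitemap and its nested sitemaps, returning page URLs in order.

    `fetch` is a hypothetical callable mapping a URL to its XML text.
    `seen` guards against sitemaps that reference each other.
    """
    seen = set() if seen is None else seen
    if start_url in seen:
        return []
    seen.add(start_url)
    pages, nested = parse_sitemap(fetch(start_url))
    for child in nested:
        pages.extend(collect_pages(child, fetch, seen))
    return pages


def resolve_link(page_url, href):
    """Resolve a relative link or image path against the page URL."""
    return urljoin(page_url, href)


# Usage with in-memory documents instead of real network requests.
INDEX_XML = (
    '<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
    "<sitemap><loc>https://example.com/sitemap-pages.xml</loc></sitemap>"
    "</sitemapindex>"
)
PAGES_XML = (
    '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
    "<url><loc>https://example.com/</loc></url>"
    "<url><loc>mailto:hi@example.com</loc></url>"
    "<url><loc> https://example.com/docs </loc></url>"
    "</urlset>"
)
DOCUMENTS = {
    "https://example.com/sitemap.xml": INDEX_XML,
    "https://example.com/sitemap-pages.xml": PAGES_XML,
}

found = collect_pages("https://example.com/sitemap.xml", DOCUMENTS.__getitem__)
print(found)
print(resolve_link("https://example.com/docs/intro.html", "../img/logo.png"))
```

Passing `fetch` as a parameter keeps the traversal logic separate from networking, so the same walk can be exercised against canned XML as shown here or against a real HTTP client.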
