ProRank SEO

Robots & Indexing Complete Guide

Control how search engines and AI bots crawl and index your site with ProRank SEO's comprehensive robots and indexing tools

Overview

ProRank SEO's Robots & Indexing module provides complete control over how search engines and AI bots interact with your website. Located under Technical SEO → Robots & Indexingin your WordPress admin, this feature helps you manage crawling, indexing, and content protection.

With the rise of AI training bots in 2025, this module now includes comprehensive AI bot blocking capabilities alongside traditional SEO tools like robots.txt management and indexing APIs.

Robots & Indexing is a free feature. The base module is available in all tiers. Google Indexing API is disabled by default and requires a compile-time constant to enable.View licensing options

Key Features

Virtual Robots.txt Editor

Manage robots.txt rules without creating a physical file, with automatic conflict detection

AI Bot Protection

Block common AI/ML training bots from crawling your content with comprehensive protection

Indexing APIs

Submit URLs with IndexNow and advanced premium Google indexing support

Global Noindex Settings

Control which sections of your site appear in search results with granular settings

Content Safeguard

Protect your content from AI training with meta tags and rule-based noindex

Advanced Directives

Control snippet length, image preview, and other advanced robots meta directives

Module Tabs Overview

Available Tabs in Robots & Indexing

1. Robots.txt Tab

Virtual robots.txt editor with custom rules, AI bot blocking toggle, and automatic sitemap URL addition. Includes physical file detection to prevent conflicts.

2. Indexing APIs Tab

Configure IndexNow directly in the plugin for instant submission to Bing and Yandex. Premium Google indexing exists as an advanced backend path and is not part of the standard visible admin flow by default.

3. Global Noindex Tab

Set noindex rules for entire sections: post types, archives, and special pages like search results and 404 pages.

4. Content Safeguard Tab

AI protection meta tags (noai, noimageai) and rule-based noindex for thin content based on word count and age.

5. Help Tab

Built-in setup guidance for IndexNow, plus documentation links for advanced Google indexing behavior.

Quick Start Guide

Follow these steps to configure Robots & Indexing in ProRank SEO:

  1. Navigate to WordPress Admin → ProRank SEO → Technical SEO → Robots & Indexing
  2. In the Robots.txt tab, enable the virtual robots.txt editor if no physical file exists
  3. Toggle "Block AI/ML Training Bots" to protect your content from AI scraping
  4. Configure IndexNow for the standard indexing workflow. Use premium Google indexing only if you have explicitly enabled the advanced backend path.
  5. Set up Global Noindex rules for archives and special pages
  6. Enable Content Safeguard meta tags for additional AI protection
  7. Click Save Settings to apply your configuration

AI Protection Overview

🤖 2025 AI Bot Protection

ProRank SEO can block common AI/ML training bots via robots.txt rules, including:

OpenAI

  • • GPTBot
  • • ChatGPT-User
  • • OAI-SearchBot

Google AI

  • • Google-Extended
  • • Gemini-Bot
  • • Bard-Bot

Anthropic

  • • Claude-Web
  • • ClaudeBot
  • • Anthropic-AI

Image AI

  • • MidJourney-Bot
  • • DALL-E-Bot
  • • StableDiffusion-Bot

Plus 40+ additional AI crawlers, research bots, and training systems

Feature Availability

All Robots & Indexing features are available in the free tier unless noted otherwise.

Free (all tiers)

  • Virtual Robots.txt Editor
  • AI Bot Blocking
  • IndexNow API
  • Content Safeguard Meta Tags (noai, noimageai)
  • Rule-based Noindex (posts, pages, taxonomies, archives)
  • X-Robots-Tag Headers
  • Global Noindex controls

Premium only (disabled by default)

  • Google Indexing API — exists in the premium plugin only, disabled by default via PRORANK_ENABLE_GOOGLE_INDEXING constant

Important Concepts

Understanding Protection Methods

Robots.txt Blocking

How it works: Adds Disallow rules to completely block bots from crawling

Effectiveness: Very high - bots cannot access content at all

Downside: May block legitimate uses like AI-powered search

Meta Tag Protection

How it works: Adds noai/noimageai meta tags as polite requests

Effectiveness: Medium - respected by ethical companies

Benefit: Content remains accessible for legitimate uses

Combined Approach

For maximum protection, enable both robots.txt blocking and meta tags. This provides strong protection while signaling your preferences to all systems.

Common Use Cases

Content Publishers

  • ✓ Block AI training bots to protect original content
  • ✓ Use IndexNow for fast news indexing
  • ✓ Noindex thin or outdated content automatically

E-commerce Sites

  • ✓ Block faceted navigation from crawling
  • ✓ Noindex cart and checkout pages
  • ✓ Fast indexing for new products

Portfolio Sites

  • ✓ Protect creative work from AI training
  • ✓ Control image preview settings
  • ✓ Noindex attachment pages

Corporate Sites

  • ✓ Protect proprietary content
  • ✓ Control snippet length in search
  • ✓ Noindex internal search results

Pro Tip: Start with Content Safeguard meta tags (available in Free tier) for basic protection, then use the robots.txt editor and AI bot blocking settings for stronger control. Google Indexing API requires enabling via wp-config.php.