Skip to content
Legal & Pricing

About CopyleaksBot

CopyleaksBot is the official web crawler for Copyleaks. Its purpose is to discover and crawl publicly available web pages for our search services. We are committed to respecting the rules set forth by webmasters in their robots.txt files.

CopyleaksBot identifies itself with the following User-Agent string in its HTTP requests:

CopyleaksBot/1.0

To limit which pages Copyleaks can index, use your website’s robots.txt file. CopyleaksBot fully respects the robots.txt standard. You can use it to prevent our bot from indexing your entire site, specific directories, or individual pages. The User-agent token for our bot is CopyleaksBot.

  1. Add the following to your robots.txt file:

    User-agent: CopyleaksBot/1.0
    Disallow: /
  2. Block CopyleaksBot from a specific directory

    Section titled “Block CopyleaksBot from a specific directory”

    Add the following to your robots.txt file:

    User-agent: CopyleaksBot/1.0
    Disallow: /private-directory/
  3. Add the following to your robots.txt file:

    User-agent: CopyleaksBot/1.0
    Disallow: /path/to/page.html