About CopyleaksBot

CopyleaksBot is the official web crawler for Copyleaks. Its purpose is to discover and crawl publicly available web pages for our search services. We are committed to respecting the rules set forth by webmasters in their robots.txt files.

User Agent

CopyleaksBot identifies itself with the following User-Agent string in its HTTP requests:

CopyleaksBot/1.0

How to Control CopyleaksBot

To limit which pages Copyleaks can index, use your website’s robots.txt file. CopyleaksBot fully respects the robots.txt standard. You can use it to prevent our bot from indexing your entire site, specific directories, or individual pages. The User-agent token for our bot is CopyleaksBot.

Block CopyleaksBot from your entire site

Add the following to your robots.txt file:

User-agent: CopyleaksBot
Disallow: /

Block CopyleaksBot from a specific directory

Add the following to your robots.txt file:

User-agent: CopyleaksBot
Disallow: /private-directory/ 

Block CopyleaksBot from a single page

Add the following to your robots.txt file:

User-agent: CopyleaksBot
Disallow: /path/to/page.html

Contact Support

Get help with CopyleaksBot or other questions from our support team.

Ruby What's New

Get started

Guides

Using the APIs

Concepts

Resources

User Agent

How to Control CopyleaksBot

Contact Support

​User Agent

​How to Control CopyleaksBot

​Related Resources

Contact Support

User Agent

How to Control CopyleaksBot

Related Resources