Legal & Pricing
About CopyleaksBot
CopyleaksBot is the official web crawler for Copyleaks. Its purpose is to discover and crawl publicly available web pages for our search services. We are committed to respecting the rules set forth by webmasters in their robots.txt files.
User Agent
Section titled “User Agent”CopyleaksBot identifies itself with the following User-Agent string in its HTTP requests:
CopyleaksBot/1.0How to Control CopyleaksBot
Section titled “How to Control CopyleaksBot”To limit which pages Copyleaks can index, use your website’s robots.txt file. CopyleaksBot fully respects the robots.txt standard. You can use it to prevent our bot from indexing your entire site, specific directories, or individual pages. The User-agent token for our bot is CopyleaksBot.
-
Block CopyleaksBot from your entire site
Section titled “Block CopyleaksBot from your entire site”Add the following to your robots.txt file:
User-agent: CopyleaksBot/1.0Disallow: / -
Block CopyleaksBot from a specific directory
Section titled “Block CopyleaksBot from a specific directory”Add the following to your robots.txt file:
User-agent: CopyleaksBot/1.0Disallow: /private-directory/ -
Block CopyleaksBot from a single page
Section titled “Block CopyleaksBot from a single page”Add the following to your robots.txt file:
User-agent: CopyleaksBot/1.0Disallow: /path/to/page.html
Related Resources
Section titled “Related Resources” Contact Support Get help with CopyleaksBot or other questions from our support team.