The full site of Copyleaks Documentation

# AI Logic - Transparency in AI Detection

> A guide to using Copyleaks AI Logic to understand the reasoning behind AI detection results, providing transparency and enabling more confident decision-making.

In a world where the line between human and AI-generated content is increasingly blurred, transparency is paramount. Copyleaks AI Logic provides insight into the “why” behind our AI detection results. It moves beyond a simple probability score to reveal the specific patterns and characteristics that indicate the presence of AI-generated content.

## The Power of Explainability

AI Logic is designed to address the “black box” problem in AI detection. By providing a clear, data-driven explanation for each result, we empower you to:

* **Make Confident Decisions**: Understand the specific signals that led to a piece of content being flagged, allowing you to make more informed and defensible decisions.
* **Build Trust with Users**: Provide your users with a transparent explanation of why their content was flagged, fostering a sense of fairness and trust.
* **Facilitate Constructive Dialogue**: Use the detailed insights from AI Logic to have more productive conversations about the appropriate use of AI tools.

### How to Use AI Logic

AI Logic is available as an optional parameter in your AI detection API requests. When enabled, the API response includes an additional `patterns` object that contains a detailed breakdown of the AI-like and human-like patterns found in the text. This object includes:

* **Statistics**: A comparison of the statistical properties of the text against known AI and human writing patterns.
* **Textual Analysis**: The specific segments of the text that exhibit AI-like characteristics, including their exact location and length.

By analyzing this data, you can gain a deeper understanding of the AI’s assessment and build more sophisticated and transparent content integrity workflows.

### A New Level of Transparency

AI Logic represents a significant step forward in the field of AI detection. By providing a clear and understandable explanation for our results, we are empowering developers, educators, and content creators to navigate the evolving landscape of AI-generated content with confidence and integrity.

## Usage Explanation

Upon successful completion of the request, the API returns a response containing a `patterns` object that describes the AI-like and human-like patterns detected in the text.

**Example JSON Response**

```json
{
  "patterns": {
    "statistics": {
      "aiCount": [
        15.9636, 39.5495, 84.7079, 119.8710,
        9.9233, 185.6670, 14.4536, 19.1995
      ],
      "humanCount": [
        0.8076, 1.5076, 3.8228, 8.5071,
        0.3769, 4.2536, 0.3231, 1.1845
      ]
    },
    "text": {
      "chars": {
        "starts": [31, 55, 303, 909, 961, 987, 1129, 1775],
        "lengths": [23, 32, 23, 33, 25, 30, 30, 19]
      },
      "words": {
        "starts": [5, 9, 45, 135, 144, 148, 169, 257],
        "lengths": [4, 6, 3, 6, 4, 5, 5, 3]
      }
    }
  }
}
```

## Understanding the Response

The response provides data that helps explain why certain text patterns have been flagged as likely AI-generated. This information enables:

* Clear documentation explaining AI detection results
* Dialogue that leads to mutual understanding and trust
* Easy interpretation of detection results for quick action
* Collaboration and data-driven conclusions
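The `patterns` object in the example response can be turned into readable excerpts with a few lines of Python. This is a minimal sketch, not an official helper: `extract_flagged_segments` and the sample text are hypothetical, and it assumes that character offsets are zero-based and that the entry at each index of `aiCount` and `humanCount` belongs to the segment at the same index. The equal array lengths in the example suggest this pairing, but the text above does not state it.

```python
from typing import Any


def extract_flagged_segments(text: str, patterns: dict[str, Any]) -> list[dict[str, Any]]:
    """Pair each flagged character range with its excerpt and statistics.

    Assumption: offsets are zero-based and the statistics arrays are
    index-aligned with the segment arrays.
    """
    chars = patterns["text"]["chars"]
    stats = patterns.get("statistics", {})
    ai_counts = stats.get("aiCount", [])
    human_counts = stats.get("humanCount", [])

    segments = []
    for i, (start, length) in enumerate(zip(chars["starts"], chars["lengths"])):
        segments.append({
            "start": start,
            "length": length,
            "excerpt": text[start:start + length],
            "aiCount": ai_counts[i] if i < len(ai_counts) else None,
            "humanCount": human_counts[i] if i < len(human_counts) else None,
        })
    return segments


# Hypothetical input: a short text with one flagged range.
sample_text = "Intro. In today's digital landscape, things change."
sample_patterns = {
    "statistics": {"aiCount": [15.9636], "humanCount": [0.8076]},
    "text": {
        "chars": {"starts": [7], "lengths": [28]},
        "words": {"starts": [1], "lengths": [4]},
    },
}

segments = extract_flagged_segments(sample_text, sample_patterns)
print(segments[0]["excerpt"])
```

Keeping the excerpt next to its statistics makes it easier to show a reviewer exactly which phrase was flagged instead of only a score.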
## Next Steps

[API Reference](/reference/actions/writer-detector/check): Explore the full API reference for the AI Detection endpoint.

---

# AI Source Match

> A comprehensive guide to using Copyleaks AI Source Match to identify online sources suspected of containing AI-generated content, enhancing plagiarism detection capabilities.

In today’s digital landscape, the challenge of academic integrity has evolved beyond traditional plagiarism. AI Source Match extends plagiarism detection to identify online sources that may contain AI-generated content. This dual-layer detection system not only finds potential plagiarism but also reveals whether the matched source content itself might have been created by artificial intelligence.

## The Evolution of Plagiarism Detection

AI Source Match addresses a critical gap in modern content verification. As AI-generated content becomes more prevalent across the internet, traditional plagiarism detection tools may flag content against sources that are themselves artificially created. This creates a complex scenario where understanding the nature of the source material is as important as identifying the match itself.

By combining internet scanning capabilities with AI detection technology, AI Source Match provides:

* **Comprehensive Source Analysis**: Identify not just plagiarism, but the AI likelihood of the source material itself.
* **Enhanced Academic Integrity**: Make more informed decisions about content originality by understanding the nature of matched sources.
* **Improved Context**: Distinguish between copying from human-authored sources versus AI-generated content.
* **Advanced Reporting**: Provide detailed insights into both plagiarism patterns and source authenticity.
### Prerequisites and Configuration

AI Source Match builds upon existing Copyleaks functionality and requires specific configurations to operate effectively:

**Required Dependencies:**

* `scanning.internet` must be enabled to scan online sources
* `aiGenerateText.detect` must be activated for AI content detection

**Configuration Parameters:**

```json
{
  "aiSourceMatch": {
    "enable": true
  },
  "scanning": {
    "internet": true
  },
  "aiGenerateText": {
    "detect": true
  }
}
```

### How AI Source Match Works

When enabled, AI Source Match operates through a multi-step process:

1. **Internet Scanning**: The system performs comprehensive internet searches to identify potential source matches for the submitted content.
2. **Source Analysis**: Each identified source is analyzed using AI detection algorithms to determine the likelihood that the source content was AI-generated.
3. **Dual Reporting**: Results provide both traditional plagiarism metrics and AI probability scores for the matched sources.
4. **Enhanced Insights**: Users receive detailed information about both the plagiarism match and the authenticity of the source material.

## Understanding AI Source Match Results

The AI Source Match feature enhances standard plagiarism reports by adding a layer of source authenticity analysis. This enables educators, publishers, and content managers to:

**Identify Complex Plagiarism Scenarios:**

* Detect when students copy from AI-generated sources
* Understand if matched content originates from authentic human sources
* Distinguish between different types of content integrity issues

**Make Informed Decisions:**

* Assess the severity of plagiarism based on source authenticity
* Develop appropriate responses based on the nature of the matched content
* Build comprehensive content integrity policies

### Best Practices for Implementation

To maximize the effectiveness of AI Source Match:

**For Educators:**

* Use results to facilitate discussions about AI usage and academic integrity
* Consider source authenticity when determining appropriate responses to plagiarism
* Develop clear policies that address both traditional plagiarism and AI-source copying

**For Publishers:**

* Implement AI Source Match as part of content verification workflows
* Use insights to understand content originality in submissions
* Maintain transparency about detection methods with contributors

**For Content Managers:**

* Integrate AI Source Match into content quality assurance processes
* Use results to assess content authenticity across digital platforms
* Develop comprehensive content integrity standards

## Technical Implementation

AI Source Match integrates with existing Copyleaks API implementations. When enabled, the feature automatically enhances plagiarism detection results without requiring additional API calls or modifications to your existing workflow.
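Because AI Source Match depends on two other settings, it can help to check a properties payload before submitting it. The following is a minimal sketch under stated assumptions: the helper names `build_ai_source_match_properties` and `missing_dependencies` are hypothetical (they are not part of any Copyleaks SDK), and the dictionary simply mirrors the configuration parameters and required dependencies documented for this feature.

```python
from typing import Any


def build_ai_source_match_properties() -> dict[str, Any]:
    """Return the documented configuration with both dependencies enabled."""
    return {
        "aiSourceMatch": {"enable": True},
        "scanning": {"internet": True},
        "aiGenerateText": {"detect": True},
    }


def missing_dependencies(properties: dict[str, Any]) -> list[str]:
    """List the required settings that are absent or disabled."""
    missing = []
    if not properties.get("scanning", {}).get("internet"):
        missing.append("scanning.internet")
    if not properties.get("aiGenerateText", {}).get("detect"):
        missing.append("aiGenerateText.detect")
    return missing


complete = build_ai_source_match_properties()
incomplete = {"aiSourceMatch": {"enable": True}}

print(missing_dependencies(complete))
print(missing_dependencies(incomplete))
```

A check like this runs locally, so a misconfigured request can be caught before any scan is submitted.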
**Key Implementation Notes:**

* Default setting is `false`; the feature must be explicitly enabled
* Requires active internet scanning and AI detection capabilities
* Results are included in standard plagiarism detection response objects
* No additional authentication or setup required beyond existing API access

## Benefits and Use Cases

AI Source Match provides value across multiple scenarios:

**Academic Institutions:**

* Enhanced student submission evaluation
* Improved academic integrity enforcement
* Better understanding of content originality issues

**Publishing Industry:**

* Comprehensive manuscript authenticity verification
* Source material quality assessment
* Content originality validation

**Content Platforms:**

* User-generated content quality control
* Authenticity verification for submissions
* Platform integrity maintenance

---

# Data Hubs

> Learn how to compare multiple documents against each other using Copyleaks' private and shared databases to find similarities and prevent plagiarism.

Copyleaks’ Data Hubs provide a powerful way to compare multiple documents against each other, allowing you to detect similarities and prevent plagiarism within a large batch of content. This is particularly useful for educators who want to check whether students have shared work or submitted identical content across a batch of assignments, and for companies that need to find duplication across large document collections.

## How It Works

Copyleaks provides two types of databases for storing and comparing documents:

* **Shared Data Hub**: Global database that contains millions of documents from institutions worldwide.
* **Private Cloud Hub**: Private database that is exclusive to your organization, ensuring that your documents remain confidential and secure.

You can contribute documents to those databases and compare your documents against them.

**Note:** You can use both databases simultaneously to maximize detection coverage while keeping sensitive documents private.

## Understanding Your Database Options

You have two database options for storing and comparing your documents:

### Shared Data Hub (Free)

* Contains millions of documents from institutions worldwide
* When you index a document, it becomes available for **everyone** to compare against
* Contributes to the global academic integrity community
* Your documents will be matched against submissions from other institutions

### Private Cloud Hub (Paid)

* Creates a completely **private database** for your organization only
* Your documents stay within your private environment
* Perfect for sensitive or confidential documents
* Only you and your organization can access and compare against these documents
* Built for large organizations looking to securely store and manage documents
* Enables team collaboration with controlled access and user management

**Note:** You can use both databases simultaneously. Your documents can be stored in your Private Cloud Hub while also being compared against the Shared Data Hub for maximum detection coverage.

## How Cross-Comparison Works

The process involves two main steps:

1. **📥 Index your documents**: Upload documents to your chosen database using `IndexOnly` mode.
2. **🚀 Start the comparison**: Run a scan that compares all indexed documents against each other and your selected databases.

This two-step approach ensures all documents are properly stored before the comparison begins.

## 🚀 Get Started

1. 
#### Before you begin

Before you start, ensure you have the following:

* An active Copyleaks account. If you don’t have one, **[sign up for free](https://api.copyleaks.com/signup)**.
* Your API key, which you can find on the **[API Dashboard](https://api.copyleaks.com/dashboard)**.

2. #### Installation

Choose your preferred method for making API calls.

* HTTP

  You can interact with the API using any standard HTTP client. For a quicker setup, we provide a Postman collection. See our [Postman guide](/resources/postman) for instructions.

* cURL

  * Ubuntu/Debian

    ```bash
    sudo apt-get install curl
    ```

  * Windows

    Download it from [curl.se](https://curl.se).

  * macOS

    ```bash
    brew install curl
    ```

* Python

  ```bash
  pip install copyleaks
  ```

* JavaScript

  ```bash
  npm install plagiarism-checker
  ```

* Java

  [Download from Maven](https://central.sonatype.com/artifact/com.copyleaks.sdk/copyleaks-java-sdk?smo=true)

3. #### Login

To perform a scan, we first need to generate an access token. For that, we will use the [login](/reference/actions/account/login) endpoint. The API key can be found on the [Copyleaks API Dashboard](https://api.copyleaks.com/dashboard).

Upon successful authentication, you will receive a token that must be attached to subsequent API calls via the `Authorization: Bearer YOUR_LOGIN_TOKEN` header. This token remains valid for 48 hours.
* HTTP

  ```http
  POST https://id.copyleaks.com/v3/account/login/api
  Content-Type: application/json

  {
    "email": "your@email.address",
    "key": "00000000-0000-0000-0000-000000000000"
  }
  ```

* cURL

  ```bash
  export COPYLEAKS_EMAIL="your@email.address"
  export COPYLEAKS_API_KEY="your-api-key-here"

  curl --request POST \
    --url https://id.copyleaks.com/v3/account/login/api \
    --header 'Accept: application/json' \
    --header 'Content-Type: application/json' \
    --data "{
      \"email\": \"${COPYLEAKS_EMAIL}\",
      \"key\": \"${COPYLEAKS_API_KEY}\"
    }"
  ```

* Python

  ```python
  from copyleaks.copyleaks import Copyleaks

  EMAIL_ADDRESS = "your@email.address"
  API_KEY = "your-api-key-here"

  # Login to Copyleaks
  auth_token = Copyleaks.login(EMAIL_ADDRESS, API_KEY)
  print("Logged successfully!\nToken:", auth_token)
  ```

* JavaScript

  ```javascript
  const { Copyleaks } = require('plagiarism-checker');

  const EMAIL_ADDRESS = "your@email.address";
  const API_KEY = "your-api-key-here";

  async function login() {
    const copyleaks = new Copyleaks();
    const loginResult = await copyleaks.loginAsync(EMAIL_ADDRESS, API_KEY);
    console.log('Logged successfully!\nToken:', loginResult);
    return loginResult;
  }
  ```

* Java

  ```java
  import com.copyleaks.sdk.api.Copyleaks;
  import com.copyleaks.sdk.api.exceptions.CommandException;

  String EMAIL_ADDRESS = "your@email.address";
  String API_KEY = "00000000-0000-0000-0000-000000000000";

  // Login to Copyleaks
  try {
      String authToken = Copyleaks.login(EMAIL_ADDRESS, API_KEY);
      System.out.println("Logged successfully!\nToken: " + authToken);
  } catch (CommandException e) {
      System.out.println("Failed to login: " + e.getMessage());
      System.exit(1);
  }
  ```

**Response**

```json
{
  "access_token": "",
  ".issued": "2025-07-31T10:19:40.0690015Z",
  ".expires": "2025-08-02T10:19:40.0690016Z"
}
```

**Note:** Save this token! It’s valid for 48 hours and can be reused for subsequent API calls.

4. 
#### Index Your Documents

For each document you want to include in the comparison, submit it for indexing using one of the submit endpoints (`submit-file`, `submit-url`, or `submit-ocr`). Set `properties.action` to `2` (`IndexOnly`) to store the document without scanning it immediately. This avoids consuming scan credits during the indexing phase. You also need to specify which repository to index the document into.

**Caution:** Any other scanning options (like `internet` or `aiDetection`) must be configured during this indexing step. They cannot be changed later when you start the comparison scan.

* HTTP

  ```http
  PUT https://api.copyleaks.com/v3/scans/submit/file/my-index-scan-1
  Content-Type: application/json
  Authorization: Bearer YOUR_LOGIN_TOKEN

  {
    "base64": "SGVsbG8gd29ybGQh",
    "filename": "document1.txt",
    "properties": {
      "action": 2,
      "indexing": {
        "repositories": ["my-repo-id"]
      },
      "sandbox": true
    }
  }
  ```

* cURL

  ```bash
  curl --request PUT \
    --url https://api.copyleaks.com/v3/scans/submit/file/my-index-scan-1 \
    -H "Authorization: Bearer YOUR_LOGIN_TOKEN" \
    -H "Content-Type: application/json" \
    -d '{
      "base64": "SGVsbG8gd29ybGQh",
      "filename": "document1.txt",
      "properties": {
        "action": 2,
        "indexing": {
          "repositories": ["my-repo-id"]
        },
        "sandbox": true
      }
    }'
  ```

* Python

  ```python
  from copyleaks.copyleaks import Copyleaks
  from copyleaks.models.submit.document import FileDocument
  from copyleaks.models.submit.properties.scan_properties import ScanProperties
  from copyleaks.models.submit.properties.indexing_properties import IndexingProperties

  scan_id = "my-index-scan-1"

  properties = ScanProperties()
  properties.set_action(2)  # IndexOnly
  properties.set_sandbox(True)

  # Choose the repository that will store the document
  indexing = IndexingProperties()
  indexing.add_repository("my-repo-id")
  properties.set_indexing(indexing)

  file_submission = FileDocument(
      base64="SGVsbG8gd29ybGQh",
      filename="document1.txt",
      properties=properties,
  )

  # auth_token is the token returned by the login step
  response = Copyleaks.Scans.submit_file(auth_token, scan_id, file_submission)
  print(response)
  ```

You will need to wait for the `IndexOnly` webhook for each document to confirm it has been successfully indexed before proceeding to the next step.

5. #### Start Your Cross-Comparison

Once all your documents are indexed, make a `PATCH` request to the [`/v3/scans/start`](/reference/actions/scans/start/) endpoint. This will begin the comparison scan for all the documents you indexed. Provide the list of `scanId`s from the previous step in the `trigger` array.

* HTTP

  ```http
  PATCH https://api.copyleaks.com/v3/scans/start
  Content-Type: application/json
  Authorization: Bearer YOUR_LOGIN_TOKEN

  {
    "trigger": [
      "my-index-scan-1",
      "my-index-scan-2",
      "my-index-scan-3"
    ],
    "errorHandling": 0
  }
  ```

* cURL

  ```bash
  curl --request PATCH \
    --url https://api.copyleaks.com/v3/scans/start \
    -H "Authorization: Bearer YOUR_LOGIN_TOKEN" \
    -H "Content-Type: application/json" \
    -d '{
      "trigger": [
        "my-index-scan-1",
        "my-index-scan-2",
        "my-index-scan-3"
      ],
      "errorHandling": 0
    }'
  ```

6. #### Interpreting The Results

A successful `200 OK` response from the `start` endpoint will confirm which scans were started. The actual scan results for each document will be delivered asynchronously via the `Completed` webhook, just like a regular scan.

**Example Success Response from `/v3/scans/start`:**

```json
{
  "success": [
    "my-index-scan-1",
    "my-index-scan-2",
    "my-index-scan-3"
  ],
  "failed": []
}
```

7. #### 🎉Congratulations!

You have successfully started a cross-comparison scan between multiple documents in your Data Hub.

## 👥 Team Collaboration with Private Cloud Hub

Multiple users can access, scan against, and index to your Private Cloud Hub.
Manage permissions and data masking settings through the [admin dashboard](https://admin.copyleaks.com/repositories).

## 💡 Best Practices

* **Plan your scanning options**: Configure settings during indexing.
* **Monitor indexing progress**: Wait for all `IndexOnly` webhooks before starting the comparison.
* **Choose your database strategy**: Decide whether to use Private, Shared, or both.
* **Batch efficiently**: Group related documents together.
* **Respect API limits**: Monitor your [API dashboard](https://api.copyleaks.com/dashboard).

## 🚀 Next Steps

[Create Private Cloud Hub](https://admin.copyleaks.com/repositories): Set up your own private database for document storage.

## Support

Should you require any assistance, please contact [Copyleaks Support](https://help.copyleaks.com/hc/en-us/requests/new) or ask a question on [StackOverflow](https://stackoverflow.com/questions/tagged/copyleaks-api) with the `copyleaks-api` tag.

## Schedule a Live Demo

Want to see how Data Hubs can help you manage and compare your documents? Our technical team can walk you through live examples of setting up a Private Cloud Hub, indexing large batches of content, and running cross-comparisons in a secure environment.

[Book a Demo](https://copyleaks.com/book-a-demo)

---

# Excluding and Preventing Indexing of Content

> Learn how to exclude parts of a document from a scan and how to prevent documents from being added to the Copyleaks Internal Database.

You have granular control over what content is scanned and what data is stored when you submit a document to Copyleaks. This guide covers two distinct types of exclusion:

1. **Excluding Parts of a Document from Scan Analysis**: This allows you to refine the plagiarism scan by ignoring specific elements like quotes or code blocks.
2. **Preventing a Document from Being Indexed**: This allows you to control whether the entire document is added to the Copyleaks Internal Database for future comparisons.

***

### Exclude Options

The `exclude` object can contain the following boolean properties:

* `quotes`: If set to `true`, all text within quotation marks will be ignored.
* `citations`: If set to `true`, citations and references will be ignored.
* `references`: If set to `true`, the bibliography or reference list will be ignored.
* `tableOfContents`: If set to `true`, the table of contents will be ignored.
* `titles`: If set to `true`, titles and headings will be ignored.
* `code`: If set to `true`, code blocks will be ignored.

### Request Example

* HTTP

  ```http
  PUT https://api.copyleaks.com/v3/scans/submit/file/my-scan-exclude-example
  Content-Type: application/json
  Authorization: Bearer YOUR_LOGIN_TOKEN

  {
    "base64": "VGhpcyBpcyBhIHRlc3QgZmlsZS4gIkhlbGxvLCB3b3JsZCEiIGlzIGEgcXVvdGUuIChTbWl0aCwgMjAyMyk=",
    "filename": "document-with-exclusions.txt",
    "properties": {
      "webhooks": {
        "status": "https://my-server.com/webhook/{STATUS}"
      },
      "exclude": {
        "quotes": true,
        "citations": true
      },
      "sandbox": true
    }
  }
  ```

* cURL

  ```bash
  curl --request PUT \
    --url https://api.copyleaks.com/v3/scans/submit/file/my-scan-exclude-example \
    --header 'Authorization: Bearer YOUR_LOGIN_TOKEN' \
    --header 'Content-Type: application/json' \
    --data '{
      "base64": "VGhpcyBpcyBhIHRlc3QgZmlsZS4gIkhlbGxvLCB3b3JsZCEiIGlzIGEgcXVvdGUuIChTbWl0aCwgMjAyMyk=",
      "filename": "document-with-exclusions.txt",
      "properties": {
        "webhooks": {
          "status": "https://my-server.com/webhook/{STATUS}"
        },
        "exclude": {
          "quotes": true,
          "citations": true
        },
        "sandbox": true
      }
    }'
  ```

### Response Example

When the scan is processed, the `scannedDocument` object in the response will reflect the number of words that were excluded.

**201 Created**: The scan was successfully created and is now processing. The excluded word count is reflected in the response.

#### Example Response

A typical response from this endpoint:

```json
{
  "scannedDocument": {
    "scanId": "my-scan-exclude-example",
    "totalWords": 8,
    "totalExcluded": 4,
    "credits": 0,
    "expectedCredits": 1,
    "creationTime": "2025-08-10T10:00:00.000000Z",
    "metadata": {
      "filename": "document-with-exclusions.txt"
    },
    "enabled": {
      "plagiarismDetection": true,
      "aiDetection": false,
      "explainableAi": false,
      "writingFeedback": false,
      "pdfReport": true,
      "cheatDetection": false,
      "aiSourceMatch": false
    },
    "detectedLanguage": "en"
  },
  "results": {
    "score": {
      "identicalWords": 0,
      "minorChangedWords": 0,
      "relatedMeaningWords": 0,
      "aggregatedScore": 0
    },
    "internet": [],
    "database": [],
    "batch": [],
    "repositories": []
  },
  "notifications": {},
  "writingFeedback": {},
  "status": 0,
  "developerPayload": ""
}
```

## 🗺️ Next Steps

[Webhooks Overview](/reference/data-types/authenticity/webhooks/overview/): Learn how to securely receive and process notifications from Copyleaks.

[Viewing Scan Results](/concepts/features/how-to-display/): Understand the scan result format and how to display it to your users.

---

# Export PDF Report

> This document provides a comprehensive overview of the essential steps for creating and customizing PDF reports, setting up report generation, handling webhooks, and managing completed reports.
## Introduction

The PDF API allows you to generate detailed and customizable PDF reports of scan results, including plagiarism checks, AI detection, and writing assistant feedback. These reports can be branded with your own logo and customized to match your organization’s needs.

The PDF generation process is integrated with the Copyleaks scanning process: you enable PDF creation during scan submission and then receive the generated PDF through webhooks. The Copyleaks API also offers different PDF report versions, with version 3 being the latest and most feature-rich option that includes advanced formatting and comprehensive analysis visualization.

## Before you begin

Before you start using the PDF API, ensure you have the following:

1. An active Copyleaks account: If you don’t have one, [sign up here](https://copyleaks.com).
2. Familiarity with RESTful API principles: Basic knowledge of HTTP requests and responses.
3. A tool for HTTP requests: Use tools like cURL, Postman, or Copyleaks’ SDK.

## Installations

* HTTP

  [![Run In Postman](https://run.pstmn.io/button.svg)](https://god.gw.postman.com/run-collection/25022653-e2c2987f-c1b5-462f-8590-71d28b7f4b32?action=collection%2Ffork\&source=rip_markdown\&collection-url=entityId%3D25022653-e2c2987f-c1b5-462f-8590-71d28b7f4b32%26entityType%3Dcollection%26workspaceId%3D21147b7c-8cb3-4566-8f5d-8dfc6926eb5e)

* cURL

  * Ubuntu/Debian

    ```bash
    sudo apt-get install curl
    ```

  * Windows

    Download it from [curl.se](https://curl.se).

  * macOS

    ```bash
    brew install curl
    ```

* Python

  ```bash
  pip install copyleaks
  ```

* JavaScript

  ```bash
  npm i plagiarism-checker
  ```

* Java

  [Download from Maven](https://search.maven.org/artifact/com.copyleaks.sdk/copyleaks-java-sdk) or `git clone https://github.com/Copyleaks/Java-Plagiarism-Checker.git`

## Login

To enable PDF report generation, we first need to generate an access token. We will use the [login](/reference/actions/account/login) endpoint. The API key can be found on the [Copyleaks API Dashboard](https://api.copyleaks.com/dashboard).

Upon successful authentication, you will receive a token that must be attached to subsequent API calls via the `Authorization: Bearer YOUR_LOGIN_TOKEN` header. This token remains valid for 48 hours.

**Tip:** To boost performance, cache your login token and reuse it for all requests. The token remains valid for up to 48 hours, so you don’t need to log in repeatedly. The login method has stricter rate limits than other endpoints.

* HTTP

  ```http
  POST https://id.copyleaks.com/v3/account/login/api
  Content-Type: application/json

  {
    "email": "",
    "key": ""
  }
  ```

* cURL

  ```bash
  curl -X POST "https://id.copyleaks.com/v3/account/login/api" \
    -H "Content-Type: application/json" \
    -d '{
      "email": "",
      "key": ""
    }'
  ```

* Python

  ```python
  from copyleaks.copyleaks import Copyleaks
  from copyleaks.exceptions.command_error import CommandError
  from copyleaks.models.submit.document import FileDocument
  from copyleaks.models.submit.properties.scan_properties import ScanProperties
  from copyleaks.models.export import Export, ExportCrawledVersion, ExportResult, ExportPDF
  import base64
  import random

  EMAIL_ADDRESS = ""
  API_KEY = ""

  # Login to Copyleaks
  try:
      auth_token = Copyleaks.login(EMAIL_ADDRESS, API_KEY)
  except CommandError as ce:
      response = ce.get_response()
      print(f"An error occurred (HTTP status code {response.status_code}):")
      print(response.content)
      exit(1)

  print("Logged successfully!\nToken:")
  print(auth_token)
  ```

* JavaScript

  ```javascript
  const copyleaks = require('copyleaks');

  const EMAIL_ADDRESS = "";
  const API_KEY = "";

  // Login to Copyleaks
  const login = async () => {
    try {
      const authToken = await copyleaks.login(EMAIL_ADDRESS, API_KEY);
      console.log('Logged successfully!\nToken:', authToken);
      return authToken;
    } catch (error) {
      console.error('Failed to login:', error);
      process.exit(1);
    }
  };
  ```

* Java

  ```java
  import com.copyleaks.sdk.api.Copyleaks;
  import com.copyleaks.sdk.api.exceptions.CommandException;
  import com.copyleaks.sdk.api.models.ScanProperties;
  import com.copyleaks.sdk.api.models.FileSubmission;
  import com.copyleaks.sdk.api.models.Export;
  import com.copyleaks.sdk.api.models.ExportCrawledVersion;
  import com.copyleaks.sdk.api.models.ExportResult;
  import com.copyleaks.sdk.api.models.ExportPDF;
  import java.util.Base64;
  import java.util.Arrays;

  String EMAIL_ADDRESS = "";
  String API_KEY = "";

  // Login to Copyleaks
  try {
      String authToken = Copyleaks.login(EMAIL_ADDRESS, API_KEY);
      System.out.println("Logged successfully!\nToken: " + authToken);
  } catch (CommandException e) {
      System.out.println("Failed to login: " + e.getMessage());
      System.exit(1);
  }
  ```

## Submit Scan with PDF Report Enabled

Use the [submit file](/reference/actions/scans/submit-file/) endpoint to send content for analysis while enabling PDF report generation. The key difference for PDF reports is including the `properties.pdf.create` parameter set to `true`.

In the URL, supply your chosen scan ID, which serves as the identifier for the scan. Each scan needs to have a unique scan ID.

Set `properties.pdf.version` to `3` to use the latest PDF report format with enhanced visuals and comprehensive data visualization. The `properties.pdf.title` parameter lets you customize the title that appears on the PDF report.
For branding purposes, you can include your organization’s logo using `properties.pdf.largeLogo` (PNG format, base64 encoded, max 100kb, recommended size 185x50px).

Note: PDF reports support more customization options than this guide covers.

For this tutorial, we also set `properties.sandbox` to `true` to enable sandbox mode. Sandbox mode is free to use, but it returns mock results, which makes it helpful while you integrate with the Copyleaks API.

* HTTP

```http
PUT https://api.copyleaks.com/v3/scans/submit/file/
Authorization: Bearer 
Content-Type: application/json

{
  "base64": "",
  "filename": "",
  "properties": {
    "webhooks": {
      "status": "https://your.server/webhook?event={STATUS}"
    },
    "sandbox": true,
    "pdf": {
      "create": true,
      "version": 3,
      "title": "Custom PDF Report Title",
      "largeLogo": ""
    }
  }
}
```

* cURL

```bash
curl -X PUT "https://api.copyleaks.com/v3/scans/submit/file/" \
  -H "Authorization: Bearer " \
  -H "Content-Type: application/json" \
  -d '{
    "base64": "",
    "filename": "",
    "properties": {
      "webhooks": {
        "status": "https://your.server/webhook?event={STATUS}"
      },
      "sandbox": true,
      "pdf": {
        "create": true,
        "version": 3,
        "title": "Custom PDF Report Title",
        "largeLogo": ""
      }
    }
  }'
```

* Python

```python
# Submit a file for scanning with PDF generation enabled
scan_id = ""
file_name = ""
# Or read your file and convert it into its base64 representation.
base64_file_content = base64.b64encode(b'Hello world.').decode('utf8')

print("Submitting a new file...")
file_submission = FileDocument(base64_file_content, file_name)

# Set scan properties with PDF options
scan_properties = ScanProperties('https://your.server/webhook?event={STATUS}')
scan_properties.set_sandbox(True)  # Turn on sandbox mode. Turn off in production.

# Enable PDF report generation
scan_properties.set_pdf({
    "create": True,
    "version": 3,
    "title": "Custom PDF Report Title"
    # Add "largeLogo" (base64-encoded PNG) if needed
})

file_submission.set_properties(scan_properties)

# Submit the file for scanning
Copyleaks.submit_file(auth_token, scan_id, file_submission)
print("Sent to scanning with PDF report enabled")
print("You will be notified, using your webhook, once the scan is completed.")
```

* JavaScript

```javascript
// Submit a file for scanning with PDF report enabled
const scanId = ""; // Replace with your unique scan ID
const filename = "";
const fileContent = Buffer.from('Hello world').toString('base64'); // Convert file content to base64

const submitFile = async (authToken) => {
  const scanProperties = new copyleaks.ScanProperties('https://your.server/webhook?event={STATUS}');
  scanProperties.setSandbox(true); // Enable sandbox mode for testing

  // Enable PDF report generation
  scanProperties.setPDF({
    create: true,
    version: 3,
    title: "Custom PDF Report Title"
    // Add largeLogo if needed
  });

  const fileSubmission = new copyleaks.FileSubmission(fileContent, filename);
  fileSubmission.setProperties(scanProperties);

  try {
    await copyleaks.submitFile(authToken, scanId, fileSubmission);
    console.log('File submitted for scanning with PDF report enabled. Scan ID:', scanId);
  } catch (error) {
    console.error('Failed to submit file:', error);
  }
};
```

* Java

```java
// Submit a file for scanning with PDF report enabled
String scanId = ""; // Replace with your unique scan ID
String filename = "";
String fileContent = Base64.getEncoder().encodeToString("Hello world".getBytes()); // Convert file content to base64

ScanProperties scanProperties = new ScanProperties("https://your.server/webhook?event={STATUS}");
scanProperties.setSandbox(true); // Enable sandbox mode for testing

// Enable PDF report generation
Map<String, Object> pdfProperties = new HashMap<>();
pdfProperties.put("create", true);
pdfProperties.put("version", 3);
pdfProperties.put("title", "Custom PDF Report Title");
// Add largeLogo if needed
scanProperties.setPDF(pdfProperties);

FileSubmission fileSubmission = new FileSubmission(fileContent, filename);
fileSubmission.setProperties(scanProperties);

try {
    Copyleaks.submitFile(authToken, scanId, fileSubmission);
    System.out.println("File submitted for scanning with PDF report enabled. Scan ID: " + scanId);
} catch (CommandException e) {
    System.out.println("Failed to submit file: " + e.getMessage());
}
```

## PDF Customization Options

The PDF report can be extensively customized to match your organization’s branding and requirements.

| Property    | Type            | Description | Default |
| ----------- | --------------- | ----------- | ------- |
| `create`    | boolean         | Requests a customizable export of the scan report in PDF format. Set to `true` to generate a PDF report for this scan. | false |
| `title`     | string          | Customize the title for the PDF report. Maximum 256 characters. | null |
| `colors`    | object          | Object containing color customization options for the PDF report. | - |
| `largeLogo` | string (base64) | Customize the logo image in the PDF report. Only supports **png** format. Max file size: 100kb. Recommended size: width 185px, height 50px. | null |
| `rtl`       | boolean         | When set to true, the text in the report will be aligned from right to left. | false |
| `version`   | integer         | PDF version to generate. By default, version 1 is generated because it is the current stable version. Version 3 is the latest iteration of the PDF report. Available values: **1**, **2**, **3** (Beta) | 1 |

## Wait For Scan Completion

The scan and PDF report generation may take seconds to minutes, depending on the content, features used, and products enabled. Once the scan completes successfully, the Copyleaks API sends a completed webhook to the URL you supplied at submission under `properties.webhooks.status`, with **{STATUS}** replaced by “completed”. The completed webhook holds summary information about the scan, such as the number of matched words, total words, and results found.

If the scan finishes with an error, an error webhook is sent to the `properties.webhooks.status` URL, with **{STATUS}** replaced by **error**.

Note: For testing purposes, we recommend using a third-party service such as **request bin** or **ngrok**.

## Exporting PDF Reports

Use the [export](/reference/actions/downloads/export/) method to retrieve the generated PDF report along with other scan artifacts. The export method sends webhooks with each artifact’s content to your specified target server.

In the URL, supply the scan ID used earlier in the submit endpoint. You choose a unique export ID for each export.
For PDF reports specifically, add a `pdf` section to the export request that provides the endpoint where the PDF should be sent when it is ready.

* HTTP

```http
POST https://api.copyleaks.com/v3/downloads//export/
Authorization: Bearer 
Content-Type: application/json

{
  "completionWebhook": "https://your.server/webhook/export/completion",
  "pdf": {
    "endpoint": "https://your.server/webhook/export/pdf",
    "verb": "POST",
    "headers": [
      ["key", "value"],
      ["key2", "value2"]
    ]
  }
}
```

* cURL

```bash
curl -X POST "https://api.copyleaks.com/v3/downloads//export/" \
  -H "Authorization: Bearer " \
  -H "Content-Type: application/json" \
  -d '{
    "completionWebhook": "https://your.server/webhook/export/completion",
    "pdf": {
      "endpoint": "https://your.server/webhook/export/pdf",
      "verb": "POST",
      "headers": [
        ["key", "value"],
        ["key2", "value2"]
      ]
    }
  }'
```

* Python

```python
# Export scan results including PDF report
export_id = ""
export = Export()
export.set_completion_webhook('https://your.server/webhook/export/completion')

# Export PDF report
pdf_export = ExportPDF()
pdf_export.set_endpoint('https://your.server/webhook/export/pdf')
pdf_export.set_verb('POST')
pdf_export.set_headers([['key', 'value'], ['key2', 'value2']])  # optional
export.set_pdf(pdf_export)

# Trigger the export
Copyleaks.export(auth_token, scan_id, export_id, export)
print("Export initiated. You will be notified via webhook once the export is completed.")
```

* JavaScript

```javascript
// Export scan results including PDF report
const exportId = '';

const exportResults = async (authToken, scanId) => {
  const exportRequest = new copyleaks.Export();
  exportRequest.setCompletionWebhook('https://your.server/webhook/export/completion');

  // Export PDF report
  const pdfExport = new copyleaks.ExportPDF();
  pdfExport.setEndpoint('https://your.server/webhook/export/pdf');
  pdfExport.setVerb('POST');
  exportRequest.setPDF(pdfExport);

  try {
    await copyleaks.export(authToken, scanId, exportId, exportRequest);
    console.log('Export initiated with PDF report. Export ID:', exportId);
  } catch (error) {
    console.error('Failed to export results:', error);
  }
};
```

* Java

```java
// Export scan results including PDF report
String exportId = "";

Export exportRequest = new Export();
exportRequest.setCompletionWebhook("https://your.server/webhook/export/completion");

// Export PDF report
ExportPDF pdfExport = new ExportPDF();
pdfExport.setEndpoint("https://your.server/webhook/export/pdf");
pdfExport.setVerb("POST");
exportRequest.setPDF(pdfExport);

try {
    Copyleaks.export(authToken, scanId, exportId, exportRequest);
    System.out.println("Export initiated with PDF report. Export ID: " + exportId);
} catch (CommandException e) {
    System.out.println("Failed to export results: " + e.getMessage());
}
```

## Next Steps

* [Webhooks Overview](/reference/data-types/authenticity/webhooks/overview/): Learn about webhooks in the Copyleaks API and how to handle real-time notifications.
* [Export Method](/reference/actions/downloads/export/): Understand the export method for retrieving various scan artifacts including PDF reports.
* [Authenticity API Overview](/reference/actions/scans/overview/): Learn about the comprehensive scanning API that supports multiple products and features.
## Support

Should you require any assistance or have inquiries, please contact [Copyleaks Support](https://help.copyleaks.com/hc/en-us/requests/new) or ask a question on [StackOverflow](https://stackoverflow.com/questions/tagged/copyleaks-api) with the `copyleaks-api` tag. We appreciate your interest in Copyleaks and look forward to supporting your efforts to maintain originality and integrity.

---

# GenAI Scan Overview

> Overview of Copyleaks' GenAI Scan feature, which provides AI-generated insights into scan results, including plagiarism, AI detection, and writing quality.

Copyleaks’ Gen AI automatically reviews and summarizes each scan. It highlights the main data points, identifies key insights, and uses the author’s past work (if available) to add helpful context. This makes it easy to understand the writing quality, AI involvement, and any signs of plagiarism, all in one clear and simple overview.

## Properties Fields

| **Property** | **Type** | **Default** | **Description** |
| --------------------------- | --------- | ----------- | --------------- |
| `enable` | `boolean` | `false` | Enable the Gen-AI Overview feature to extract key insights from the scan data. |
| `ignoreAIDetection` | `boolean` | `false` | Ignore AI detection when generating the scan’s overview. Only applicable if AI detection was enabled. |
| `ignorePlagiarismDetection` | `boolean` | `false` | Ignore plagiarism detection when generating the scan’s overview. Only applicable if plagiarism detection was enabled. |
| `ignoreWritingFeedback` | `boolean` | `false` | Ignore the writing assistant when generating the scan’s overview. Only applicable if the writing assistant was enabled. |
| `ignoreAuthorData` | `boolean` | `false` | Ignore the author’s historical data when generating the scan’s overview. Only applicable if an author ID was added to the request. |

## Examples

### Request - Submit Endpoint Example

```json
{
  "base64": "...",
  "filename": "file.txt",
  "properties": {
    "action": 0,
    "overview": {
      "enable": true
    }
  }
}
```

### Export Endpoint Example

The request follows the same structure as the other [export](/reference/actions/downloads/export/) types; the only difference is that the key is `overview`.

```json
{
  "overview": {
    "verb": "POST",
    "headers": [
      [
        "header-key",
        "header-value"
      ]
    ],
    "endpoint": "https://yoursite.com/export/overview"
  },
  "completionWebhook": "https://yoursite.com/export/completed",
  "maxRetries": 3
}
```

### Response

#### Example

```json
{
  "overview": "### Historical Author Data:\n- Four scans analyzed with an average plagiarism similarity of 13.18%\n- 4 instances of AI-generated content detected\n\n### Current Plagiarism Detection:\n- 0% overall plagiarism\n- Main sources: \n - yard.com (3.0%, 74 words)\n - brainly.com (3.2%, 45 words)\n - llcattorney.com (1.3%, 29 words)\n - montrosedemocrats.org (3.0%, 12 words)\n\n### AI Content Detection:\n- 100% AI-written content detected\n\n### Writing Assistant:\n- 100% writing quality with no errors in grammar, sentence structure, word choice, and mechanics.",
  "modelVersion": "v1"
}
```

#### Response Fields

| Field | Type | Description |
| -------------- | ------ | ----------- |
| `overview` | string | A markdown-formatted string containing the Gen-AI overview of the scan. |
| `modelVersion` | string | The version of the AI model used to generate the overview. |

## Next Steps

* [Check for Plagiarism](/guides/authenticity/detect-plagiarism-text): Detect plagiarism in text documents using the Copyleaks API. Search billions of sources to find unoriginal content.
* [Detect AI-Generated Content](/guides/ai-detector/ai-text-detection): Detect AI-generated text via sync or async API calls. This guide covers sync detection; see the Authenticity API Guide for async.
* [Assess Grammar and Writing Quality](/guides/writing/check-grammar): Get writing and grammar suggestions via API. Authenticate, submit text, and access full details in the docs.
* [Moderate Text](/guides/moderation/moderate-text): Scan and moderate text content for unsafe or policy-relevant material across 10+ categories.

---

# Ways to Display Reports

> This document provides a comprehensive overview of the different methods available for viewing and displaying Copyleaks API scan results, including interactive reports, hosted solutions, PDF reports, and custom white-label implementations.

Copyleaks offers multiple flexible options for displaying scan results to accommodate different technical requirements and use cases. Whether you prefer a ready-to-use hosted solution, want to integrate an interactive report into your existing application, need downloadable PDF reports, or require a completely custom implementation, Copyleaks provides the tools and data necessary to meet your specific needs.

The display options range from plug-and-play solutions that require minimal technical implementation to comprehensive data exports that enable full customization of the user experience. Each method offers different levels of control, customization, and integration complexity to match your project requirements.
All display methods are designed to work seamlessly with the Copyleaks scanning process and provide comprehensive visualization of plagiarism detection, AI content detection, and other scan results through modern, responsive interfaces.

## Viewing the Results

### Hosted Web Report

A responsive web page that can be opened in a new window or an iframe. This solution provides immediate access to professional report viewing without any development overhead or hosting requirements.

The hosted report automatically updates with the latest features and improvements, ensuring users always have access to the most current functionality. It is well suited to rapid deployment and scenarios where minimal technical implementation is preferred.

* [Embed Hosted Web Report](/guides/display/embed-hosted-web-report/): Learn how to embed the Copyleaks hosted web report into your application.

### Open-Source Web Report

Build your own Copyleaks Interactive Report from our [GitHub repository](https://github.com/copyleaks/plagiarism-report). You will need to host the report and send Copyleaks data to populate it.

The Interactive Report provides a comprehensive, web-based interface that displays detailed scan results including highlighted matches, source comparisons, and AI detection results. This solution offers maximum customization flexibility while leveraging Copyleaks’ proven report interface design and functionality.

Integration requires [Angular](https://angular.dev/) development skills and the ability to host and maintain the report within your own infrastructure.

* [Install Open-Source Report](/guides/display/install-open-source-report/): Learn how to install and configure the Copyleaks open-source web report.
### PDF Report

Enable the Copyleaks PDF Report by requesting it via the `properties.pdf.create` parameter in your scans. You can also customize the PDF report, for example its title and logo.

PDF reports are ideal for archival purposes, offline viewing, printing, and sharing with stakeholders who prefer traditional document formats.

* [Export PDF Report](/concepts/features/export-pdf-report/): Learn how to create and customize PDF reports of your scan results.

### White Label Customization

Some Copyleaks customers opt to build their own custom solutions for viewing data. Copyleaks provides the underlying data, such as the start position and length of each plagiarism match and AI-detected segment, so you can build a custom viewing platform designed to your preferences and specifications.

This approach provides complete control over the user experience, allowing you to integrate scan results seamlessly into your existing application workflow and design language.

[Learn more about Custom Implementations](/guides/authenticity/detect-plagiarism-text)

## Choosing the Right Display Method

To help you decide which display method is right for you, here is a side-by-side comparison of the available options:

| Display Method | Implementation Effort | Customization Level | Hosting | Primary Use Case |
| :--------------------- | :-------------------- | :------------------ | :---------- | :--------------- |
| **Hosted Report** | Low | Low | Copyleaks | Quick setup, embedding in iframes, maintenance-free. |
| **Open-Source Report** | Medium | High | Self-hosted | Full UI control, integration with your brand, feature-rich. |
| **PDF Report** | Low | Medium | N/A | Offline sharing, archiving, formal documentation. |
| **Custom (API Data)** | High | Complete | Self-hosted | Seamless integration into existing apps, unique workflows. |

## Support

Should you require any assistance or have inquiries about implementing any of these display methods, please contact [Copyleaks Support](https://help.copyleaks.com/hc/en-us/requests/new) or ask a question on [StackOverflow](https://stackoverflow.com/questions/tagged/copyleaks-api) with the `copyleaks-api` tag. We appreciate your interest in Copyleaks and look forward to supporting your implementation efforts.

---

# Identical Matches Detection

> Learn how to configure the Copyleaks API to detect only identical text matches by filtering out paraphrased content and minor changes.

This document explains how to configure the Copyleaks API to report only exact duplications by filtering out paraphrased content and minor changes.

## 📖 Introduction

The Identical Matches Only configuration is designed for teams that need to focus specifically on exact text duplications without the complexity of similarity detection. This approach is particularly useful for detecting direct copying and verbatim plagiarism.

By focusing exclusively on identical matches, teams can efficiently identify the most straightforward cases of content duplication while reducing false positives and ambiguous similarity detections that might require additional review time.

## 🔧 Configuration

To detect only identical matches, you must disable the settings for paraphrased content and minor changes in your API request.

* **Disable Minor Changes**: Set `properties.filters.minorChangesEnabled` to `false`.
* **Disable Paraphrased Content**: Set `properties.filters.relatedMeaningEnabled` to `false`.
* **Enable Internet Scanning**: Set `properties.scanning.internet` to `true` to scan against online sources.

Tip: By setting both `minorChangesEnabled` and `relatedMeaningEnabled` to `false`, the API will only return results that are exact, word-for-word matches.

### Example JSON Configuration

Here is an example of the `properties` object configured for an identical-only scan against internet sources.

Request Body Properties

```json
{
  "properties": {
    "filters": {
      "minorChangesEnabled": false,
      "relatedMeaningEnabled": false
    },
    "scanning": {
      "internet": true
    }
  }
}
```

## 🚀 Next Steps

After configuring your identical matches detection:

* [How to Display Scan Reports](/concepts/features/how-to-display/): Learn how to interpret and display identical match results.
* [Webhooks Overview](/reference/data-types/authenticity/webhooks/overview/): Set up automated notifications for scan completion.
* [Detect Plagiarism in Text](/guides/authenticity/detect-plagiarism-text): Explore additional scanning capabilities beyond identical matches.
* [Book a Demo](https://copyleaks.com/book-a-demo): Get personalized guidance on optimizing your identical match detection.

## 💬 Support

Should you require any assistance, please contact [Copyleaks Support](https://help.copyleaks.com/hc/en-us/requests/new) or ask a question on [StackOverflow](https://stackoverflow.com/questions/tagged/copyleaks-api) with the `copyleaks-api` tag.

---

# Prevent Self-Plagiarism and Author Conflicts

> Learn how to use scan ID patterns to prevent an author's documents from being flagged as plagiarism against their own previous submissions.

This guide helps you avoid situations where documents from the same author are flagged as plagiarism against each other.
This is particularly important when authors submit multiple assignments or revisions, or when you want to prevent matches within the same author’s work while maintaining detection across different authors.

## 🔄 Understanding the Problem

When working with document databases, you may encounter scenarios where:

* An author’s current document matches against their previous work from earlier submissions.
* Multiple versions or drafts of the same document are flagged against each other.
* Legitimate self-referencing or building upon previous work is incorrectly identified as plagiarism.

This guide provides strategies to prevent these false positives while maintaining effective plagiarism detection.

## 🛡️ Prevention Strategy: Smart Scan ID Structure

The most effective approach is to design a structured `scanId` for each submission. A well-structured ID makes it easy to include or exclude specific groups of documents from a scan.

Caution: **Important**: The maximum `scanId` length is 36 characters. Plan your structure accordingly.

### Example ID Structures

**Basic Structure:**

* `-` (e.g., `author123-essay1`, `emp456-report2`)

**Extended Structure:**

* `--` (e.g., `acmeuni-author123-essay1`, `techcorp-emp456-proposal`)

This structure enables you to:

* **Exclude by author**: Use `author123-*` or `emp456-*`.
* **Include by organization**: Use `acmeuni-*` or `techcorp-*`.
* **Focus on document types**: Use `*-final` or `*-report`.

## 🚫 Using Exclude Patterns

Use the `properties.scanning.exclude.idPattern` parameter to exclude specific patterns from your scan results. The `*` character acts as a wildcard.
```json
{
  "properties": {
    "scanning": {
      "exclude": {
        "idPattern": "author123-*"
      }
    }
  }
}
```

This example excludes all submissions with IDs starting with `author123-`.

## ✅ Using Include Patterns

Use the `properties.scanning.include.idPattern` parameter to *only* include specific patterns in your scan results. This is useful for limiting comparisons to specific groups, like an organization or a class.

```json
{
  "properties": {
    "scanning": {
      "include": {
        "idPattern": "acmeuni-*"
      }
    }
  }
}
```

This example will only compare the submitted document against other documents with IDs starting with `acmeuni-`.

## 📝 Implementation Examples

### Example 1: Exclude Same Author’s Previous Work

```json
{
  "properties": {
    "scanning": {
      "copyleaksDb": {
        "includeMySubmissions": true,
        "includeOthersSubmissions": true
      },
      "exclude": {
        "idPattern": "author123-*"
      }
    }
  }
}
```

### Example 2: Compare Only Within Same Organization

```json
{
  "properties": {
    "scanning": {
      "repositories": [{
        "id": "assignment_repository",
        "includeMySubmissions": true,
        "includeOthersSubmissions": true
      }],
      "include": {
        "idPattern": "acmeuni-*"
      }
    }
  }
}
```

## 💡 Best Practices

* **📋 Plan your ID structure**: Design scan ID patterns from the beginning.
* **🎯 Be specific**: Use precise patterns to avoid excluding too much or too little.
* **📊 Test patterns**: Verify your patterns work correctly with sample data.
* **🔄 Document conventions**: Maintain clear documentation of your ID structure for your team.
* **📏 Keep it short**: Remember the 36-character limit.
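
As a rough illustration of the "plan your ID structure" and "test patterns" practices above, the following Python sketch builds an author-prefixed scan ID, produces the matching exclude properties, and checks the pattern against sample IDs locally. The helper names (`build_scan_id`, `matches_id_pattern`, `exclude_author`) are hypothetical and not part of any Copyleaks SDK, and the local matcher simply assumes that `*` matches any run of characters; the API's own matching rules may differ.

```python
import re

MAX_SCAN_ID_LENGTH = 36  # limit stated in this guide


def build_scan_id(author_id: str, document_id: str) -> str:
    """Hypothetical helper: join the parts with a dash and enforce the length limit."""
    scan_id = f"{author_id}-{document_id}"
    if len(scan_id) > MAX_SCAN_ID_LENGTH:
        raise ValueError(
            f"scan ID is {len(scan_id)} characters; the limit is {MAX_SCAN_ID_LENGTH}"
        )
    return scan_id


def matches_id_pattern(pattern: str, scan_id: str) -> bool:
    """Local approximation (assumption): '*' matches any run of characters,
    every other character is matched literally."""
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.fullmatch(regex, scan_id) is not None


def exclude_author(author_id: str) -> dict:
    """Request properties that exclude every scan ID starting with '<author_id>-'."""
    return {"properties": {"scanning": {"exclude": {"idPattern": f"{author_id}-*"}}}}


# Check the pattern against sample data before using it in a real request.
sample_ids = ["author123-essay1", "author123-essay2", "author456-essay1"]
pattern = exclude_author("author123")["properties"]["scanning"]["exclude"]["idPattern"]
excluded = [s for s in sample_ids if matches_id_pattern(pattern, s)]
print(excluded)  # ['author123-essay1', 'author123-essay2']
```

Keeping the ID builder and the pattern builder side by side makes it harder for the two conventions to drift apart as the naming scheme evolves.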
## 🚀 Next Steps

* [Submit File Documentation](/reference/actions/scans/submit-file/): Learn how to submit files with your custom scan IDs.
* [Compare Multiple Documents](/concepts/features/data-hubs/): Learn about cross-document comparison strategies.

## Support

Should you require any assistance or have inquiries about implementing author conflict prevention, please contact [Copyleaks Support](https://help.copyleaks.com/hc/en-us/requests/new) or ask a question on [StackOverflow](https://stackoverflow.com/questions/tagged/copyleaks-api) with the `copyleaks-api` tag.

---

# Detecting Text Manipulation

> This document provides a comprehensive overview of using the Copyleaks API to detect text manipulation attempts in submitted documents. Text manipulation detection helps identify when users attempt to deceive detection systems through various deceptive techniques.

## Introduction

The Text Manipulation detection feature is designed to identify sophisticated attempts to bypass detection systems. This feature recognizes when users employ various deceptive techniques to hide copied content or manipulate the scanning process.
Text manipulation attempts can include:

* **Hidden Characters**: Inserting invisible characters to break up text patterns
* **Character Replacement**: Using special characters or symbols that look similar to normal letters
* **Invisible or White Text**: Adding white text on white backgrounds or other concealment methods (works only in PDF and DOCX documents)
* **Major Text Exclusion**: Attempting to exclude large portions of text from scanning

By detecting these manipulation attempts, you can maintain the integrity of your plagiarism detection or AI detection process and ensure accurate results.

## Before You Begin

To get the most out of this document, you should first be familiar with how to submit a basic scan. If you’re new to the process, we recommend starting with the guide below.

Note: **Detect Plagiarism in Text**: This guide walks you through the fundamentals of customizing your API request to scan for plagiarism.

## Getting Started

### Enabling Text Manipulation Detection

To enable text manipulation detection in your scans, set the `properties.cheatDetection` parameter to `true`:

```json
{
  "properties": {
    "cheatDetection": true
  }
}
```

**Default**: `false`

When enabled, the submitted document will be analyzed for various text manipulation techniques. If manipulation is detected, a scan alert will be added to the completed webhook.

For more information on submitting documents, check out our documentation for URL, OCR, and File scans.

## Interpreting Scan Results

### Scan Alerts

When text manipulation is detected, you’ll receive specific alerts in your scan completion webhook.
These alerts are found at:

```plaintext
notifications.alerts[]
```

### Types of Text Manipulation Alerts

| Alert Code | Title | Description |
| --------------------------------- | ----------------------------------------- | ----------- |
| `suspected-cheating-detected` | Advanced Detection: Hidden Characters | Detected possible use of hidden characters to cheat the plagiarism scan |
| `suspected-character-replacement` | Advanced Detection: Character Replacement | Detected possible use of special characters to cheat the plagiarism scan |
| `suspected-white-text` | Suspected Cheating: Invisible Text | Detected possible use of invisible or white text - switch to textual version to see all text |
| `text-mostly-excluded` | Advanced Detection: Major Text Exclusion | Detected possible attempt to exclude the majority of text from scanning |
| `cheat-detection-failed` | Advanced Detection Failed | Unable to validate that there was no manipulation in the submitted document |

For a complete list of all possible alerts, see our Scan Alerts documentation.
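
A webhook handler could pick out these alerts along the following lines. This is a minimal Python sketch, assuming the completed-webhook body has already been parsed into a dictionary; `find_manipulation_alerts` is a hypothetical helper, and the second alert code in the sample payload is made up for illustration. Matching on `code` rather than on `title` or `message` keeps the check stable if the human-readable text changes.

```python
from typing import Any, Dict, List

# Alert codes taken from the table above.
TEXT_MANIPULATION_CODES = {
    "suspected-cheating-detected",
    "suspected-character-replacement",
    "suspected-white-text",
    "text-mostly-excluded",
    "cheat-detection-failed",
}


def find_manipulation_alerts(payload: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Return the alerts under notifications.alerts[] whose code is a
    text manipulation code. Missing or empty sections yield an empty list."""
    notifications = payload.get("notifications") or {}
    alerts = notifications.get("alerts") or []
    return [alert for alert in alerts if alert.get("code") in TEXT_MANIPULATION_CODES]


# Sample payload; "some-other-alert" is a made-up code used only for contrast.
sample_payload = {
    "notifications": {
        "alerts": [
            {"code": "suspected-white-text"},
            {"code": "some-other-alert"},
        ]
    }
}

flagged = find_manipulation_alerts(sample_payload)
print([alert["code"] for alert in flagged])  # ['suspected-white-text']
```

Note that `cheat-detection-failed` signals that the check could not be completed rather than that manipulation was found, so a real handler would likely route it differently from the other four codes.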
### Example Alert Response

```json
{
  "notifications": {
    "alerts": [
      {
        "code": "suspected-character-replacement",
        "title": "Advanced Detection: Character Replacement",
        "message": "We have detected possible use of special characters to cheat the plagiarism scan.",
        "category": 3
      }
    ]
  }
}
```

## Best Practices

* **Monitor alerts**: Check for text manipulation alerts in your webhook responses
* **Document findings**: Keep records of detected manipulation attempts for policy enforcement
* **Handle failures**: Implement proper error handling for cases where detection fails
* **Stay updated**: Alert titles and messages may change over time as the system improves

## Next Steps

After implementing text manipulation detection:

* [Detect Plagiarism in Text](/guides/authenticity/detect-plagiarism-text): Explore additional scanning capabilities beyond text manipulation detection.
* [How to Display Scan Reports](/concepts/features/how-to-display): Learn how to present detection results to users effectively.

## Support

Should you require any assistance or have inquiries about implementing text manipulation detection, please contact [Copyleaks Support](https://help.copyleaks.com/hc/en-us/requests/new) or ask a question on [StackOverflow](https://stackoverflow.com/questions/tagged/copyleaks-api) with the `copyleaks-api` tag. We appreciate your interest in Copyleaks and look forward to supporting your efforts to maintain originality and integrity.

By implementing text manipulation detection, you’re adding an essential layer of security to your plagiarism detection workflow, ensuring that sophisticated cheating attempts don’t go unnoticed.

## Need advanced content analysis?
Get personalized guidance on implementing comprehensive content analysis with text manipulation detection and other advanced features.

[Book a Demo](https://copyleaks.com/book-a-demo)

---

# Choosing Your Scan ID

> Learn how to choose a scan ID that fits your organization's needs while adhering to Copyleaks' requirements.

A **Scan ID** is a unique identifier that you assign to every scan submitted to Copyleaks. This ID acts as a crucial link between your system and ours, allowing you to manage, track, and organize your scans effectively.

Choosing a thoughtful and consistent naming convention for your Scan IDs is essential for leveraging advanced Copyleaks features, such as preventing self-plagiarism and managing large volumes of scans.

## Core Requirements

While you have the flexibility to choose a Scan ID that aligns with your internal system, there are a few limitations to keep in mind:

* **Character Length**: Must be between 3 and 36 characters.
* **Allowed Characters**: The Scan ID can include lowercase letters `a-z`, digits `0-9` and special symbols `!@$&-=_()';:., ~`. We recommend using lowercase letters, digits and dashes for simplicity.

**Unsupported Characters:** Uppercase letters (`A-Z`) and any characters not listed above are not permitted. If your internal IDs use unsupported characters, see the section on [Handling ID Mismatches](#handling-id-mismatches).

## Strategies for Naming Your Scan ID

The best approach is to create a structured Scan ID that embeds useful information. This lets you identify a scan from its ID alone and use advanced features such as including or excluding specific groups of documents from a scan.
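The core requirements above can be checked before a scan is submitted. The following is a minimal sketch, not an official validator: the helper name is hypothetical, and it assumes the space that appears inside the symbol list is a formatting artifact and not an allowed character.

```python
import re

# Lowercase letters, digits, and the documented symbols; 3 to 36 characters.
# Assumption: the space shown in the symbol list is not itself allowed.
_SCAN_ID_PATTERN = re.compile(r"[a-z0-9!@$&\-=_()';:.,~]{3,36}")


def is_valid_scan_id(scan_id: str) -> bool:
    """Hypothetical helper: True if scan_id meets the core requirements."""
    return _SCAN_ID_PATTERN.fullmatch(scan_id) is not None
```

An ID such as `USER-9876-DOC-A` fails this check because of its uppercase letters, which is the situation the mapping-table approach in Handling ID Mismatches is meant for.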
### Recommended Structure [Section titled “Recommended Structure”](#recommended-structure) A highly effective structure is: `--` #### Examples [Section titled “Examples”](#examples) * `tech-corp-employee456-q3-report` * `acme-university-student123-final-thesis` Note In plagiarism scans, this structure enables powerful filtering capabilities: * **Exclude by author**: Use a pattern like `*-student123-*` to prevent a student’s new submission from being checked against their previous work. * **Include by organization**: Use `acme-university-*` to compare a document only against others from the same institution. * **Focus on document types**: Use `*-final-thesis` to analyze all final theses submitted. For more information, see the [Prevent Self-Plagiarism](/concepts/features/self-plagiarism/) guide. ## Handling ID Mismatches [Section titled “Handling ID Mismatches”](#handling-id-mismatches) If your internal system uses IDs that don’t meet Copyleaks’ requirements (e.g., they are too long or contain uppercase letters), the recommended solution is to generate a compliant Scan ID and maintain a mapping table on your end. This table will link your internal entity ID to the corresponding Copyleaks Scan ID, ensuring seamless integration. | Your Internal ID | Copyleaks Scan ID | | :------------------- | :------------------- | | `USER-9876-DOC-A` | `user9876-doca` | | `Submission_ABC_123` | `submission-abc-123` | ## Next Steps [Section titled “Next Steps”](#next-steps) [Prevent Self-Plagiarism ](/concepts/features/self-plagiarism/)Learn more about how to exclude previous submissions from the same student to prevent self-plagiarism. [Submit a File ](/reference/actions/scans/submit-file/)See how to implement your Scan ID strategy when submitting a file for scanning. --- # Manage Your Credits > Learn how to manage your Copyleaks credits effectively to optimize usage and prevent unnecessary costs. 
Management Copyleaks provides a comprehensive suite of content integrity services through a flexible, credit-based API. To help you maximize the value of the platform and manage your usage effectively, it is essential to implement smart credit management strategies. Copyleaks offers robust tools to monitor and control your credit consumption, ensuring full transparency and predictability. This guide outlines the available options to help you get started. ## Price Check Before Scan [Section titled “Price Check Before Scan”](#price-check-before-scan) Some applications may not have visibility into document sizes before submission, as end-users directly upload files. This can lead to **unintended credit consumption** when scanning large documents. To mitigate this, Copyleaks recommends **pre-checking** the number of credits required for a scan before proceeding. This allows you to decide whether to continue with the scan or abort it, avoiding unnecessary charges. ### How to Enable Price Check [Section titled “How to Enable Price Check”](#how-to-enable-price-check) To activate the **Check-Credits** flow, set the `properties.action` parameter to `1` (Check-Credits) when submitting a document. In the webhook response, you will receive the expected cost of the scan without actually performing it. After receiving the response, you can decide whether to proceed with the scan or not. To start the scan use the [**Start**](/reference/actions/scans/start) endpoint. ## Confirming Scan Cost After Completion [Section titled “Confirming Scan Cost After Completion”](#confirming-scan-cost-after-completion) Once a scan is completed, Copyleaks sends a **[Completed Webhook](/reference/data-types/authenticity/webhooks/scan-completed)** to your application, including details about the **final cost** of the scan. By tracking this information, you can develop insights into your **expected service costs** and optimize your usage accordingly. 
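The price-check flow can be sketched as two small steps: mark the submission as a Check-Credits request, then compare the quoted cost against a budget before calling the Start endpoint. The `action` property and its value `1` come from the text above. The helper names, the per-scan `max_credits` budget, and the way the expected cost is read out of the webhook body are assumptions for illustration.

```python
import copy

# `properties.action` value for Check-Credits, as stated in the text above.
CHECK_CREDITS_ACTION = 1


def with_price_check(properties: dict) -> dict:
    """Return a copy of the scan properties with the Check-Credits action set.
    The input dictionary is left unchanged."""
    updated = copy.deepcopy(properties)
    updated["action"] = CHECK_CREDITS_ACTION
    return updated


def should_start_scan(expected_credits: int, max_credits: int) -> bool:
    """Hypothetical policy: proceed only if the quoted cost fits the budget."""
    return 0 <= expected_credits <= max_credits
```

Keeping the budget decision in a separate function makes it easy to log both the quoted cost and, later, the final cost reported in the completed webhook.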
Note You are only charged for **successfully completed** scans. If a scan fails due to an error, the credits will be **automatically refunded**. ## Programmatically Monitor Your Remaining Credits [Section titled “Programmatically Monitor Your Remaining Credits”](#programmatically-monitor-your-remaining-credits) Copyleaks API is designed to provide **full automation**, reducing the need for manual intervention. You can retrieve your **current credit balance** to implement various control mechanisms: * **Set spending limits** – Define a threshold (e.g., limit usage to 50% of the budget by mid-month) and configure your system to react accordingly. * **Trigger alerts** – Automatically send notifications when your remaining credits fall below a certain percentage (e.g., below 10%). This can be implemented as a **cron job** for regular monitoring. ### Retrieve Credit Balance [Section titled “Retrieve Credit Balance”](#retrieve-credit-balance) You can check your current credit balance using the **[Get Credits Balance](/reference/actions/scans/check-credits)** endpoint. This will return the number of credits available in your account. ## Set a Spend Limit on Automatic Refills [Section titled “Set a Spend Limit on Automatic Refills”](#set-a-spend-limit-on-automatic-refills) To prevent running out of credits during a billing cycle, **Copyleaks offers automatic refills**, ensuring that scans are never interrupted. ### Why Use Automatic Refills? [Section titled “Why Use Automatic Refills?”](#why-use-automatic-refills) * Ensures that ongoing scans are not disrupted. * Eliminates the need for manual intervention. However, **uncontrolled automatic refills can lead to unexpected costs**, especially if a bug in your application results in excessive scans. Best Practice Set a **maximum budget** for automatic refills to prevent unforeseen expenses. 
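One way to guard against runaway consumption is a scheduled job that evaluates the thresholds mentioned earlier (a mid-month spending limit and a low-balance alert). The sketch below is hedged: it takes the remaining balance as an input and does not call the balance endpoint, since the response shape is not shown here. The function name, the alert labels, and treating day 15 as "mid-month" are assumptions.

```python
def credit_alerts(remaining: float, monthly_budget: float, day_of_month: int,
                  low_pct: float = 10.0, mid_month_pct: float = 50.0) -> list:
    """Return a list of alert labels for the current credit state.
    Hypothetical check intended to run from a scheduled job such as cron."""
    if monthly_budget <= 0:
        raise ValueError("monthly_budget must be positive")

    remaining_pct = 100.0 * remaining / monthly_budget
    used_pct = 100.0 - remaining_pct

    alerts = []
    # Low-balance alert: remaining credits fell below the configured percentage.
    if remaining_pct < low_pct:
        alerts.append("low-balance")
    # Spending limit: more than the allowed share was used by mid-month.
    if day_of_month <= 15 and used_pct > mid_month_pct:
        alerts.append("over-mid-month-limit")
    return alerts
```

The returned labels can then be routed to whatever notification channel your system already uses.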
### How to Enable Automatic Refills [Section titled “How to Enable Automatic Refills”](#how-to-enable-automatic-refills) You can manage this feature via the **billing settings** in your Copyleaks account. ## Predict Your Usage [Section titled “Predict Your Usage”](#predict-your-usage) To forecast your credit consumption, **Copyleaks provides access to historical usage data**. By retrieving your **usage history**, you can generate reports (in CSV format) to analyze past trends and predict future requirements. ### Retrieve Usage History [Section titled “Retrieve Usage History”](#retrieve-usage-history) You can programmatically retrieve a detailed history of your credit consumption using the **[API Usage History](/reference/actions/scans/usage-history)** endpoint. This allows you to fetch data for specific date ranges, which can then be exported or integrated into your own internal dashboards. By analyzing this historical data, you can identify usage patterns, track costs associated with different projects, and build more accurate forecasts for future credit needs. ## Wrap Up [Section titled “Wrap Up”](#wrap-up) The **Copyleaks API** offers extensive flexibility to help you manage credits efficiently. By leveraging these features strategically, you can **prevent excessive usage, optimize costs, and maintain control over your plagiarism detection workflows**. Use these tools **wisely** to ensure a cost-effective and seamless integration! ## Next Steps [Section titled “Next Steps”](#next-steps) [Start a Scan ](/reference/actions/scans/start/)Learn how to initiate a scan after checking the credit cost. [Completed Webhook ](/reference/data-types/authenticity/webhooks/scan-completed)Understand the details provided in the completed webhook, including the final scan cost. [Get Credit Balance ](/reference/actions/scans/check-credits/)Retrieve your current credit balance programmatically. 
[API Usage History](/reference/actions/scans/usage-history/) Access your historical usage data to predict future credit consumption.

---

# Best Practices for Optimizing Performance

> Learn how to optimize performance when using Copyleaks APIs, including data compression, feature management, and scan submission strategies.

Copyleaks is designed for **scalability and high performance**, handling large workloads efficiently. To get the best results from your integration, follow these best practices to **optimize speed, reduce bottlenecks, and maximize efficiency**.

## Use Network Data Compression

Transmitting large amounts of data over the internet **slows down** performance. **Compressing data** can reduce payload size by **up to 70%**, speeding up processing times.

### Enable Request Compression

Compress the data before sending it to Copyleaks and add this header:

```http
Content-Encoding: gzip
```

This is **especially useful** when submitting large files.

### Enable Response Compression

To receive compressed responses from Copyleaks, include this header in your request:

```http
Accept-Encoding: gzip, compress
```

This ensures faster data transfer between your system and Copyleaks.

## Disable Unused Features

Copyleaks offers **many configurable features**, but enabling unnecessary ones can **slow down** scans. Only enable what you need.

Some features to **disable if not needed**:

| Feature | Description | Recommendation |
| --- | --- | --- |
| `properties.includeHtml` | Includes results in **HTML format** | Disable if plain text is enough. |
| `properties.pdf.create` | Generates a **PDF report** | Turn off if you don’t need a PDF. |
| `properties.expiration` | Defines how long scan data is stored | Use **7 days or less** for optimal speed. |
| `properties.filters` | Narrows search results | Customize filters to **improve scan efficiency**. |

**Note:** Check the [Authenticity API methods](/reference/actions/scans/overview) for the full list of features you can toggle.

## Submit Scans at an Optimal Rate

Copyleaks runs on cloud infrastructure, dynamically scaling resources based on demand. However, submitting too many requests at once can reduce efficiency.

### Avoid Overloading the System

* Instead of submitting all documents at once, send them gradually at a controlled rate (`N` calls per second).
* If handling large volumes (e.g., 1M+ files), adjust to the maximum allowed rate limit (see [Rate Limit Policy](/reference/data-types/authenticity/technical-specifications)).

### Prevent Slow Start Issues

* Don’t flood the system with a sudden burst of requests.
* Instead, start with a low rate and gradually increase to maintain stable performance.

**Note:** Need higher limits? **We offer custom plans** for large-scale users. Contact us at ****.

## Adjust Sensitivity for Speed vs. Accuracy

Copyleaks supports different **sensitivity levels**, balancing **speed** and **comprehensiveness** based on your needs. Set the **`properties.sensitivityLevel`** value based on priority:

| Level | Focus | Best For |
| --- | --- | --- |
| `1` | **Speed** | Quick scans, minimal checks. |
| `3` *(default)* | **Balanced** | Most use cases. |
| `5` | **Comprehensive** | Deep analysis, high accuracy. |

**Note:** We recommend level `3` for most users, but feel free to adjust as needed.

## Reuse Your Authentication Token

Each **JWT token** generated during login is **valid for 48 hours**.

**Tip:** Avoid unnecessary login calls; reuse your token for multiple requests within its validity period.

For more information on obtaining a new token, refer to the **[Login API](/reference/actions/account/login)**.

## Next Steps

[Authenticity API Overview](/reference/actions/scans/overview/) Explore the full list of features and options available for configuring your scans.

[Technical Specifications](/reference/data-types/authenticity/technical-specifications/) Understand the rate limit policy and other technical specifications for optimal API usage.

[Login API](/reference/actions/account/login/) Learn how to obtain and manage your authentication token for API access.

---

# Handling Failures

> Learn how to implement an exponential backoff strategy for retrying requests to the Copyleaks API.

This document outlines how to handle failures when interacting with the Copyleaks API, specifically focusing on implementing an **exponential backoff strategy** for retrying requests.

## Understanding Failure Responses

When making requests to the Copyleaks API, you may encounter various HTTP status codes indicating different types of failures. Here are some common ones:

* **Error code 503:** Service Unavailable. Typically, this error appears when Copyleaks is undergoing a maintenance period. You can be notified of these events by subscribing to alerts on [**Copyleaks Status**](https://status.copyleaks.com). We broadcast a message days prior to the event time so users can make preparations in advance.
* **Error code 5xx**: Internal errors. There is an issue pertaining to Copyleaks’ service and/or the network.
* **Error code 429**: Too many requests. Copyleaks, like other REST API services, has a rate limit policy that defines the maximum number of calls that can be made. Repeatedly exceeding the limit will lead to temporary or permanent blocks.

## Suggested Retry Strategy

[**Exponential backoff**](https://en.wikipedia.org/wiki/Exponential_backoff) is a standard algorithm that helps applications define a retry strategy for consuming a network service. For the status codes mentioned above, we recommend implementing a retry algorithm as follows:

1. Make a request to the Copyleaks API.
2. If the request fails, wait 1 + `rand_seconds_number` seconds. Then, retry.
3. If the request fails, wait 2 + `rand_seconds_number` seconds. Then, retry.
4. If the request fails, wait 4 + `rand_seconds_number` seconds. Then, retry.
5. …
6. And so on, doubling the base wait each time, up to `max_time` seconds.
7. Once the wait reaches `max_time`, keep waiting `max_time` between retries, up to a limit of `n` retries.

### Definitions

`rand_seconds_number`: a random number of seconds added to the wait time. This prevents multiple clients from retrying at the same time, which can lead to a thundering herd problem. The suggested value is between 1 and 10 seconds.

`max_time`: the maximum number of seconds to wait. The suggested value is 60 seconds.

## Next Steps

[Webhooks Overview](/reference/data-types/authenticity/webhooks/overview/) Learn how to use webhooks to receive real-time notifications about scan statuses, including failures.

[Technical Specifications](/reference/data-types/authenticity/technical-specifications/) Review the technical specifications, including rate limits, to optimize your API usage.
[Authenticity API Overview ](/reference/actions/scans/overview/)Explore the comprehensive Authenticity API for managing your plagiarism and AI detection processes. --- # Trust & Security > How we protect your data Trust & Security At Copyleaks, we are committed to the security of your data and privacy. We understand that our customers are entrusting us with their data, and we take that responsibility very seriously. We have implemented a comprehensive security program that includes administrative, technical, and physical safeguards to protect your data from unauthorized access, use, or disclosure. This page provides an overview of our security program, including our security architecture, data handling policies, and compliance certifications. ## Our Commitment to Security [Section titled “Our Commitment to Security”](#our-commitment-to-security) Our approach to security is built on several key pillars: ### Security Architecture and Infrastructure [Section titled “Security Architecture and Infrastructure”](#security-architecture-and-infrastructure) Our platform is built on a robust and secure foundation to protect your data at every level. * **Secure Network Design:** All platform components communicate through a secure internal company network. Access to this network is highly restricted, even for Copyleaks employees, and requires identity verification via an SSL client certificate. All communication within the internal network is secured using TLS v1.2 or newer. * **Cloud-Based Architecture:** We leverage a secure, cloud-based system architecture to provide scalable and reliable service. * **On-Premises Option:** For organizations requiring complete control over their data infrastructure, we offer on-premises Cloud Private Hubs. This allows you to retain all sensitive data within your own secured digital environment while utilizing our advanced detection technology. 
* **Continuous Monitoring:** Our systems are monitored 24/7, enabling us to respond instantly to any downtime or security incidents as they are detected. ### Data Encryption [Section titled “Data Encryption”](#data-encryption) Data safety is a cornerstone of our security mechanisms. We employ military-grade encryption to ensure your data is protected at all times. * **Encryption in Transit:** All data transferred to and from our platform is sent exclusively over secure channels (100% HTTPS) using SSL connections. * **Encryption at Rest:** All data saved on our platform is encrypted using the AES-256 standard. Encryption keys are managed by our Cloud providers and are rotated automatically to ensure maximum security. * **Data Backup:** We perform daily data backups, which are stored securely in our backup data centers. ## Compliance and Certifications [Section titled “Compliance and Certifications”](#compliance-and-certifications) Our products routinely undergo independent verification of privacy, security, and compliance controls to meet global standards and earn the trust of our users. * [**SOC 2 & SOC 3:**](https://en.wikipedia.org/wiki/System_and_Organization_Controls) Copyleaks is SOC 2 & 3 certified, demonstrating our commitment to securely managing data to protect our customers’ interests and privacy. Our SOC 3 report, audited by KPMG, is publicly available and outlines our high-powered system’s adherence to security, privacy, and confidentiality standards. * [**GDPR:**](https://en.wikipedia.org/wiki/General_Data_Protection_Regulation) We are fully committed to adhering to the guidelines of the EU General Data Protection Regulation (GDPR). For our European customers, we offer the `copyleaks.eu` site with servers located in Germany, ensuring data processing remains within Europe. * [**PCI DSS:**](https://en.wikipedia.org/wiki/Payment_Card_Industry_Data_Security_Standard) We adhere to the Payment Card Industry Data Security Standard (PCI DSS). 
All payments are processed through Stripe, and we do not access or store any personal credit card information within the Copyleaks system. * [**NIST RMF:**](https://en.wikipedia.org/wiki/Risk_Management_Framework) We meet the guidelines of the NIST Risk Management Framework (RMF), a systematic process for managing information security risk developed by the U.S. National Institute of Standards and Technology. * [**Accessibility:**](https://copyleaks.com/accessibility) We believe technology should be accessible to everyone. Our platform is designed to be user-friendly for all, and our Voluntary Product Accessibility Templates (VPATs) are available for review. ## Application and Operational Security [Section titled “Application and Operational Security”](#application-and-operational-security) We maintain a rigorous application security program to protect our platform from threats. * **Vulnerability Management:** We routinely run vulnerability scans of our system components and use static code analyzers to detect problematic code before it is deployed. * **Regular Updates:** We regularly update the security of our products to protect against emerging threats. * **Responsible Disclosure:** We take security and privacy very seriously and encourage our users to report any identified vulnerabilities. If you believe you have found a security vulnerability, please submit a report with details such as your account email and a screenshot of the issue so our team can investigate. ## Useful Links [Section titled “Useful Links”](#useful-links) [Compliance & Certifications ](https://copyleaks.com/compliance-certifications)Learn about our compliance with global security standards and certifications. [Security Practices ](https://copyleaks.com/security-practices)Explore our security practices and measures to protect your data. --- # Webhooks Security > Learn how to secure your Copyleaks webhook endpoints against unauthorized access and ensure reliable communication. 
Trust & Security Communication with the Copyleaks service is conducted via RESTful requests and responses. Some operations involve asynchronous processing, during which a webhook notification is sent upon completion. Since your server must be accessible over the internet to receive webhook notifications, it is crucial to ensure that incoming requests originate from Copyleaks. To verify the authenticity of webhook requests, you can implement one or more of the following security measures. ## Authentication via HTTPS Client Certificate [Section titled “Authentication via HTTPS Client Certificate”](#authentication-via-https-client-certificate) Copyleaks webhook servers support **HTTPS connections** for secure communication with your endpoints, preventing unauthorized access to transmitted data. To enable this security feature, simply provide an **HTTPS endpoint** when submitting a file for scanning. To further secure your endpoint, Copyleaks employs **SSL client certificates** to authenticate webhook requests and confirm they originate from Copyleaks. Self-signed certificates are also supported. To retrieve the latest SSL client certificate thumbprints, use the following REST API request: ```http GET https://api.copyleaks.com/v2/security/client-certificates ``` Note This authentication method requires an HTTPS-enabled endpoint with SSL support. Non-secure HTTP connections do not support this feature. Since this list is **dynamic** and subject to change, we recommend setting up an automated process to update your environment daily. ## Authentication via Developer Payload [Section titled “Authentication via Developer Payload”](#authentication-via-developer-payload) An alternative method to prevent unauthorized access is by utilizing the `properties.developerPayload` field. To implement this: 1. Set the `developerPayload` value to a unique, secret string known only to you. 2. 
When receiving a webhook request, verify that the `developerPayload` in the request matches the expected value. 3. For enhanced security, consider encrypting the secret string with a private key known only to your system. By employing these authentication methods, you can safeguard your webhook endpoints and ensure secure communication with Copyleaks. ## Configuring Web Application Firewalls (WAF) [Section titled “Configuring Web Application Firewalls (WAF)”](#configuring-web-application-firewalls-waf) Many users have security measures such as AWS WAF, Cloudflare, or other Web Application Firewalls (WAF) in place, which may block webhook requests if they appear suspicious. If you are not receiving webhook notifications, it may be due to your WAF filtering the requests. ### Exclude Copyleaks Webhook Requests from WAF [Section titled “Exclude Copyleaks Webhook Requests from WAF”](#exclude-copyleaks-webhook-requests-from-waf) To resolve this, allow Copyleaks’ webhooks by adding a custom header to the requests and configuring your WAF to allow requests containing this header. This ensures that webhook notifications are received without interference from security mechanisms. By employing these authentication methods and considering WAF exclusions, you can safeguard your webhook endpoints and ensure secure, uninterrupted communication with Copyleaks. ## Static IP Addresses for Webhook Delivery Enterprise [Section titled “Static IP Addresses for Webhook Delivery Enterprise”](#static-ip-addresses-for-webhook-delivery-enterprise) For an enhanced layer of security, we offer enterprise customers the option to receive all webhook notifications from a static, predefined list of IP addresses. Enabling this feature allows you to configure your firewall to accept incoming traffic exclusively from our trusted servers, a practice known as IP allowlisting. 
This significantly reduces the risk of spoofing and ensures that your systems only process legitimate, verified requests from our platform. To have this feature enabled and to receive the list of static IPs for allowlisting, please contact your account manager. ## Next Steps [Section titled “Next Steps”](#next-steps) [Webhooks Overview ](/reference/data-types/authenticity/webhooks/overview/)Learn about the different types of webhooks and how to configure them. [Technical Specifications ](/reference/data-types/authenticity/technical-specifications/)Review the technical specifications, including security considerations for API interactions. [Export Method ](/reference/actions/downloads/export/)Understand how to export scan results, often delivered via webhooks. --- # 🎓 Maintaining Academic Integrity with the Copyleaks API > Learn how to integrate Copyleaks API for academic integrity, including plagiarism detection, AI content identification, and secure document management. Use Cases In today’s evolving academic landscape, upholding integrity and originality is fundamental. Copyleaks is dedicated to empowering instructors and academic institutions with the tools to champion authentic student work and maintain the highest academic standards. Our mission is to provide comprehensive solutions that address the core challenges of modern education, from traditional plagiarism to the nuances of AI-generated content. For developers, the Copyleaks API is the key to seamlessly integrating these critical capabilities directly into your institution’s unique ecosystem, such as a Learning Management System (LMS) or other academic platforms. By leveraging the API, you can empower instructors with robust plagiarism scanning, award-winning AI content detection, and a secure Shared Data Hub for assignments - all within the native workflows they use every day. 
This guide will walk you through the essential API steps, from indexing documents to configuring advanced scanning options, enabling you to build a powerful, integrated solution that supports your institution’s commitment to academic integrity. *** ## 📚 Before You Begin [Section titled “📚 Before You Begin”](#-before-you-begin) To get the most out of this document, you should first be familiar with how to submit a basic scan. If you’re new to the process, we recommend starting with the guide below. **[Detect Plagiarism in Text](/guides/authenticity/detect-plagiarism-text)**: This guide walks you through the fundamentals of customizing your API request to scan for plagiarism, AI-generated text and grammar correction. ## 📄 Submitting Your Documents [Section titled “📄 Submitting Your Documents”](#-submitting-your-documents) The Copyleaks API allows you to submit a variety of document types for analysis. You can [upload files](/reference/actions/scans/submit-file/) in formats such as PDF and DOCX. Additionally, you can submit documents by providing a [URL](/reference/actions/scans/submit-url/). Copyleaks also offers advanced capabilities, allowing you to [upload images](/reference/actions/scans/submit-ocr/) of text. This is made possible using Optical Character Recognition (OCR) technology. ## 🌐 Scanning with the Shared Data Hub [Section titled “🌐 Scanning with the Shared Data Hub”](#-scanning-with-the-shared-data-hub) ### What is the Shared Data Hub? [Section titled “What is the Shared Data Hub?”](#what-is-the-shared-data-hub) The Shared Data Hub is a comprehensive database containing millions of user-submitted documents from institutions worldwide. This powerful resource significantly enhances academic integrity by expanding the scope of plagiarism detection beyond traditional sources. 
### How It Improves Academic Integrity [Section titled “How It Improves Academic Integrity”](#how-it-improves-academic-integrity) The Shared Data Hub is particularly effective at detecting instances where students submit work that isn’t their own. For example, it can detect when: * A student submits an assignment previously written by a friend * The same paper is submitted to different institutions * Work is recycled from previous semesters or academic years This detection capability helps maintain academic standards across educational institutions globally. ### Contributing to the Community [Section titled “Contributing to the Community”](#contributing-to-the-community) When you choose to scan against the Shared Data Hub, you’re not just benefiting from the collective database - you’re also contributing to it. Each document you submit helps strengthen the system for all users, creating a more robust detection network that benefits the entire academic community. This contribution also benefits your own institution directly. Once your students’ assignments are added to the database, they cannot be recycled or reused by other students at your institution in future semesters. 
### Customizing Your Scan Settings [Section titled “Customizing Your Scan Settings”](#customizing-your-scan-settings) You have full control over how your documents are compared within the Shared Data Hub: * **Compare against your institution’s submissions**: Use the `properties.scanning.copyleaksDb.includeMySubmissions` parameter to scan against documents from your own institution * **Compare against other institutions’ submissions**: Use the `properties.scanning.copyleaksDb.includeOthersSubmissions` parameter to scan against submissions of other users in the network * **Use both options**: Enable both parameters for the most comprehensive plagiarism detection ### Automatic Indexing and Management [Section titled “Automatic Indexing and Management”](#automatic-indexing-and-management) After completing a scan, your document is automatically indexed and stored within the Shared Data Hub. This makes it available for future comparisons against new submissions at your institution. 🗑️ If you need to remove a document from the database, you can use our [delete request](/reference/actions/scans/delete/) and set the `purge` parameter to `true`. This will completely remove the document from the Shared Data Hub. ### Benefits of Using the Shared Data Hub [Section titled “Benefits of Using the Shared Data Hub”](#benefits-of-using-the-shared-data-hub) * **Broader Detection**: Access to millions of documents increases the likelihood of identifying plagiarism * **Cross-Institutional Protection**: Detect submissions that may have originated from other schools * **Internal Protection**: Prevent students from reusing assignments within your own institution * **Community Collaboration**: Help build a stronger academic integrity ecosystem for everyone By leveraging the Shared Data Hub, you’re taking advantage of one of the most comprehensive plagiarism detection resources available while contributing to the fight against academic dishonesty. 
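The Shared Data Hub settings described above can be combined in a single request body. The sketch below builds only that portion of the body. The parameter paths are the ones named in this section; the helper name is hypothetical, and the assumption is that each dotted path maps to nested JSON objects. Other fields a real submission needs (the document content, webhook URLs, and so on) are intentionally left out.

```python
def shared_data_hub_properties(include_my_submissions: bool = True,
                               include_others_submissions: bool = True) -> dict:
    """Build the `properties.scanning.copyleaksDb` part of a submit request body.
    Hypothetical helper; merge the result into your full request body."""
    return {
        "properties": {
            "scanning": {
                "copyleaksDb": {
                    # Compare against documents from your own institution.
                    "includeMySubmissions": include_my_submissions,
                    # Compare against submissions from other users in the network.
                    "includeOthersSubmissions": include_others_submissions,
                }
            }
        }
    }
```

Calling the helper with both defaults enables the most comprehensive comparison; passing `include_others_submissions=False` restricts the scan to your own institution's documents.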
## 🌍 Scanning Against the Internet [Section titled “🌍 Scanning Against the Internet”](#-scanning-against-the-internet) To scan against a vast range of online sources, including many academic journals, set the `properties.scanning.internet` parameter to `true`. Internet results will be included in the [Scan Completion Webhook](/reference/data-types/authenticity/webhooks/scan-completed). ## 🤖 Detecting AI-Generated Content [Section titled “🤖 Detecting AI-Generated Content”](#-detecting-ai-generated-content) * To check for AI-written text, set the `properties.aiGeneratedText.detect` parameter to `true`. * Your AI detection results are delivered to a dedicated export webhook. For an example of how the data will be structured, see the [Export AI Detection Response documentation](/reference/data-types/authenticity/results/ai-detection/). ## 💬 Support [Section titled “💬 Support”](#-support) Need help implementing these solutions? Our team is here to assist you every step of the way. Whether you have technical questions or need guidance on best practices, don’t hesitate to reach out through [Copyleaks Support](https://help.copyleaks.com/hc/en-us/requests/new) or engage with our developer community on [StackOverflow](https://stackoverflow.com/questions/tagged/copyleaks-api) using the `copyleaks-api` tag. ## 🚀 Next Steps [Section titled “🚀 Next Steps”](#-next-steps) [Assess Grammar and Writing Quality ](/guides/writing/check-grammar/)Enhance your academic integrity solution with grammar and writing quality assessment. [Detecting Text Manipulation ](/concepts/features/text-manipulation/)Learn how to automatically identify text manipulation techniques being used to bypass detection. [Webhooks Overview ](/reference/data-types/authenticity/webhooks/overview/)To get scan results, you must set up webhooks. These automated messages will notify your system as scans are completed. 
[How to Display Scan Reports ](/concepts/features/how-to-display/)Learn how to present the scan data to your users with our customizable interactive report, a downloadable PDF, or by integrating the results directly into your own UI. ## Schedule a Live Demo Want to see how Copyleaks can enhance your academic integrity solutions? Our technical team can walk you through live examples of scanning against the Shared Data Hub, AI detection, and more. [Book a Demo](https://copyleaks.com/book-a-demo) --- # Content Integrity for Publishers > Learn how to maintain content integrity in pre-publication workflows using Copyleaks to detect plagiarism by comparing your content against billions of online sources, including academic journals, websites, and more. Use Cases In the digital age, ensuring the originality of your content is more crucial than ever. With the vast amount of information available online, it is easy for content to be copied or plagiarized without proper attribution. This can lead to significant issues for publishers, including legal challenges, loss of credibility, and damage to brand reputation. ### The Power of Internet-Wide Scanning [Section titled “The Power of Internet-Wide Scanning”](#the-power-of-internet-wide-scanning) The Copyleaks Plagiarism Checker API provides a powerful solution for detecting internet plagiarism, allowing you to compare your content against billions of online sources, including websites, articles, and academic journals. When you enable internet scanning, you are tapping into a vast and ever-growing database of online content. This allows you to: * **Verify Originality**: Ensure that your content is original before publishing. * **Protect Your IP**: Discover if your content has been plagiarized and published elsewhere without your permission. * **Maintain SEO Rankings**: Avoid penalties from search engines for duplicate content. 
### Text Moderation for Safe Content [Section titled “Text Moderation for Safe Content”](#text-moderation-for-safe-content) The Copyleaks Text Moderation API is designed to detect harmful content, including hate speech, adult content, and other forms of inappropriate material. This is particularly useful for publishers who want to ensure that their content adheres to community guidelines and standards. ## 📚 Before You Begin [Section titled “📚 Before You Begin”](#-before-you-begin) Make sure you are familiar with Copyleaks scans by completing the [Check for Plagiarism](/guides/authenticity/detect-plagiarism-text) guide. ## Verify Content Originality Against Online Sources [Section titled “Verify Content Originality Against Online Sources”](#verify-content-originality-against-online-sources) ### Enabling Internet Scanning [Section titled “Enabling Internet Scanning”](#enabling-internet-scanning) To scan your document against internet sources, set the `properties.scanning.internet` parameter to `true`. This enables scanning against all non-paywalled online sources, including a variety of academic journals. For more information check out our documentation for [URL scans](/reference/actions/scans/submit-url/), [OCR scans](/reference/actions/scans/submit-ocr/), and [File scans](/reference/actions/scans/submit-file/). Enable Internet Scanning ```json { "properties": { "scanning": { "internet": true } } } ``` ### Receiving Results [Section titled “Receiving Results”](#receiving-results) Once your scan is completed, you’ll receive the results through the [completed webhook](/reference/data-types/authenticity/webhooks/scan-completed) event. This webhook is triggered when the scan process finishes successfully and contains the output information from the scan. The internet plagiarism results will be located in the `results.internet` array within the webhook payload. 
Each internet match includes: * `id` - Unique identifier for the match * `title` - Title of the matched content * `url` - Source URL where the match was found * `matchedWords` - Number of words that matched * `metadata` - Additional information about the source (author, organization, publish date, etc.) ### Example payload structure [Section titled “Example payload structure”](#example-payload-structure) ```json { "status": 0, "scannedDocument": { "scanId": "your-scan-id", "totalWords": 1250, "credits": 1 }, "results": { "internet": [ { "id": "match-id", "title": "Source Title", "url": "https://example.com/source", "matchedWords": 45, "metadata": { "author": "Author Name", "organization": "Publisher", "publishDate": "2023-01-01" } } ] } } ``` ## Moderating Content for Safety [Section titled “Moderating Content for Safety”](#moderating-content-for-safety) To ensure that your published content is safe and adheres to your community standards, you can use the Copyleaks Text Moderation API. This API allows you to scan text for harmful content across more than 10 categories, including hate speech, adult content, and other inappropriate material. ### Submitting Content for Moderation [Section titled “Submitting Content for Moderation”](#submitting-content-for-moderation) To moderate a piece of content, send a POST request to the `/v1/text-moderation/{scanId}/check` endpoint. In the request body, you will provide the text to be analyzed and specify which content moderation labels you want to check for. 
For example, a publisher might want to check for toxicity, profanity, and hate speech: Example Moderation Request ```json { "text": "Your text content to be moderated goes here.", "labels": [ { "id": "toxic-v1" }, { "id": "profanity-v1" }, { "id": "hate-speech-v1" } ] } ``` ### Understanding the Results [Section titled “Understanding the Results”](#understanding-the-results) The API will respond with a detailed analysis, pinpointing the exact segments of text that were flagged and for which categories. This allows you to build a workflow to automatically handle or review content that violates your policies. For a complete list of supported categories, see the [Content Moderation Labels](/reference/data-types/moderation/text-moderation-labels/) documentation. To get started with your integration, follow the [Moderate Text Content](/guides/moderation/moderate-text/) guide. ## 💬 Support [Section titled “💬 Support”](#-support) Should you require any assistance or have inquiries, please contact [Copyleaks Support](https://help.copyleaks.com/hc/en-us/requests/new) or ask a question on [StackOverflow](https://stackoverflow.com/questions/tagged/copyleaks-api) with the `copyleaks-api` tag. We appreciate your interest in Copyleaks and look forward to supporting your efforts to maintain originality and integrity. ## 🚀 Next Steps [Section titled “🚀 Next Steps”](#-next-steps) [Check for Plagiarism ](/guides/authenticity/detect-plagiarism-text)Detect plagiarism in text documents using the Copyleaks API. Search billions of sources to find unoriginal content. [Moderate Text ](/guides/moderation/moderate-text)Scan and moderate text content for unsafe or policy-relevant material across 10+ categories. ## Schedule a Live Demo Want to see how internet plagiarism detection works with your specific content? Our technical team can walk you through live examples of scanning against billions of online sources, including academic journals and websites. 
[Book a Demo](https://copyleaks.com/book-a-demo) --- # User-Generated Content Platforms > Learn how to maintain a safe online environment by integrating Copyleaks' context-aware AI for moderating user-generated content (UGC) at scale. Use Cases In today’s digital landscape, user-generated content (UGC) platforms face a multi-faceted challenge: fostering a vibrant community while protecting it from harmful, inauthentic, or unoriginal content. From blog comments and product reviews to social media posts and forum discussions, maintaining content integrity is crucial for brand reputation and user trust. Copyleaks provides a comprehensive suite of tools to address these challenges, including real-time Text Moderation, precise AI Content Detection, and robust Plagiarism Detection. ## Moderate User-Generated Content [Section titled “Moderate User-Generated Content”](#moderate-user-generated-content) User-generated content is the lifeblood of many platforms, but it also opens the door to significant risks. Harmful content—such as hate speech, harassment, and explicit material—can poison your community, drive away users, and damage your brand’s reputation. Manually reviewing every piece of content is often impossible at scale. This is where automated text moderation becomes essential for creating a safe and welcoming online environment. ### Understanding the Challenge [Section titled “Understanding the Challenge”](#understanding-the-challenge) User-generated content platforms across various industries face common moderation challenges. Blogs & Publishing Comment sections with hate speech, user-submitted articles with inappropriate content, and forums requiring real-time moderation. Reviews & Ratings Fake or spam reviews, personal attacks between users, and content that violates platform policies. AdTech & Marketing User-generated ad content with policy violations, harmful campaign descriptions, and community content needing compliance checks. 
Social Media High-volume content requiring instant, context-aware moderation decisions in real-time. ### Why Choose Copyleaks Text Moderation [Section titled “Why Choose Copyleaks Text Moderation”](#why-choose-copyleaks-text-moderation) Our text moderation AI goes beyond simple keyword matching. It understands context, nuance, and intent, providing highly accurate detection of harmful content while minimizing false positives. * **Superior Accuracy:** Exceptional detection rates with minimal false positives. * **Context Understanding:** Words are evaluated based on their context, not just their presence. * **Precise Location Detection:** Pinpoint exactly where in the text harmful content appears. * **Explanation-Driven:** Get clear reasoning for each moderation decision. * **Comprehensive Coverage:** Detect content across 10+ categories, including adult content, hate speech, harassment, self-harm, and more. ### Implementation Benefits [Section titled “Implementation Benefits”](#implementation-benefits) Enhanced Safety & Trust Maintain community standards with context-aware decisions, protecting users and your brand reputation while reducing legal and regulatory risks. Operational Efficiency Reduce manual moderation workload through precise automated flagging and scale your moderation efforts seamlessly as your platform grows. Superior User Experience Minimize false positives and provide clear explanations for moderation decisions, maintaining authentic user interactions while ensuring safety. ### Integration Made Simple [Section titled “Integration Made Simple”](#integration-made-simple) Our Text Moderation API integrates seamlessly into existing content workflows. 1. **Submit Content** Send user-generated text through our API for analysis. 2. **Receive Detailed Analysis** Get character-level flagging information with explanations for each detected violation. 3. **Take Informed Action** Implement automated or manual review based on precise, explained results. 4. 
**Monitor Performance** Track moderation effectiveness and adjust thresholds based on clear metrics. ## Detect AI-Generated Content [Section titled “Detect AI-Generated Content”](#detect-ai-generated-content) The rise of generative AI has introduced a new layer of complexity for UGC platforms. Inauthentic content, such as AI-generated fake reviews, spammy comments, or low-quality articles, can mislead users, manipulate ratings, and erode the trust you’ve built with your community. The Copyleaks AI Detector helps you maintain authenticity by identifying text produced by models like ChatGPT, Gemini, and others, allowing you to flag and manage potentially deceptive submissions. ### Submitting Content for AI Detection [Section titled “Submitting Content for AI Detection”](#submitting-content-for-ai-detection) To check for AI-generated content, send a POST request to the `/v2/writer-detector/{scanId}/check` endpoint. Example AI Detection Request ```json { "text": "Lions are social animals, living in groups called prides, typically consisting of several females, their offspring, and a few males. Female lions are the primary hunters, working together to catch prey. Lions are known for their strength, teamwork, and complex social structures." } ``` ### Understanding the Results [Section titled “Understanding the Results”](#understanding-the-results) The API returns a summary indicating the likelihood of AI-generated content, along with a detailed breakdown of the text. This allows you to flag potentially inauthentic user submissions, such as AI-written reviews or comments. For more details, follow the [Detect AI-Generated Text](/guides/ai-detector/ai-text-detection/) guide. ## Detect Plagiarism [Section titled “Detect Plagiarism”](#detect-plagiarism) For platforms that rely on original user submissions—such as publishing platforms, educational forums, or creative communities—plagiarism poses a significant threat. 
Submitting copied content can lead to copyright infringement issues, damage your platform’s credibility, and devalue the contributions of your authentic users. The Copyleaks Plagiarism Checker empowers you to verify the originality of every submission, protecting your platform and upholding your content standards. ### Submitting Content for Plagiarism Detection [Section titled “Submitting Content for Plagiarism Detection”](#submitting-content-for-plagiarism-detection) To check for plagiarism, send a POST request to the `/v3/scans/submit/file/{scanId}` endpoint. Example Plagiarism Detection Request ```json { "base64": "SGVsbG8gd29ybGQh", "filename": "file.txt", "properties": { "webhooks": { "status": "https://yoursite.com/webhook/{STATUS}/my-custom-id" }, "sandbox": true } } ``` ### Understanding the Results [Section titled “Understanding the Results”](#understanding-the-results-1) The API will notify you via webhooks when the scan is complete. You can then export the results to see if any plagiarism was detected. For more details, follow the [Detect Plagiarism in Text](/guides/authenticity/detect-plagiarism-text/) guide. ## 💬 Support [Section titled “💬 Support”](#-support) Need help implementing content moderation for your platform? Our team understands the unique challenges of UGC platforms and can provide tailored guidance. Contact [Copyleaks Support](https://help.copyleaks.com/hc/en-us/requests/new) or engage with our developer community on [StackOverflow](https://stackoverflow.com/questions/tagged/copyleaks-api) using the `copyleaks-api` tag. ## 📈 Next Steps [Section titled “📈 Next Steps”](#-next-steps) Ready to implement effective content moderation for your platform? [Text Moderation Guide ](/guides/moderation/moderate-text/)Step-by-step guide to integrating Copyleaks Text Moderation API into your platform. [Moderation Labels ](/reference/data-types/moderation/text-moderation-labels/)Detailed information about all supported content categories. 
[Detect AI Content ](/guides/ai-detector/ai-text-detection/)Identify AI-generated content in user submissions. [Book a Demo ](https://copyleaks.com/book-a-demo)Get personalized guidance on implementing content moderation. ## Schedule a Live Demo Want to see how Copyleaks can enhance your user-generated content moderation? Our technical team can walk you through live examples of real-time moderation and AI detection. [Book a Demo](https://copyleaks.com/book-a-demo) --- # Overview > Your journey to building with the Copyleaks API begins here. Find everything you need to integrate our powerful content integrity and authenticity tools. Get Started Find user guides, quickstarts, tutorials, API workflows, implementation use cases, and more to help you integrate and get the most out of Copyleaks. ## Begin Your Integration Follow our Quickstart guide for a step-by-step walkthrough of your first API call. [Quickstart](/get-started/quickstart) ## Guides [Section titled “Guides”](#guides) * Authenticity [Detect Plagiarism in Text ](/guides/authenticity/detect-plagiarism-text)Analyze plagiarism, AI-generated text, and writing quality via API. Supports async calls with webhooks and data masking. [Embed Hosted Web Report ](/guides/display/embed-hosted-web-report)Provides a seamless way to display detailed plagiarism and AI detection reports. [Install Open Source Report ](/guides/display/install-open-source-report)Integrating Copyleaks' web report module into your application to display plagiarism detection, AI content detection, and writing assistance reports. * AI Detection [Detect AI-Generated Content ](/guides/ai-detector/ai-text-detection)Detect AI-generated text via sync or async API calls. This guide covers sync detection—see the Authenticity API Guide for async. * Writing [Assess Grammar and Writing Quality ](/guides/writing/check-grammar)Get writing and grammar suggestions via API. Authenticate, submit text, and access full details in the docs. 
* Moderation [Moderate Text ](/guides/moderation/moderate-text)Scan and moderate text content for unsafe or policy-relevant material across 10+ categories. ## Official SDKs [Section titled “Official SDKs”](#official-sdks) [Python ](/resources/sdks/python) [JavaScript ](/resources/sdks/javascript) [Java ](/resources/sdks/java) [C# ](/resources/sdks/csharp) [PHP ](/resources/sdks/php) [Ruby ](/resources/sdks/ruby) ## Use Cases [Section titled “Use Cases”](#use-cases) [Academic Integrity ](/concepts/use-cases/academic-integrity)Uphold academic standards by detecting plagiarism and identifying AI-generated content in student submissions. [Content Integrity for Publishers ](/concepts/use-cases/publishers)Ensure originality and prevent copyright infringement before publishing. [User-Generated Content Platforms ](/concepts/use-cases/user-generated-content-platforms)Maintain a safe online environment by scanning user-generated content. ## Features [Section titled “Features”](#features) [Data Hubs ](/concepts/features/data-hubs)Learn how to compare multiple documents against each other using Copyleaks' private and shared databases to find similarities and prevent plagiarism. [Embed Hosted Web Report ](/guides/display/embed-hosted-web-report/)Ready-to-use UI hosted by Copyleaks for quick, secure, and fully-managed integration. [Detecting Text Manipulation ](/concepts/features/text-manipulation)Identify sophisticated attempts to bypass detection. [AI Logic ](/concepts/features/ai-logic/)Gain insight into the “why” behind our AI detection results and how to interpret them. --- # Quickstart > Get started with the Copyleaks API in under 5 minutes. This guide will walk you through creating an account, authenticating, and running your first AI detection scan. Get Started Welcome to the Copyleaks Quickstart! This guide will walk you through the essential steps to get you up and running with the Copyleaks API in just a few minutes. Let’s begin. 
## 🚀 Let’s Get You Started [Section titled “🚀 Let’s Get You Started”](#-lets-get-you-started)

1. #### Create Your Account [Section titled “Create Your Account”](#create-your-account)

   Before you start, ensure you have the following:

   * An active Copyleaks account. If you don’t have one, **[sign up for free](https://api.copyleaks.com/signup)**.
   * Your API key, which you can find on the **[API Dashboard](https://api.copyleaks.com/dashboard)**.

2. #### Installation [Section titled “Installation”](#installation)

   Choose your preferred method for making API calls.

   * HTTP

     You can interact with the API using any standard HTTP client. For a quicker setup, we provide a Postman collection. See our [Postman guide](/resources/postman) for instructions.

   * cURL

     * Ubuntu/Debian

       ```bash
       sudo apt-get install curl
       ```

     * Windows

       Download it from [curl.se](https://curl.se).

     * macOS

       ```bash
       brew install curl
       ```

   * Python

     ```bash
     pip install copyleaks
     ```

   * JavaScript

     ```bash
     npm install plagiarism-checker
     ```

   * Java

     [Download from Maven](https://central.sonatype.com/artifact/com.copyleaks.sdk/copyleaks-java-sdk?smo=true)

3. #### Login [Section titled “Login”](#login)

   To perform a scan, we first need to generate an access token using the [login](/reference/actions/account/login) endpoint. The API key can be found on the [Copyleaks API Dashboard](https://api.copyleaks.com/dashboard). Upon successful authentication, you will receive a token that must be attached to subsequent API calls via the `Authorization: Bearer your-access-token-here` header. This token remains valid for 48 hours.
* HTTP ```http POST https://id.copyleaks.com/v3/account/login/api Headers Content-Type: application/json Body { "email": "your@email.address", "key": "00000000-0000-0000-0000-000000000000" } ``` * cURL ```bash export COPYLEAKS_EMAIL="your@email.address" export COPYLEAKS_API_KEY="your-api-key-here" curl --request POST \ --url https://id.copyleaks.com/v3/account/login/api \ --header 'Accept: application/json' \ --header 'Content-Type: application/json' \ --data "{ \"email\": \"${COPYLEAKS_EMAIL}\", \"key\": \"${COPYLEAKS_API_KEY}\" }" ``` * Python ```python from copyleaks.copyleaks import Copyleaks EMAIL_ADDRESS = "your@email.address" API_KEY = "your-api-key-here" # Login to Copyleaks auth_token = Copyleaks.login(EMAIL_ADDRESS, API_KEY) print("Logged successfully!\nToken:", auth_token) ``` * JavaScript ```javascript const { Copyleaks } = require('plagiarism-checker'); const EMAIL_ADDRESS = "your@email.address"; const API_KEY = "your-api-key-here"; async function login() { const copyleaks = new Copyleaks(); const loginResult = await copyleaks.loginAsync(EMAIL_ADDRESS, API_KEY); console.log('Logged successfully!\nToken:', loginResult); return loginResult; } ``` * Java ```java import com.copyleaks.sdk.api.Copyleaks; String EMAIL_ADDRESS = "your@email.address"; String API_KEY = "00000000-0000-0000-0000-000000000000"; // Login to Copyleaks try { String authToken = Copyleaks.login(EMAIL_ADDRESS, API_KEY); System.out.println("Logged successfully!\nToken: " + authToken); } catch (CommandException e) { System.out.println("Failed to login: " + e.getMessage()); System.exit(1); } ``` **Response** ```json { "access_token": "", ".issued": "2025-07-31T10:19:40.0690015Z", ".expires": "2025-08-02T10:19:40.0690016Z" } ``` Note Save this token! It’s valid for 48 hours and can be reused for subsequent API calls. 4. #### Detect AI-Generated Text [Section titled “Detect AI-Generated Text”](#detect-ai-generated-text) Now let’s test some text. 
We’ll start with a sample that’s clearly AI-generated: * HTTP ```http POST https://api.copyleaks.com/v2/writer-detector/my-first-scan/check Authorization: Bearer your-access-token-here Content-Type: application/json { "text": "Artificial intelligence has revolutionized numerous industries by automating complex tasks and providing data-driven insights. Machine learning algorithms can analyze vast datasets to identify patterns that humans might miss. In healthcare, AI assists with diagnosis and drug discovery.", "sandbox": true } ``` * cURL ```bash curl --location "https://api.copyleaks.com/v2/writer-detector/my-first-scan/check" \ --header "Content-Type: application/json" \ --header "Authorization: Bearer your-access-token-here" \ --data "{ \"text\": \"Lions are social animals, living in groups called prides, typically consisting of several females, their offspring, and a few males. Female lions are the primary hunters, working together to catch prey. Lions are known for their strength, teamwork, and complex social structures.\", \"sandbox\": true }" ``` * Python ```python from copyleaks.models.submit.ai_detection_document import NaturalLanguageDocument scan_id = "my-first-scan" sample_text = "Lions are social animals, living in groups called prides, typically consisting of several females, their offspring, and a few males. Female lions are the primary hunters, working together to catch prey. Lions are known for their strength, teamwork, and complex social structures."
document = NaturalLanguageDocument(sample_text) document.set_sandbox(True) response = Copyleaks.AiDetectionClient.submit_natural_language( auth_token, scan_id, document) print("Response:") print(response) print("AI Score:") print(str(response['summary']['ai']*100) + "%") ``` * JavaScript ```javascript const { CopyleaksNaturalLanguageSubmissionModel } = require('plagiarism-checker'); function logSuccess(response) { console.log('Success', response); console.log('AI Score:', response.summary.ai*100, '%'); } function logError(error) { console.error('Error', error); } async function detect(loginResult) { const copyleaks = new Copyleaks(); const sampleText = "Lions are social animals, living in groups called prides, typically consisting of several females, their offspring, and a few males. Female lions are the primary hunters, working together to catch prey. Lions are known for their strength, teamwork, and complex social structures."; const submission = new CopyleaksNaturalLanguageSubmissionModel(sampleText); submission.sandbox = true; copyleaks.aiDetectionClient .submitNaturalTextAsync(loginResult, Date.now() + 1, submission) .then((response) => { logSuccess(response); }) .catch((error) => { logError(error); }); } async function main() { const loginResult = await login(); await detect(loginResult); } main() ``` * Java ```java import com.copyleaks.sdk.api.models.AiDetectionDocument; import com.copyleaks.sdk.api.models.AiDetectionResponse; String scanId = "my-first-scan"; String sampleText = "Artificial intelligence has revolutionized numerous industries by automating complex tasks and providing data-driven insights. 
Machine learning algorithms can analyze vast datasets to identify patterns that humans might miss."; AiDetectionDocument submission = new AiDetectionDocument(sampleText); submission.setSandbox(true); try { AiDetectionResponse response = Copyleaks.aiDetectionClient.submitNaturalLanguage(authToken, scanId, submission); System.out.println("AI Score: " + response.getSummary().getAi()); } catch (CommandException e) { System.out.println("Error: " + e.getMessage()); } ``` **Response** AI Detection Results ```json { "summary": { "ai": 0.95, // 95% likely to be AI-generated "human": 0.05 // 5% likely to be human-written }, "results": [ { "classification": 2, // 2 = AI-generated, 1 = human-written "probability": 0.95 } ] } ``` ## 🎉 Congratulations! [Section titled “🎉 Congratulations!”](#-congratulations) **You have just:** * ✅ Authenticated with the Copyleaks API * ✅ Made your first AI detection request * ✅ Interpreted the results ### What’s Next? [Section titled “What’s Next?”](#whats-next) [Check for Plagiarism ](/guides/authenticity/detect-plagiarism-text)Detect plagiarism in text documents using the Copyleaks API. Search billions of sources to find unoriginal content. [Detect AI-Generated Content ](/guides/ai-detector/ai-text-detection)Detect AI-generated text via sync or async API calls. This guide covers sync detection—see the Authenticity API Guide for async. [Assess Grammar and Writing Quality ](/guides/writing/check-grammar)Get writing and grammar suggestions via API. Authenticate, submit text, and access full details in the docs. [Moderate Text ](/guides/moderation/moderate-text)Scan and moderate text content for unsafe or policy-relevant material across 10+ categories. ## Ready to scale beyond the basics? Get a personalized demo and discover how to process thousands of documents seamlessly, integrate Copyleaks into your existing systems, and achieve enterprise-grade accuracy for your specific use case. 
[Book a Demo](https://copyleaks.com/book-a-demo)

---

# Detect AI-Generated Text

> Learn how to use the AI Detection API to check if content was written by a human or generated by an AI.

AI Detector

The Copyleaks AI Detection API determines whether a given text was written by a human or generated by an AI. The API is synchronous, meaning you get the results in the same API call. This guide walks you through submitting text for AI detection and understanding the results.

## 🚀 Get Started [Section titled “🚀 Get Started”](#-get-started)

1. #### Before you begin [Section titled “Before you begin”](#before-you-begin)

   Before you start, ensure you have the following:

   * An active Copyleaks account. If you don’t have one, **[sign up for free](https://api.copyleaks.com/signup)**.
   * Your API key, which you can find on the **[API Dashboard](https://api.copyleaks.com/dashboard)**.

2. #### Installation [Section titled “Installation”](#installation)

   Choose your preferred method for making API calls.

   * HTTP

     You can interact with the API using any standard HTTP client. For a quicker setup, we provide a Postman collection. See our [Postman guide](/resources/postman) for instructions.

   * cURL

     * Ubuntu/Debian

       ```bash
       sudo apt-get install curl
       ```

     * Windows

       Download it from [curl.se](https://curl.se).

     * macOS

       ```bash
       brew install curl
       ```

   * Python

     ```bash
     pip install copyleaks
     ```

   * JavaScript

     ```bash
     npm install plagiarism-checker
     ```

   * Java

     [Download from Maven](https://central.sonatype.com/artifact/com.copyleaks.sdk/copyleaks-java-sdk?smo=true)

3. #### Login [Section titled “Login”](#login)

   To perform a scan, we first need to generate an access token using the [login](/reference/actions/account/login) endpoint.
The API key can be found on the [Copyleaks API Dashboard](https://api.copyleaks.com/dashboard). Upon successful authentication, you will receive a token that must be attached to subsequent API calls via the `Authorization: Bearer your-access-token-here` header. This token remains valid for 48 hours. * HTTP ```http POST https://id.copyleaks.com/v3/account/login/api Headers Content-Type: application/json Body { "email": "your@email.address", "key": "00000000-0000-0000-0000-000000000000" } ``` * cURL ```bash export COPYLEAKS_EMAIL="your@email.address" export COPYLEAKS_API_KEY="your-api-key-here" curl --request POST \ --url https://id.copyleaks.com/v3/account/login/api \ --header 'Accept: application/json' \ --header 'Content-Type: application/json' \ --data "{ \"email\": \"${COPYLEAKS_EMAIL}\", \"key\": \"${COPYLEAKS_API_KEY}\" }" ``` * Python ```python from copyleaks.copyleaks import Copyleaks EMAIL_ADDRESS = "your@email.address" API_KEY = "your-api-key-here" # Login to Copyleaks auth_token = Copyleaks.login(EMAIL_ADDRESS, API_KEY) print("Logged in successfully!\nToken:", auth_token) ``` * JavaScript ```javascript const { Copyleaks } = require('plagiarism-checker'); const EMAIL_ADDRESS = "your@email.address"; const API_KEY = "your-api-key-here"; async function login() { const copyleaks = new Copyleaks(); const loginResult = await copyleaks.loginAsync(EMAIL_ADDRESS, API_KEY); console.log('Logged in successfully!\nToken:', loginResult); return loginResult; } ``` * Java ```java import com.copyleaks.sdk.api.Copyleaks; String EMAIL_ADDRESS = "your@email.address"; String API_KEY = "00000000-0000-0000-0000-000000000000"; // Login to Copyleaks try { String authToken = Copyleaks.login(EMAIL_ADDRESS, API_KEY); System.out.println("Logged in successfully!\nToken: " + authToken); } catch (CommandException e) { System.out.println("Failed to login: " + e.getMessage()); System.exit(1); } ``` **Response** ```json { "access_token": "", ".issued": "2025-07-31T10:19:40.0690015Z", ".expires":
"2025-08-02T10:19:40.0690016Z" } ``` Note Save this token! It’s valid for 48 hours and can be reused for subsequent API calls. 4. #### Submit for Analysis [Section titled “Submit for Analysis”](#submit-for-analysis) Use the [Writer Detector Endpoint](/reference/actions/writer-detector/check) to send text for analysis. You need to provide a unique `scanId` for each submission. Tip For testing, set `"sandbox": true`. Sandbox mode is free and returns mock results. * HTTP ```http POST https://api.copyleaks.com/v2/writer-detector/my-scan-1/check Headers Authorization: Bearer Content-Type: application/json Body { "text": "Lions are social animals, living in groups called prides, typically consisting of several females, their offspring, and a few males. Female lions are the primary hunters, working together to catch prey. Lions are known for their strength, teamwork, and complex social structures.", "sandbox": true } ``` * cURL ```bash curl -X POST "https://api.copyleaks.com/v2/writer-detector/my-scan-1/check" \ -H "Authorization: Bearer " \ -H "Content-Type: application/json" \ -d '{ "text": "Lions are social animals, living in groups called prides, typically consisting of several females, their offspring, and a few males. Female lions are the primary hunters, working together to catch prey. Lions are known for their strength, teamwork, and complex social structures.", "sandbox": true }' ``` * Python ```python from copyleaks.copyleaks import Copyleaks from copyleaks.models.submit.document import NaturalLanguageDocument scan_id = "my-scan-1" sample_text = "Lions are social animals, living in groups called prides..." 
natural_language_submission = NaturalLanguageDocument(sample_text) natural_language_submission.set_sandbox(True) response = Copyleaks.AiDetectionClient.submit_natural_language(auth_token, scan_id, natural_language_submission) print(response) ``` * Node.js ```javascript const { Copyleaks, CopyleaksNaturalLanguageSubmissionModel } = require('plagiarism-checker'); const scanId = "my-ai-scan"; const sampleText = "Lions are social animals, living in groups called prides..."; const submission = new CopyleaksNaturalLanguageSubmissionModel(sampleText); submission.sandbox = true; const response = await copyleaks.aiDetectionClient.submitNaturalTextAsync(authToken, scanId, submission); console.log(response); ``` * Java ```java import classes.Copyleaks; import models.submissions.CopyleaksNaturalLanguageSubmissionModel; import models.responses.AIDetectionResponse; String scanId = "my-ai-scan"; String sampleText = "Lions are social animals, living in groups called prides..."; CopyleaksNaturalLanguageSubmissionModel submission = new CopyleaksNaturalLanguageSubmissionModel(sampleText); submission.setSandbox(true); AIDetectionResponse response = Copyleaks.aiDetectionClient.submitNaturalLanguage(authToken, scanId, submission); System.out.println("AI Score: " + response.getSummary().getAi()); ``` 5. #### Interpreting The Response [Section titled “Interpreting The Response”](#interpreting-the-response) For a complete breakdown of the response structure, see the [AI Detection Response](/reference/data-types/ai-detector/ai-detector) documentation. 6. #### 🎉Congratulations! [Section titled “🎉Congratulations!”](#congratulations) You have successfully submitted text for AI detection. You can now use the JSON response in your application to take further action based on the findings. ## 🗺️ Next Steps [Section titled “🗺️ Next Steps”](#️-next-steps) [API Reference ](/reference/actions/writer-detector/check)Explore the full API reference for the AI Detection endpoint. 
[AI Logic ](/concepts/features/ai-logic/)Learn how AI Logic can help you interpret the results of AI text detection. [Accuracy & 3rd Party Evaluations ](https://copyleaks.com/blog/ai-detector-continues-top-accuracy-third-party)Discover how Copyleaks AI Detector maintains top accuracy in third-party evaluations. --- # Detect Plagiarism in Text > A comprehensive guide to using the Copyleaks Plagiarism Checker API for robust originality verification. Authenticity The Copyleaks API is the most powerful way to analyze your content for plagiarism. This API is asynchronous: you submit a scan, and Copyleaks notifies your server via webhooks when the results are ready to be retrieved. This guide will walk you through the process of submitting a scan, enabling plagiarism detection, and exporting the results. ## 🚀 Get Started [Section titled “🚀 Get Started”](#-get-started) 1. #### Before you begin [Section titled “Before you begin”](#before-you-begin) Before you start, ensure you have the following: * An active Copyleaks account. If you don’t have one, **[sign up for free](https://api.copyleaks.com/signup)**. * Your API key, which you can find on the **[API Dashboard](https://api.copyleaks.com/dashboard)**. 2. #### Installation [Section titled “Installation”](#installation) Choose your preferred method for making API calls. * HTTP You can interact with the API using any standard HTTP client. For a quicker setup, we provide a Postman collection. See our [Postman guide](/resources/postman) for instructions. * cURL * Ubuntu/Debian ```bash sudo apt-get install curl ``` * Windows Download it from [curl.se](https://curl.se). * macOS ```bash brew install curl ```
* Python ```bash pip install copyleaks ``` * JavaScript ```bash npm install plagiarism-checker ``` * Java [Download from Maven](https://central.sonatype.com/artifact/com.copyleaks.sdk/copyleaks-java-sdk?smo=true) 3. #### Login [Section titled “Login”](#login) To perform a scan, we first need to generate an access token. For that, we will use the [login](/reference/actions/account/login) endpoint. The API key can be found on the [Copyleaks API Dashboard](https://api.copyleaks.com/dashboard). Upon successful authentication, you will receive a token that must be attached to subsequent API calls via the Authorization: Bearer `` header. This token remains valid for 48 hours. * HTTP ```http POST https://id.copyleaks.com/v3/account/login/api Headers Content-Type: application/json Body { "email": "your@email.address", "key": "00000000-0000-0000-0000-000000000000" } ``` * cURL ```bash export COPYLEAKS_EMAIL="your@email.address" export COPYLEAKS_API_KEY="your-api-key-here" curl --request POST \ --url https://id.copyleaks.com/v3/account/login/api \ --header 'Accept: application/json' \ --header 'Content-Type: application/json' \ --data "{ \"email\": \"${COPYLEAKS_EMAIL}\", \"key\": \"${COPYLEAKS_API_KEY}\" }" ``` * Python ```python from copyleaks.copyleaks import Copyleaks EMAIL_ADDRESS = "your@email.address" API_KEY = "your-api-key-here" # Login to Copyleaks auth_token = Copyleaks.login(EMAIL_ADDRESS, API_KEY) print("Logged successfully!\nToken:", auth_token) ``` * JavaScript ```javascript const { Copyleaks } = require('plagiarism-checker'); const EMAIL_ADDRESS = "your@email.address"; const API_KEY = "your-api-key-here"; async function login() { const copyleaks = new Copyleaks(); const loginResult = await copyleaks.loginAsync(EMAIL_ADDRESS, API_KEY); console.log('Logged successfully!\nToken:', loginResult); return loginResult; } ``` * Java ```java import com.copyleaks.sdk.api.Copyleaks; String EMAIL_ADDRESS = "your@email.address";
String API_KEY = "00000000-0000-0000-0000-000000000000"; // Login to Copyleaks try { String authToken = Copyleaks.login(EMAIL_ADDRESS, API_KEY); System.out.println("Logged successfully!\nToken: " + authToken); } catch (CommandException e) { System.out.println("Failed to login: " + e.getMessage()); System.exit(1); } ``` **Response** ```json { "access_token": "", ".issued": "2025-07-31T10:19:40.0690015Z", ".expires": "2025-08-02T10:19:40.0690016Z" } ``` Note Save this token! It’s valid for 48 hours and can be reused for subsequent API calls. 4. #### Submit for Scanning [Section titled “Submit for Scanning”](#submit-for-scanning) Use the [Submit File Endpoint](/reference/actions/scans/submit-file) to send content for analysis. You need to provide a unique `scanId` for each submission. Tip For testing, set `"sandbox": true`. Sandbox mode is free and returns mock results. * HTTP ```http PUT https://api.copyleaks.com/v3/scans/submit/file/my-plagiarism-scan Headers Authorization: Bearer Content-Type: application/json Body { "base64": "SGVsbG8gd29ybGQh", "filename": "file.txt", "properties": { "webhooks": { "status": "https://your-server.com/webhook/{STATUS}" }, "sandbox": true } } ``` * cURL ```bash curl -X PUT "https://api.copyleaks.com/v3/scans/submit/file/my-plagiarism-scan" \ -H "Authorization: Bearer " \ -H "Content-Type: application/json" \ -d '{ "base64": "SGVsbG8gd29ybGQh", "filename": "file.txt", "properties": { "webhooks": { "status": "https://your-server.com/webhook/{STATUS}" }, "sandbox": true, "scanning": { "internet": true }, "cheatDetection": false } }' ``` * Python ```python import base64 from copyleaks.copyleaks import Copyleaks from copyleaks.models.submit.document import FileDocument from copyleaks.models.submit.properties.scan_properties import ScanProperties scan_id = "my-plagiarism-scan" base64_content = base64.b64encode(b'Hello world.').decode('utf8') scan_properties = ScanProperties("https://your-server.com/webhook/{STATUS}")
scan_properties.set_sandbox(True) scan_properties.set_plagiarism_scan(True) # Enable Plagiarism scan file_submission = FileDocument(base64_content, "file.txt") file_submission.set_properties(scan_properties) Copyleaks.submit_file(auth_token, scan_id, file_submission) print("Sent to scanning...") ``` * Node.js ```javascript const { Copyleaks, CopyleaksFileSubmissionModel } = require('plagiarism-checker'); const scanId = "my-plagiarism-scan"; const base64Content = Buffer.from('Hello world').toString('base64'); const submission = new CopyleaksFileSubmissionModel( base64Content, 'file.txt', { webhooks: { status: 'https://your-server.com/webhook/{STATUS}' }, sandbox: true, plagiarism: { scan: true } // Enable Plagiarism scan } ); await copyleaks.submitFileAsync(authToken, scanId, submission); console.log('Sent to scanning...'); ``` * Java ```java import classes.Copyleaks; import models.submissions.CopyleaksFileSubmissionModel; import models.submissions.properties.*; import java.util.Base64; import java.nio.charset.StandardCharsets; String scanId = "my-plagiarism-scan"; String base64Content = Base64.getEncoder().encodeToString("Hello world".getBytes(StandardCharsets.UTF_8)); SubmissionWebhooks webhooks = new SubmissionWebhooks("https://your-server.com/webhook/{STATUS}"); SubmissionProperties properties = new SubmissionProperties(webhooks); properties.setSandbox(true); properties.setPlagiarism(new SubmissionPlagiarism(true)); // Enable Plagiarism scan CopyleaksFileSubmissionModel submission = new CopyleaksFileSubmissionModel(base64Content, "file.txt", properties); Copyleaks.submitFile(authToken, scanId, submission); System.out.println("Sent to scanning..."); ``` 5. #### Wait for Completion Webhook [Section titled “Wait for Completion Webhook”](#wait-for-completion-webhook) The scan can take some time. Once it’s complete, Copyleaks will send a [completed webhook](/reference/data-types/authenticity/webhooks/scan-completed) to the status URL you provided. 
This webhook contains a summary of the scan results, including any `result` IDs for found plagiarism matches. 6. #### Export Detailed Results [Section titled “Export Detailed Results”](#export-detailed-results) After the `completed` webhook arrives, use the [export endpoint](/reference/actions/downloads/export) to retrieve the detailed plagiarism [`results`](/reference/data-types/authenticity/results/new-plagiarism-result) using the `result` IDs you received in the completion webhook. We will also export the Crawled Version. The `crawledVersion` webhook contains the text and html version of the document. This can later be used in order to display the report. In addition, you should also specify a `completionWebhook` to receive notifications when the export is ready. * HTTP ```http POST https://api.copyleaks.com/v3/downloads/my-plagiarism-scan/export/ Headers Authorization: Bearer Content-Type: application/json Body { "completionWebhook": { "url": "https://your.server/webhook/export/completed", "headers": { "key": "value", "key2": "value2" }, "maxRetries": 3 }, "developerPayload": "custom_data_identifier", "crawledVersion": { "endpoint": "https://your.server/webhook/export/crawled", "verb": "POST", "headers": { "key": "value", "key2": "value2" } }, "results": [ { "id": "result-1", "endpoint": "https://your.server/webhook/export/result/result-1", "verb": "POST", "headers": { "key": "value", "key2": "value2" } } ] } ``` * cURL ```bash curl -X POST "https://api.copyleaks.com/v3/downloads/my-plagiarism-scan/export/my-export-1" \ -H "Authorization: Bearer " \ -H "Content-Type: application/json" \ -d '{ "completionWebhook": "https://your-server.com/webhook/export/completion", "developerPayload": "custom_data_identifier", "crawledVersion": { "endpoint": "https://your.server/webhook/export/crawled", "verb": "POST", "headers": { "key": "value", "key2": "value2" } }, "results": [ { "id": "", "endpoint": "https://your-server.com/webhook/export/result/1", "verb": "POST" } ] }' 
``` * Python ```python from copyleaks.copyleaks import Copyleaks from copyleaks.models.export import Export, ExportResult scan_id = "my-plagiarism-scan" export_id = "my-export-1" export = Export() export.set_completion_webhook('https://your-server.com/webhook/export/completion') # Export a specific plagiarism result result1 = ExportResult() result1.set_id('') result1.set_endpoint('https://your-server.com/webhook/export/result/1', 'POST') export.set_results([result1]) Copyleaks.export(auth_token, scan_id, export_id, export) print("Export initiated.") ``` * Node.js ```javascript const { Export, ExportResult } = require('plagiarism-checker'); const scanId = 'my-plagiarism-scan'; const exportId = 'my-export-1'; const exportRequest = new Export(); exportRequest.setCompletionWebhook('https://your-server.com/webhook/export/{STATUS}'); // Export a specific plagiarism result const result1 = new ExportResult(); result1.setId(''); result1.setEndpoint('https://your-server.com/webhook/export/result/1', 'POST'); exportRequest.setResults([result1]); await copyleaks.exportAsync(authToken, scanId, exportId, exportRequest); console.log('Export initiated.'); ``` * Java ```java import classes.Copyleaks; import models.exports.Export; import models.exports.ExportResult; import java.util.Arrays; String scanId = "my-plagiarism-scan"; String exportId = "my-export-1"; Export exportRequest = new Export(); exportRequest.setCompletionWebhook("https://your-server.com/webhook/export/{STATUS}"); // Export a specific plagiarism result ExportResult result1 = new ExportResult(); result1.setId(""); result1.setEndpoint("https://your-server.com/webhook/export/result/1", "POST"); exportRequest.setResults(Arrays.asList(result1)); Copyleaks.export(authToken, scanId, exportId, exportRequest); System.out.println("Export initiated."); ``` 7. #### 🎉Congratulations! [Section titled “🎉Congratulations!”](#congratulations) You have successfully submitted a scan for plagiarism detection and exported the results. 
You can now handle the results in your application, display them to users, or take further actions based on the findings. ## 🗺️ Next Steps [Section titled “🗺️ Next Steps”](#️-next-steps) [Webhooks Overview ](/reference/data-types/authenticity/webhooks/overview/)Learn how to securely receive and process notifications from Copyleaks. [Viewing Scan Results ](/concepts/features/how-to-display/)Understand the scan result format and how to display it to your users. --- # Embed Hosted Web Report > This guide explains how to embed the Copyleaks Hosted Web Report in your application, allowing users to view detailed plagiarism and AI detection reports. Display The Copyleaks Hosted Web Report provides a seamless way to display detailed plagiarism and AI detection reports within your application. This guide will walk you through the process of embedding the report. ## 🚀 Get Started [Section titled “🚀 Get Started”](#-get-started) 1. #### Before you begin [Section titled “Before you begin”](#before-you-begin) Before you begin, ensure you have the following: * A way to generate the required JSON data for the report. This is usually done by using the [Copyleaks API](/reference/actions/scans/submit-url/) to perform a scan and then exporting the results. * A publicly accessible server to host the JSON data file. 2. #### Generate the JSON Data [Section titled “Generate the JSON Data”](#generate-the-json-data) The Hosted Web Report requires a JSON file with a specific structure. This file contains all the necessary information to render the report correctly. Note For a detailed explanation of the JSON data structure, refer to the [JSON Schema reference](/reference/data-types/authenticity/webhooks/overview). 
Here is an example of the JSON data: ```json { "input": { "requestParams": { "headers": { "Authorization": "***token****", "header_key2": "header_value1", "header_key3": "header_value2" } }, "crawledVersion": "https://example.com/api/scans/scanid/scan-source.json", "completedWebhook": "https://example.com/api/scans/scanid/complete_result.json", "writingFeedback": "https://example.com/api/scans/scanid/writing_feedback.json", "result": "https://example.com/api/scans/scanid/results/{RESULT_ID}.json", "pdf": "https://example.com/api/scans/scanid/report.pdf" }, "customizations": { "companyLogo": "https://example.com/logo.svg", "accessExpired": { "httpResponsesCode": [ 403, 401 ], "customMessage": "*Custom Error Message*", "redirectUrl": "https://example.com/login" } } } ``` 3. #### Host the JSON File [Section titled “Host the JSON File”](#host-the-json-file) The JSON file must be hosted on a publicly accessible server. The URL of this file will be used to load the report. **Important Considerations:** * The URL must be publicly accessible. * Your server must allow Cross-Origin Resource Sharing (CORS) to the `https://report.sand-box.info` domain. 4. #### Embed the Report [Section titled “Embed the Report”](#embed-the-report) You can embed the Hosted Web Report in your application using an `<iframe>` element. ## 🗺️ Next Steps [Section titled “🗺️ Next Steps”](#️-next-steps) [Export Detailed Results ](/guides/authenticity/detect-plagiarism-text/)Learn how to export the results of a scan to generate the JSON data for the report. [Webhooks Overview ](/reference/data-types/authenticity/webhooks/overview/)Explore the different customization options available for the Hosted Web Report. ## 💬 Support [Section titled “💬 Support”](#-support) If you have any questions or need help, please contact our [support team](https://help.copyleaks.com/hc/en-us/requests/new). We’re here to assist you with any integration needs.
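The hosting requirements from step 3 above (a reachable JSON file, with CORS allowed for `https://report.sand-box.info`) can be sketched with Python's standard library alone. This is a minimal illustration under stated assumptions, not a Copyleaks-provided component: the file name `report-data.json`, the port, and the helper names `cors_headers`, `ReportDataHandler`, and `serve` are all hypothetical, and a production setup would more likely configure these headers on an existing web server or storage bucket.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer
from pathlib import Path

# Origin that must be allowed to fetch the JSON file (taken from the guide above).
REPORT_ORIGIN = "https://report.sand-box.info"

# Hypothetical location of the report data file; adjust to your own storage.
DATA_FILE = Path("report-data.json")


def cors_headers(origin: str = REPORT_ORIGIN) -> dict:
    """Return response headers that allow cross-origin reads from `origin`."""
    return {
        "Access-Control-Allow-Origin": origin,
        "Access-Control-Allow-Methods": "GET, OPTIONS",
        "Access-Control-Allow-Headers": "Authorization, Content-Type",
    }


class ReportDataHandler(BaseHTTPRequestHandler):
    """Serves the report JSON file and attaches CORS headers to every response."""

    def _send(self, status: int, body: bytes = b"") -> None:
        self.send_response(status)
        for name, value in cors_headers().items():
            self.send_header(name, value)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def do_OPTIONS(self) -> None:
        # Answer CORS preflight requests with an empty body.
        self._send(204)

    def do_GET(self) -> None:
        if not DATA_FILE.exists():
            self._send(404, b'{"error": "not found"}')
            return
        self._send(200, DATA_FILE.read_bytes())


def serve(port: int = 8080) -> None:
    """Start a blocking HTTP server on the given port (assumed value)."""
    HTTPServer(("", port), ReportDataHandler).serve_forever()
```

Calling `serve()` starts the server. Allowing only the report origin, rather than `*`, keeps the file from being read by scripts on unrelated sites while still satisfying the CORS requirement listed in the guide.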
--- # Install Open Source Web Report > This guide provides detailed instructions for integrating Copyleaks' web report module into your Angular application to display plagiarism detection, AI content detection, and writing assistance reports while maintaining your brand identity. Display Copyleaks Web Report is an [Angular](https://angular.dev/) module designed to integrate plagiarism and AI detection reporting seamlessly into your application. This module offers a user-friendly, engaging, and flexible interface for presenting plagiarism and AI content reports, showcasing the authenticity and uniqueness of submitted files or text. ### Key Features [Section titled “Key Features”](#key-features) * **Customizable Layouts**: Various layout options for report display * **Responsive Design**: Adapts to different screen sizes for consistent user experience * **API Integration**: Configurable endpoints for efficient data retrieval * **Accessibility Focused**: Inclusive design for a wider range of users * **Error Handling**: Effective management of data retrieval errors ## 🚀 Get Started [Section titled “🚀 Get Started”](#-get-started) 1. #### Before you begin [Section titled “Before you begin”](#before-you-begin) Before you begin, ensure you have: * A [Copyleaks account](https://api.copyleaks.com/signup) with the ability to complete successful scans and store results * A server-side application with access to stored Copyleaks reports * An Angular application (version compatibility detailed below) * Familiarity with the Copyleaks Authenticity API. If you haven’t tried it yet, get started with the [Detect Plagiarism](/guides/authenticity/detect-plagiarism-text/) guide. 2.
#### Installation [Section titled “Installation”](#installation) First, select the version corresponding to your Angular version: | Angular | @copyleaks/ng-web-report | | ------- | ------------------------ | | 13 | Latest | Then, install the package: * npm ```bash npm install @copyleaks/ng-web-report --save ``` * yarn ```bash yarn add @copyleaks/ng-web-report ``` Finally, ensure the following peer dependencies are installed: | Dependency | Version | | -------------------------- | --------------- | | @angular/localize | ^13.1.1 | | @angular/material | ^13.1.1 | | @angular/flex-layout | ^13.0.0-beta.36 | | scroll-into-view-if-needed | ^2.2.28 | | ngx-skeleton-loader | ^5.0.0 | You can install them using: * npm ```bash npm install @angular/localize@^13.1.1 @angular/material@^13.1.1 @angular/flex-layout@^13.0.0-beta.36 scroll-into-view-if-needed@^2.2.28 ngx-skeleton-loader@^5.0.0 --save ``` * yarn ```bash yarn add @angular/localize@^13.1.1 @angular/material@^13.1.1 @angular/flex-layout@^13.0.0-beta.36 scroll-into-view-if-needed@^2.2.28 ngx-skeleton-loader@^5.0.0 ``` 3. #### Integration [Section titled “Integration”](#integration) The general integration process follows these steps: 1. Create a Copyleaks account 2. Use Copyleaks API to scan for plagiarism 3. Use the [Export Methods](/reference/actions/downloads/export/) to extract data and save it on your server/cloud 4. Create HTTP endpoints to access the stored data 5. Present the data in your website via the Copyleaks web report module 4. #### Implementation [Section titled “Implementation”](#implementation) Add `CopyleaksWebReportModule` and `HttpClientModule` to your module’s imports: app.module.ts ```typescript import { CopyleaksWebReportModule } from '@copyleaks/ng-web-report'; import { HttpClientModule } from '@angular/common/http'; @NgModule({ declarations: [AppComponent], imports: [ // ... 
CopyleaksWebReportModule, HttpClientModule ], providers: [], bootstrap: [AppComponent] }) export class AppModule {} ``` Create an endpoint configuration object that tells the report component where to fetch data, and then add the component to your template. * your.component.ts your.component.ts ```typescript import { IClsReportEndpointConfigModel, IEndpointDetails } from '@copyleaks/ng-web-report'; @Component({ // ... }) export class YourComponent { public endpointConfig: IClsReportEndpointConfigModel; constructor() { // Define your endpoint configuration this.endpointConfig = { crawledVersion: { url: 'https://your-api.com/copyleaks/{scanId}/source', headers: { 'Authorization': 'Bearer your-token', 'Content-Type': 'application/json' } }, completeResults: { url: 'https://your-api.com/copyleaks/{scanId}/completed', headers: { 'Authorization': 'Bearer your-token', 'Content-Type': 'application/json' } }, result: { url: 'https://your-api.com/copyleaks/{scanId}/results/{RESULT_ID}', headers: { 'Authorization': 'Bearer your-token', 'Content-Type': 'application/json' } } // Optional: progress endpoint for real-time results // progress: { ... } }; } // Event handlers handleError(error: ReportHttpRequestErrorModel): void { // Your error handling logic here console.error('Report request error:', error); } handleUpdate(results: ICompleteResults): void { // Your logic for processing report updates here console.log('Complete results updated:', results); } } ``` * your.component.html your.component.html ```html ``` 5. #### Advanced Customization [Section titled “Advanced Customization”](#advanced-customization) You can add custom actions, tabs, and more to the report interface. Here are a few examples: **Custom Actions** ```html

``` **Custom Tabs** ```html

Custom Analysis

Additional Analysis

Your custom analysis content here...

``` 6. #### 🎉 Congratulations! [Section titled “🎉 Congratulations!”](#-congratulations) You have successfully integrated the Copyleaks Web Report into your Angular application. You can now display detailed plagiarism and AI detection reports to your users. ## Query Parameters [Section titled “Query Parameters”](#query-parameters) The report component interprets several query parameters: | Parameter | Type | Description | | ----------- | ------ | --------------------------------------------------- | | contentMode | string | Determines content view type (‘text’ or ‘html’) | | sourcePage | number | Page number in text view pagination (starts from 1) | | suspectPage | number | Page number in text view pagination (starts from 1) | | suspectId | string | Identifier of the selected matching result | | alertCode | string | Code of the selected alert | ## Next Steps [Section titled “Next Steps”](#next-steps) [Package repository on Github ](https://github.com/Copyleaks/ng-web-report)Access the source code and contribute to the development of the Copyleaks Web Report. [Accessibility information (VPAT report) ](https://copyleaks.com/accessibility/)Review the Voluntary Product Accessibility Template (VPAT) report for accessibility compliance. ## Support [Section titled “Support”](#support) Should you require any assistance or have inquiries, please contact [Copyleaks Support](https://help.copyleaks.com/hc/en-us/requests/new) or ask a question on [StackOverflow](https://stackoverflow.com/questions/tagged/copyleaks-api) with the `copyleaks-api` tag. We appreciate your interest in Copyleaks and look forward to supporting your efforts to maintain originality and integrity. --- # Moderate Text Content > Scan and moderate text content for unsafe or policy-relevant material across 10+ categories with the Text Moderation API. 
Moderation The Copyleaks Text Moderation API empowers you to build safer online environments by proactively identifying and flagging harmful or risky content in real time. With support for a broad range of categories, including hate speech, toxic language, and more, our API provides the tools you need to enforce your community standards effectively. This guide will walk you through submitting text for moderation and building a robust workflow based on the results. ## 🚀 Get Started [Section titled “🚀 Get Started”](#-get-started) 1. #### Before you begin [Section titled “Before you begin”](#before-you-begin) Before you start, ensure you have the following: * An active Copyleaks account. If you don’t have one, **[sign up for free](https://api.copyleaks.com/signup)**. * Your API key, which you can find on the **[API Dashboard](https://api.copyleaks.com/dashboard)**. 2. #### Installation [Section titled “Installation”](#installation) Choose your preferred method for making API calls. * HTTP You can interact with the API using any standard HTTP client. For a quicker setup, we provide a Postman collection. See our [Postman guide](/resources/postman) for instructions. * cURL * Ubuntu/Debian ```bash sudo apt-get install curl ``` * Windows Download it from [curl.se](https://curl.se). * macOS ```bash brew install curl ``` * Python ```bash pip install copyleaks ``` * JavaScript ```bash npm install plagiarism-checker ``` * Java [Download from Maven](https://central.sonatype.com/artifact/com.copyleaks.sdk/copyleaks-java-sdk?smo=true) 3. #### Login [Section titled “Login”](#login) To perform a scan, we first need to generate an access token. For that, we will use the [login](/reference/actions/account/login) endpoint. The API key can be found on the [Copyleaks API Dashboard](https://api.copyleaks.com/dashboard).
Upon successful authentication, you will receive a token that must be attached to subsequent API calls via the Authorization: Bearer `` header. This token remains valid for 48 hours. * HTTP ```http POST https://id.copyleaks.com/v3/account/login/api Headers Content-Type: application/json Body { "email": "your@email.address", "key": "00000000-0000-0000-0000-000000000000" } ``` * cURL ```bash export COPYLEAKS_EMAIL="your@email.address" export COPYLEAKS_API_KEY="your-api-key-here" curl --request POST \ --url https://id.copyleaks.com/v3/account/login/api \ --header 'Accept: application/json' \ --header 'Content-Type: application/json' \ --data "{ \"email\": \"${COPYLEAKS_EMAIL}\", \"key\": \"${COPYLEAKS_API_KEY}\" }" ``` * Python ```python from copyleaks.copyleaks import Copyleaks EMAIL_ADDRESS = "your@email.address" API_KEY = "your-api-key-here" # Login to Copyleaks auth_token = Copyleaks.login(EMAIL_ADDRESS, API_KEY) print("Logged successfully!\nToken:", auth_token) ``` * JavaScript ```javascript const { Copyleaks } = require('plagiarism-checker'); const EMAIL_ADDRESS = "your@email.address"; const API_KEY = "your-api-key-here"; async function login() { const copyleaks = new Copyleaks(); const loginResult = await copyleaks.loginAsync(EMAIL_ADDRESS, API_KEY); console.log('Logged successfully!\nToken:', loginResult); return loginResult; } ``` * Java ```java import com.copyleaks.sdk.api.Copyleaks; String EMAIL_ADDRESS = "your@email.address"; String API_KEY = "00000000-0000-0000-0000-000000000000"; // Login to Copyleaks try { String authToken = Copyleaks.login(EMAIL_ADDRESS, API_KEY); System.out.println("Logged successfully!\nToken: " + authToken); } catch (CommandException e) { System.out.println("Failed to login: " + e.getMessage()); System.exit(1); } ``` **Response** ```json { "access_token": "", ".issued": "2025-07-31T10:19:40.0690015Z", ".expires": "2025-08-02T10:19:40.0690016Z" } ``` Note Save this token! 
It’s valid for 48 hours and can be reused for subsequent API calls. 4. #### Submit for Moderation [Section titled “Submit for Moderation”](#submit-for-moderation) Use the [Text Moderation Endpoint](/reference/actions/text-moderation/check) to submit text for moderation. Provide a unique `scanId` for each request. Tip For testing, set `"sandbox": true`. Sandbox mode is free and returns mock results. * HTTP ```http POST https://api.copyleaks.com/v1/text-moderation/my-scan-1/check Authorization: Bearer Content-Type: application/json { "text": "Your text content to be moderated goes here.", "sandbox": true, "language": "en", "labels": [ { "id": "toxic-v1" }, { "id": "profanity-v1" }, { "id": "hate-speech-v1" } ] } ``` * cURL ```bash curl -X POST "https://api.copyleaks.com/v1/text-moderation/my-scan-1/check" \ -H "Authorization: Bearer " \ -H "Content-Type: application/json" \ -d '{ "text": "Copyleaks is a damn useful tool when youre tired of people copying your shit and posting it like its theirs. It doesnt just catch that crap; it can also flag stuff that’s offensive or even sexually suggestive garbage you don’t want in your public content. Platforms are flooded with toxic nonsense and hate these days, and Copyleaks helps clean that up before it turns into a PR nightmare. Whether its racist rants, plagiarism, or just some jerk spewing sexist crap, this tool can spot it.", "sandbox": false, "language": "en", "labels": [ { "id": "toxic-v1" }, { "id": "profanity-v1" }, { "id": "hate-speech-v1" } ] }' ``` * Python ```python from copyleaks.copyleaks import Copyleaks from copyleaks.models.submit.document import TextDocument from copyleaks.models.submit.properties.moderation_properties import ModerationLabel scan_id = "my-moderation-scan" sample_text = "Your text content to be moderated goes here."
text_submission = TextDocument(sample_text) text_submission.set_sandbox(True) text_submission.set_language("en") text_submission.set_labels([ModerationLabel("toxic-v1"), ModerationLabel("profanity-v1")]) response = Copyleaks.moderate_text(auth_token, scan_id, text_submission) print(response) ``` * Node.js ```javascript const { Copyleaks, CopyleaksTextModerationModel, ModerationLabel } = require('plagiarism-checker'); const scanId = "my-moderation-scan"; const sampleText = "Your text content to be moderated goes here."; const submission = new CopyleaksTextModerationModel( sampleText, [new ModerationLabel("toxic-v1"), new ModerationLabel("profanity-v1")], 'en', true // sandbox ); const response = await copyleaks.moderateTextAsync(authToken, scanId, submission); console.log(response); ``` * Java ```java import classes.Copyleaks; import models.submissions.CopyleaksTextModerationModel; import models.submissions.properties.ModerationLabel; import models.responses.ModerationResponse; import java.util.Arrays; String scanId = "my-moderation-scan"; String sampleText = "Your text content to be moderated goes here."; CopyleaksTextModerationModel submission = new CopyleaksTextModerationModel( sampleText, Arrays.asList(new ModerationLabel("toxic-v1"), new ModerationLabel("profanity-v1")), "en", true // sandbox ); ModerationResponse response = Copyleaks.moderateText(authToken, scanId, submission); System.out.println(response); ``` 5. #### Interpret The Response [Section titled “Interpret The Response”](#interpret-the-response) The API returns a `legend` array that maps [labels](/reference/data-types/moderation/text-moderation-labels) IDs to numerical indices, and a `moderations` object that pinpoints the exact location of flagged content using those indices. * **`legend`**: A lookup table where each `id` (e.g., “toxic-v1”) corresponds to an `index`. 
* **`moderations.text.chars`**: Contains parallel arrays: * `starts`: An array of starting character positions for each flagged segment. * `lengths`: An array of character lengths for each segment. * `labels`: An array of numerical indices that correspond to the `legend`. Example Response ```json { "moderations": { "text": { "chars": { "labels": [ 2, 4 ], "starts": [ 27, 100 ], "lengths": [ 2, 8 ] } } }, "legend": [ { "index": 2, "id": "toxic-v1" }, { "index": 4, "id": "profanity-v1" } ] } ``` In this example, the content is flagged for “toxic-v1” starting at character 27 and for “profanity-v1” starting at character 100. 6. #### 🎉Congratulations! [Section titled “🎉Congratulations!”](#congratulations) You have successfully submitted text for moderation. You can now use the JSON response in your application to take further actions based on the findings. ## 🗺️ Next Steps [Section titled “🗺️ Next Steps”](#️-next-steps) [Moderation Labels ](/reference/data-types/moderation/text-moderation-labels/)See a complete list of all supported content moderation labels and their descriptions. [Full API Reference ](/reference/actions/text-moderation/check/)Explore the complete documentation for the Text Moderation endpoint and response object. --- # Assess Grammar & Writing Quality > This document outlines the essential steps for using the Writing Assistant API, covering scan execution and result management. Writing Get started with Copyleaks’ Writing Assistant API to detect and correct over 30 types of writing issues across grammar, mechanics, sentence structure, and word choice. The API is available in two ways: 1. **Sync:** Submit text via an HTTP request and receive the analysis in the response. 2. **Async:** Use the [Authenticity API](/guides/authenticity/detect-plagiarism-text) to submit larger documents and receive results via webhook. Note This guide focuses on the **synchronous** option. 
For asynchronous submissions, see the **[Authenticity API Guide](/guides/authenticity/detect-plagiarism-text)**. ## 🚀 Get Started [Section titled “🚀 Get Started”](#-get-started) 1. #### Before you begin [Section titled “Before you begin”](#before-you-begin) Before you start, ensure you have the following: * An active Copyleaks account. If you don’t have one, **[sign up for free](https://api.copyleaks.com/signup)**. * You can find your API key on the **[API Dashboard](https://api.copyleaks.com/dashboard)**. 2. #### Installation [Section titled “Installation”](#installation) Choose your preferred method for making API calls. * HTTP You can interact with the API using any standard HTTP client. For a quicker setup, we provide a Postman collection. See our [Postman guide](/resources/postman) for instructions. * cURL * Ubuntu/Debian ```bash sudo apt-get install curl ``` * Windows Download it from [curl.se](https://curl.se). * macOS ```bash brew install curl ``` * Python ```bash pip install copyleaks ``` * JavaScript ```bash npm install plagiarism-checker ``` * Java [Download from Maven](https://central.sonatype.com/artifact/com.copyleaks.sdk/copyleaks-java-sdk?smo=true) 3. #### Login [Section titled “Login”](#login) To perform a scan, we first need to generate an access token. For that, we will use the [login](/reference/actions/account/login) endpoint. The API key can be found on the [Copyleaks API Dashboard](https://api.copyleaks.com/dashboard). Upon successful authentication, you will receive a token that must be attached to subsequent API calls via the `Authorization: Bearer YOUR_LOGIN_TOKEN` header. This token remains valid for 48 hours.
* HTTP ```http POST https://id.copyleaks.com/v3/account/login/api Headers Content-Type: application/json Body { "email": "your@email.address", "key": "00000000-0000-0000-0000-000000000000" } ``` * cURL ```bash export COPYLEAKS_EMAIL="your@email.address" export COPYLEAKS_API_KEY="your-api-key-here" curl --request POST \ --url https://id.copyleaks.com/v3/account/login/api \ --header 'Accept: application/json' \ --header 'Content-Type: application/json' \ --data "{ \"email\": \"${COPYLEAKS_EMAIL}\", \"key\": \"${COPYLEAKS_API_KEY}\" }" ``` * Python ```python from copyleaks.copyleaks import Copyleaks EMAIL_ADDRESS = "your@email.address" API_KEY = "your-api-key-here" # Login to Copyleaks auth_token = Copyleaks.login(EMAIL_ADDRESS, API_KEY) print("Logged successfully!\nToken:", auth_token) ``` * JavaScript ```javascript const { Copyleaks } = require('plagiarism-checker'); const EMAIL_ADDRESS = "your@email.address"; const API_KEY = "your-api-key-here"; async function login() { const copyleaks = new Copyleaks(); const loginResult = await copyleaks.loginAsync(EMAIL_ADDRESS, API_KEY); console.log('Logged successfully!\nToken:', loginResult); return loginResult; } ``` * Java ```java import com.copyleaks.sdk.api.Copyleaks; String EMAIL_ADDRESS = "your@email.address"; String API_KEY = "00000000-0000-0000-0000-000000000000"; // Login to Copyleaks try { String authToken = Copyleaks.login(EMAIL_ADDRESS, API_KEY); System.out.println("Logged successfully!\nToken: " + authToken); } catch (CommandException e) { System.out.println("Failed to login: " + e.getMessage()); System.exit(1); } ``` **Response** ```json { "access_token": "", ".issued": "2025-07-31T10:19:40.0690015Z", ".expires": "2025-08-02T10:19:40.0690016Z" } ``` Note Save this token! It’s valid for 48 hours and can be reused for subsequent API calls. 4. #### Send Request [Section titled “Send Request”](#send-request) Use the [Writing Feedback Endpoint](/reference/actions/writing-assistant/check). 
Provide a unique `scanId` for each request. Tip For testing, set `"sandbox": true` in the request body. Sandbox mode is free and returns mock results. * HTTP ```http POST https://api.copyleaks.com/v1/writing-feedback/my-scan-1/check Headers Authorization: Bearer YOUR_LOGIN_TOKEN Content-Type: application/json Body { "text": "Copyleaks is a online plagarism detector that helps schools, business and content creators to make sure thier work is orginal. It scans textes from internet and databasis to find similerities. The tool is fast, accurate and supports multipal languages. However, some times it gives false possitives, so users should double check results. Overall, its a usefull platform for mantaining content integrity.", "sandbox": true } ``` * cURL ```bash curl -X POST "https://api.copyleaks.com/v1/writing-feedback/my-scan-1/check" \ -H "Authorization: Bearer YOUR_LOGIN_TOKEN" \ -H "Content-Type: application/json" \ -d '{ "text": "Copyleaks is a online plagarism detector that helps schools, business and content creators to make sure thier work is orginal. It scans textes from internet and databasis to find similerities. The tool is fast, accurate and supports multipal languages. However, some times it gives false possitives, so users should double check results. Overall, its a usefull platform for mantaining content integrity.", "sandbox": true }' ``` * Python ```python from copyleaks.copyleaks import Copyleaks from copyleaks.models.submit.writing_assistant_document import WritingAssistantDocument scan_id = "my-python-scan" sample_text = "Hello world, this is a test."
submission = WritingAssistantDocument(sample_text) submission.set_sandbox(True) response = Copyleaks.WritingAssistantClient.submit_text(auth_token, scan_id, submission) print(response) ``` * JavaScript ```javascript const { CopyleaksWritingAssistantSubmissionModel } = require('plagiarism-checker'); const scanId = 'my-nodejs-scan'; const sampleText = "Hello world, this is a test."; const submission = new CopyleaksWritingAssistantSubmissionModel(sampleText); submission.sandbox = true; const response = await copyleaks.writingAssistantClient.submitTextAsync(authToken, scanId, submission); console.log(response); ``` * Java ```java import classes.Copyleaks; import models.submissions.CopyleaksWritingAssistantSubmissionModel; import models.responses.WritingAssistantResponse; String scanId = "my-java-scan"; String sampleText = "Hello world, this is a test."; CopyleaksWritingAssistantSubmissionModel submission = new CopyleaksWritingAssistantSubmissionModel(sampleText); submission.setSandbox(true); WritingAssistantResponse response = Copyleaks.writingAssistantClient.submitText(authToken, scanId, submission); System.out.println(response); ``` 5. #### Interpret The Response [Section titled “Interpret The Response”](#interpret-the-response) The [Writing Assistant Response](/reference/data-types/writing/writing-assistant) contains detailed feedback under the `corrections` and `score` properties. The `score` provides an overall quality metric and a breakdown by category, while `corrections` pinpoints the exact location and suggested changes for each issue. 
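The arrays under `corrections.text.chars` appear to be parallel, so each issue can be recovered by pairing the entries at the same index. The following is a minimal Python sketch under stated assumptions: `starts` is treated as a zero-based character offset into the submitted text, `operationTexts` is treated as the suggested replacement, and the function name and sample sentence are hypothetical illustrations rather than part of the Copyleaks SDK.

```python
from typing import Dict, List, Tuple


def flagged_spans(text: str, chars: Dict[str, list]) -> List[Tuple[int, str, str]]:
    """Pair each correction type with the flagged span and its operation text.

    Assumes zero-based character offsets (not confirmed by the source).
    """
    spans = []
    for type_id, start, length, operation_text in zip(
        chars["types"], chars["starts"], chars["lengths"], chars["operationTexts"]
    ):
        # Slice the flagged segment out of the submitted text.
        spans.append((type_id, text[start:start + length], operation_text))
    return spans


# Hypothetical submitted text: the misspelled word starts at offset 28.
sample_text = "The rubber band in my hands strechtes a lot."
sample_chars = {
    "types": [18],
    "starts": [28],
    "lengths": [9],
    "operationTexts": ["stretches "],
}

for type_id, flagged, suggestion in flagged_spans(sample_text, sample_chars):
    print(f"type={type_id} flagged={flagged!r} suggestion={suggestion!r}")
```

Keeping the pairing in one helper means the rest of an application can work with tuples instead of four separate arrays.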
Example Response Snippet ```json { "score": { "corrections": { "overallScore": 93, "grammarCorrectionsScore": 100 }, "readability": { "readabilityLevelText": "College Student" } }, "corrections": { "text": { "chars": { "types": [18], "starts": [28], "lengths": [9], "operationTexts": ["stretches "] } } } } ``` ## 🗺️ Next Steps [Section titled “🗺️ Next Steps”](#️-next-steps) [Full API Reference ](/reference/actions/writing-assistant/check/)Explore the complete documentation for the Writing Assistant response object. [Correction Types ](/reference/data-types/authenticity/correction-types/)See a detailed list of all supported correction types and languages. --- # Login > Authenticate with the Copyleaks API using your email and API key Account POST **https\://id.copyleaks.com/v3/account/login/api** Log in to the Copyleaks API using your email and API key. Once logged in, you receive a login token that authenticates your calls to the other API methods. Attach the token to each subsequent call by adding this header: ```http Authorization: Bearer TOKEN ``` A generated token is valid for **48 hours** and can be reused any number of times within that period. Before attaching the `Authorization` header to a call, make sure that the token has not expired. If it has expired, generate a new one. Handle tokens carefully The Copyleaks API token should be treated as a password. An attacker who gains access to this token can read and modify your private information. API Rate Limit **12 requests per account per 15 minutes** If you exceed the API limit, authentication will be blocked for 5 minutes (Rate Limit Exceeded HTTP 429 - Too Many Requests).
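Because a token lasts 48 hours and login calls are rate-limited, it can help to cache the login response and check its `.expires` field before reusing the token. The sketch below is one possible approach, not part of the Copyleaks SDK: the helper names and the five-minute safety margin are assumptions, and the timestamp parser only covers the two formats that appear in the example responses on this site.

```python
import re
from datetime import datetime, timedelta, timezone
from typing import Optional

# Matches timestamps such as 2025-08-02T10:19:40.0690016Z
# or 2018-11-26T16:15:38.2431255+02:00.
_TIMESTAMP = re.compile(
    r"(\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2})(?:\.(\d+))?(Z|[+-]\d{2}:\d{2})"
)


def parse_timestamp(value: str) -> datetime:
    """Parse a login timestamp into a timezone-aware datetime."""
    match = _TIMESTAMP.fullmatch(value)
    if match is None:
        raise ValueError(f"Unrecognized timestamp: {value!r}")
    base, fraction, offset = match.groups()
    # Python stores microseconds, so keep at most six fractional digits.
    micros = (fraction or "").ljust(6, "0")[:6]
    offset = "+00:00" if offset == "Z" else offset
    return datetime.fromisoformat(f"{base}.{micros}{offset}")


def token_is_usable(
    login_response: dict,
    now: Optional[datetime] = None,
    margin: timedelta = timedelta(minutes=5),  # assumed safety margin
) -> bool:
    """Return True if the cached token is still valid beyond the margin."""
    now = now or datetime.now(timezone.utc)
    return now + margin < parse_timestamp(login_response[".expires"])
```

A caller would log in again only when `token_is_usable(cached_response)` returns `False`, which keeps the number of login requests well under the rate limit.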
## Request [Section titled “Request”](#request) ### Headers [Section titled “Headers”](#headers) ```http Content-Type: application/json Accept: application/json ``` ### Body [Section titled “Body”](#body) email string required Your Copyleaks account email address key string required Your Copyleaks account API key (UUID format) ## Responses [Section titled “Responses”](#responses) ### Success [Section titled “Success”](#success) * 200 200 OK The command was executed successfully. #### Response Schema The response contains the following fields: access\_token string The authentication token for API access. .issued string The date and time when the token was issued. .expires string The date and time when the token will expire. #### Example Response A typical response from this endpoint: ```json { "access_token": "ACLNSKNSDAACCAJANCOIUiausoo_saidjaskldjoa...", ".issued": "2018-11-24T16:15:38.2431255+02:00", ".expires": "2018-11-26T16:15:38.2431255+02:00" } ``` - 400 400 Bad Request Bad Request - Invalid request format or missing required fields #### Example Response A typical response from this endpoint: ```json { "Key": [ "The key field is required." ] } ``` - 401 401 Unauthorized Unauthorized - Invalid email or API key #### Example Response A typical response from this endpoint: ```json { "message": "Invalid login credentials." 
} ``` - 429 429 Too Many Requests Too Many Requests - Rate limit exceeded (blocked for 5 minutes) #### Example Response A typical response from this endpoint: ```json { "error": "Rate limit exceeded" } ``` ### Examples [Section titled “Examples”](#examples) * HTTP ```http POST https://id.copyleaks.com/v3/account/login/api Headers Content-Type: application/json Body { "email": "your@email.address", "key": "00000000-0000-0000-0000-000000000000" } ``` * cURL ```bash export COPYLEAKS_EMAIL="your@email.address" export COPYLEAKS_API_KEY="your-api-key-here" curl --request POST \ --url https://id.copyleaks.com/v3/account/login/api \ --header 'Accept: application/json' \ --header 'Content-Type: application/json' \ --data "{ \"email\": \"${COPYLEAKS_EMAIL}\", \"key\": \"${COPYLEAKS_API_KEY}\" }" ``` * Python ```python from copyleaks.copyleaks import Copyleaks EMAIL_ADDRESS = "your@email.address" API_KEY = "your-api-key-here" # Login to Copyleaks auth_token = Copyleaks.login(EMAIL_ADDRESS, API_KEY) print("Logged successfully!\nToken:", auth_token) ``` * JavaScript ```javascript const { Copyleaks } = require('plagiarism-checker'); const EMAIL_ADDRESS = "your@email.address"; const API_KEY = "your-api-key-here"; async function login() { const copyleaks = new Copyleaks(); const loginResult = await copyleaks.loginAsync(EMAIL_ADDRESS, API_KEY); console.log('Logged successfully!\nToken:', loginResult); return loginResult; } ``` * Java ```java import com.copyleaks.sdk.api.Copyleaks; String EMAIL_ADDRESS = "your@email.address"; String API_KEY = "00000000-0000-0000-0000-000000000000"; // Login to Copyleaks try { String authToken = Copyleaks.login(EMAIL_ADDRESS, API_KEY); System.out.println("Logged successfully!\nToken: " + authToken); } catch (CommandException e) { System.out.println("Failed to login: " + e.getMessage()); System.exit(1); } ``` --- # Account Actions > Learn how to authenticate and get started with Copyleaks APIs Account Copyleaks APIs allow you to integrate your
systems and services with Copyleaks products. You can start working with a Copyleaks API by sending HTTP requests to the Copyleaks servers. To do so, first register to Copyleaks and get your API key. Then, use your API key and email address to login. ## Authentication API [Section titled “Authentication API”](#authentication-api) [POST https://id.copyleaks.com/v3/account/login/api](/reference/actions/account/login) [Get your personal access token by logging in to the Copyleaks API using your email and API key.](/reference/actions/account/login) --- # Export > Export the full raw scan information and push it to your servers. Authenticity POST **https\://api.copyleaks.com/v3/downloads/{scanId}/export/{exportId}** One of the most common patterns when integrating with our services is to submit a scan and download the full results as soon as the scan is completed. When the scan is completed, Copyleaks triggers a ‘Completed’ webhook to inform that the scan has been completed. At this point, you will have all the needed information (i.e. the ‘result ids’) to download and present the reports on your side. Since you may have a large number of documents to download (the results, crawled version of the text and the pdf-report), you may need to send many HTTP REST calls to execute to export the data from our services. The ‘Export’ method makes this process easier by specifying the content you would like to export in a single call, and we will copy all the data according to your request. Then, we will fire an ‘export-completed’ webhook with the export results summary. If you are using a distributed cloud storage system (like AWS buckets, Google buckets or Azure Storage), we can export the data directly to your storage without the involvement of your servers. To do so, create a Signed URL for each data item that you would like to export. By specifying the request method (verb) and optionally added headers, the writing to this storage will be triggered, as per your definition. 
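To make the single-call export concrete, the sketch below assembles a request body from a list of result IDs, using the field names documented in the Request Body section of this page (`results`, `crawledVersion`, `completionWebhook`, `maxRetries`). It is an illustration under assumptions: the function name, the `base_url` layout, and the choice of `POST` for every upload are hypothetical and simply mirror the examples on this page.

```python
import json
from typing import List


def build_export_body(
    result_ids: List[str],
    base_url: str,
    completion_webhook: str,
    max_retries: int = 3,
    include_crawled_version: bool = True,
) -> dict:
    """Build the JSON body for an export request (illustrative only)."""
    # Limits taken from the documented constraints on this page.
    if not 1 <= max_retries <= 12:
        raise ValueError("maxRetries must be between 1 and 12")
    if len(result_ids) > 1000:
        raise ValueError("results accepts at most 1000 items")

    body = {
        "results": [
            {
                "id": result_id,
                "verb": "POST",
                # Hypothetical URL layout on the receiving server.
                "endpoint": f"{base_url}/results/{result_id}",
            }
            for result_id in result_ids
        ],
        "completionWebhook": completion_webhook,
        "maxRetries": max_retries,
    }
    if include_crawled_version:
        body["crawledVersion"] = {
            "verb": "POST",
            "endpoint": f"{base_url}/crawled-version",
        }
    return body


example = build_export_body(
    ["my-result-id"],
    "https://yourserver.com/export/export-id",
    "https://yourserver.com/export/export-id/completed",
)
print(json.dumps(example, indent=2))
```

When exporting straight to cloud storage, each `endpoint` would instead be a signed URL generated by the storage provider, with the `verb` and any `headers` that the signed URL requires.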
Authentication Required You need to login with a user and api key in order to access this method. Add this HTTP header to your request: **Authorization: Bearer < Your-Login-Token >** Need Help? Not sure how to generate your login token? Read **[here](/reference/actions/account/login/)**. ## Request [Section titled “Request”](#request) ### Path Parameters [Section titled “Path Parameters”](#path-parameters) exportId string required A new Id for the export process. `>= 3 characters` `<= 36 characters` Match pattern: `[a-z0-9] !@$^&-+%=_(){}<>';:/.",~`|\` scanId string required The scan ID of the specific scan to export. learn more about [the criteria for creating a Scan ID](/concepts/management/choosing-scan-id). `>= 3 characters` `<= 36 characters` Match pattern: `[a-z0-9] !@$^&-+%=_(){}<>';:/.",~`|\` ### Headers [Section titled “Headers”](#headers) ```http Content-Type: application/json Authorization: Bearer YOUR_LOGIN_TOKEN ``` ### Request Body [Section titled “Request Body”](#request-body) The request body is a JSON object containing the export configuration. completionWebhook string required This webhook event is triggered once the export is completed. completionWebhookHeaders array\[array] Adds headers to the webhook. Example: `[ [ "header-key", "header-value" ], ... ]` maxRetries integer default: "3" How many retries to send before giving up. Using high value (12) may lead to a longer time until the completionWebhook being executed. A low value (1) may lead to errors while your service is temporary having problems. `>= 1` `<= 12` developerPayload string Add a custom developer payload that will then be provided on the [Export-Completed webhook](/reference/data-types/authenticity/webhooks/scan-completed). results array\[object] An array of results to be exported. Learn more about [Result Webhook](/reference/data-types/authenticity/results/new-plagiarism-result). `<= 1000 items` id string required Result identification to be downloaded. 
You get these identifications from the [completed webhook](/reference/data-types/authenticity/webhooks/scan-completed). endpoint string\ required The HTTP url to upload the data. verb string required The HTTP verb (also called “HTTP Methods”) to upload the data to your specified endpoint. Example: `POST` headers array\[array] (Jagged Array) List of headers to be submitted with the upload request. You may use this field to provide additional request headers, such as “Authorization” header. pdfReport object Download the PDF report. Allowed only when `properties.pdf.create` was set to `true` on the scan submission. endpoint string\ required The HTTP url to upload the data. verb string required The HTTP verb (also called “HTTP Methods”) to upload the data to your specified endpoint. Example: `POST` headers array\[array] (Jagged Array) List of headers to be submitted with the upload request. You may use this field to provide additional request headers, such as “Authorization” header. aiDetection object Export the AI Content detection report. Allowed only when `properties.aiGeneratedText.detect` was set to `true` on the scan submission. Learn more about [AI Content Detection Webhook](/reference/data-types/authenticity/results/ai-detection). endpoint string\ required The HTTP url to upload the data. verb string required The HTTP verb (also called “HTTP Methods”) to upload the data to your specified endpoint. Example: `POST` headers array\[array] (Jagged Array) List of headers to be submitted with the upload request. You may use this field to provide additional request headers, such as “Authorization” header. writingFeedback object Export the Writing Assistant report. Allowed only when `properties.writingFeedback.enable` was set to `true` on the scan submission. Learn more about [Writing Assistant Webhook](/reference/data-types/authenticity/results/writing-assistant). endpoint string\ required The HTTP url to upload the data. 
verb string required The HTTP verb (also called “HTTP Methods”) to upload the data to your specified endpoint. Example: `POST` headers array\[array] (Jagged Array) List of headers to be submitted with the upload request. You may use this field to provide additional request headers, such as “Authorization” header. overview object Export the Overview report. Allowed only when `properties.overview.enable` was set to `true` on the scan submission. Learn more about [Overview Webhook](/reference/data-types/authenticity/results/ai-overview). endpoint string\ required The HTTP url to upload the data. verb string required The HTTP verb (also called “HTTP Methods”) to upload the data to your specified endpoint. Example: `POST` headers array\[array] (Jagged Array) List of headers to be submitted with the upload request. You may use this field to provide additional request headers, such as “Authorization” header. crawledVersion object Download the crawled version of the submitted text. Learn more about [Crawled-Version Webhook](/reference/data-types/authenticity/results/crawled-version). endpoint string\ required The HTTP url to upload the data. verb string required The HTTP verb (also called “HTTP Methods”) to upload the data to your specified endpoint. Example: `POST` headers array\[array] (Jagged Array) List of headers to be submitted with the upload request. You may use this field to provide additional request headers, such as “Authorization” header. ## Responses [Section titled “Responses”](#responses) * 204 204 No Content The command was executed. The export started. - 400 400 Bad Request Bad request. One or more details in your request is wrong. #### Example Response A typical response from this endpoint: ```json Scan wasn't finished yet. ``` - 401 401 Unauthorized Authorization has been denied for this request. 
#### Example Response A typical response from this endpoint: ```json { "type": "https://tools.ietf.org/html/rfc9110#section-15.5.2", "title": "Unauthorized", "status": 401, "traceId": "00-ef0db7690ced98431ac97782051edc77-2c4194d74ae6c08b-00" } ``` - 404 404 Not Found The scan id that was specified doesn't exist. - 409 409 Conflict Conflict. An export task with the same Id already exists in the system. #### Example Response A typical response from this endpoint: ```json { "type": "https://tools.ietf.org/html/rfc9110#section-15.5.10", "title": "Conflict", "status": 409, "traceId": "00-561fe3b2451eb51ce489557a2f34f247-3acd41c0b4ad1295-00" } ``` ## Examples [Section titled “Examples”](#examples) * HTTP ```http POST https://api.copyleaks.com/v3/downloads/my-scan-123/export/my-export-1 Content-Type: application/json Authorization: Bearer YOUR_LOGIN_TOKEN { "results": [ { "id": "my-result-id", "verb": "POST", "headers": [ [ "header-key", "header-value" ] ], "endpoint": "https://yourserver.com/export/export-id/results/my-result-id" } ], "pdfReport": { "verb": "POST", "headers": [ [ "header-key", "header-value" ] ], "endpoint": "https://yourserver.com/export/export-id/pdf-report" }, "crawledVersion": { "verb": "POST", "headers": [ [ "header-key", "header-value" ] ], "endpoint": "https://yourserver.com/export/export-id/crawled-version" }, "completionWebhook": "https://yourserver.com/export/export-id/completed", "maxRetries": 3 } ``` * cURL ```bash curl --request POST \ --url https://api.copyleaks.com/v3/downloads/my-scan-123/export/my-export-1 \ --header 'Authorization: Bearer YOUR_LOGIN_TOKEN' \ --header 'Content-Type: application/json' \ --data '{ "results": [ { "id": "my-result-id", "verb": "POST", "headers": [ [ "header-key", "header-value" ] ], "endpoint": "https://yourserver.com/export/export-id/results/my-result-id" } ], "pdfReport": { "verb": "POST", "headers": [ [ "header-key", "header-value" ] ], "endpoint": "https://yourserver.com/export/export-id/pdf-report" },
"crawledVersion": { "verb": "POST", "headers": [ [ "header-key", "header-value" ] ], "endpoint": "https://yourserver.com/export/export-id/crawled-version" }, "completionWebhook": "https://yourserver.com/export/export-id/completed", "maxRetries": 3 }' ``` ## Next Steps [Section titled “Next Steps”](#next-steps) [Webhooks Overview ](/reference/data-types/authenticity/webhooks/overview/)Learn about the different types of webhooks and how to handle them, including export completion webhooks. [Export Completed Webhook ](/reference/data-types/authenticity/webhooks/scan-completed/)Understand the details provided in the export completed webhook. [How to Display Scan Reports ](/concepts/features/how-to-display/)Learn how to present exported scan data to your users. --- # Downloads Actions > Download your scan reports. The Copyleaks downloads API allows you to download your scan reports. ## Endpoints [Section titled “Endpoints”](#endpoints) [POST v3/downloads/{scanId}/export/{exportId}](/reference/actions/downloads/export) [Export your scan report by providing a scan ID.](/reference/actions/downloads/export) --- # OCR Supported Languages > Languages supported for OCR processing Authenticity GET **https\://api.copyleaks.com/v3/miscellaneous/ocr-languages-list** Get a list of the supported languages for OCR Note This is not a list of supported languages for the API, but only for the OCR files scan ## Request [Section titled “Request”](#request) * HTTP ```http GET https://api.copyleaks.com/v3/miscellaneous/ocr-languages-list ``` * cURL ```bash curl --request GET \ --url https://api.copyleaks.com/v3/miscellaneous/ocr-languages-list ``` ## Response [Section titled “Response”](#response) * 200 200 OK The supported language codes in ISO-639-1 standard. 
#### Example Response A typical response from this endpoint: ```json [ "af", "sq", "az", "...", "zu" ] ``` *** ## OCR Supported Languages [Section titled “OCR Supported Languages”](#ocr-supported-langauges) These are the language codes supported by our OCR scan in `ISO-639-1` standard: Tip We keep updating the list with new languages, so we recommend [loading the list at runtime](/reference/actions/miscellaneous/ocr-supported-languages) rather than copying it to your code. **Name** | Code | Language | Code | Language | | ----- | --------------------- | ----- | -------------------- | | af | Afrikaans | am | Amharic | | ar | Arabic | az | Azerbaijani | | be | Belarusian | bg | Bulgarian | | bn | Bengali | bs | Bosnian | | ca | Catalan | ceb | Cebuano | | co | Corsican | cs | Czech | | cy | Welsh | da | Danish | | de | German | el | Greek | | en | English | eo | Esperanto | | es | Spanish | et | Estonian | | eu | Basque | fa | Persian | | fi | Finnish | fr | French | | fy | Frisian | ga | Irish | | gd | Scottish Gaelic | gl | Galician | | gu | Gujarati | ha | Hausa | | haw | Hawaiian | hi | Hindi | | hmn | Hmong | hr | Croatian | | ht | Haitian Creole | hu | Hungarian | | hy | Armenian | id | Indonesian | | ig | Igbo | is | Icelandic | | it | Italian | iw | Hebrew | | ja | Japanese | jw | Javanese | | ka | Georgian | kk | Kazakh | | km | Khmer | kn | Kannada | | ko | Korean | ku | Kurdish | | ky | Kyrgyz | la | Latin | | lb | Luxembourgish | lo | Lao | | lt | Lithuanian | lv | Latvian | | ma | Marathi | mg | Malagasy | | mi | Maori | mk | Macedonian | | ml | Malayalam | mn | Mongolian | | mr | Marathi | ms | Malay | | mt | Maltese | my | Burmese | | ne | Nepali | nl | Dutch | | no | Norwegian | ny | Chichewa | | pl | Polish | ps | Pashto | | pt | Portuguese | ro | Romanian | | ru | Russian | sd | Sindhi | | si | Sinhala | sk | Slovak | | sl | Slovenian | sm | Samoan | | sn | Shona | so | Somali | | sq | Albanian | sr | Serbian | | st | Sesotho | su | Sundanese | | sv |
Swedish | sw | Swahili | | ta | Tamil | te | Telugu | | tg | Tajik | th | Thai | | tl | Tagalog | tr | Turkish | | uk | Ukrainian | ur | Urdu | | uz | Uzbek | vi | Vietnamese | | xh | Xhosa | yi | Yiddish | | yo | Yoruba | zh-CN | Chinese (Simplified) | | zh-TW | Chinese (Traditional) | zu | Zulu | --- # Miscellaneous Actions > Get information about supported file types, languages, and more. The Copyleaks miscellaneous API allows you to get information about supported file types, languages, and more. ## Endpoints [Section titled “Endpoints”](#endpoints) [GET v3/miscellaneous/supported-file-types](/reference/actions/miscellaneous/supported-file-types) [Get a list of supported file types.](/reference/actions/miscellaneous/supported-file-types) [GET v3/miscellaneous/ocr-languages-list](/reference/actions/miscellaneous/ocr-supported-languages) [Get a list of supported languages for OCR.](/reference/actions/miscellaneous/ocr-supported-languages) [GET v3/miscellaneous/allowed-cross-languages](/reference/actions/miscellaneous/supported-cross-languages) [Get a list of supported languages for cross-language scans.](/reference/actions/miscellaneous/supported-cross-languages) --- # Cross-Language Plagiarism > Cross-Language Plagiarism detection capabilities Authenticity GET **https\://api.copyleaks.com/v3/miscellaneous/allowed-cross-languages** Cross-language plagiarism detection identifies content that has been translated from one language to another, helping catch plagiarism attempts where text is copied and translated to avoid detection. This document provides information about the languages supported by Copyleaks for cross-language scans. The language codes are provided in the `ISO-639-1` standard.
## Request [Section titled “Request”](#request) * HTTP ```http GET https://api.copyleaks.com/v3/miscellaneous/allowed-cross-languages ``` * cURL ```bash curl --request GET \ --url https://api.copyleaks.com/v3/miscellaneous/allowed-cross-languages ``` ## Response [Section titled “Response”](#response) * 200 200 OK The supported language codes in `ISO-639-1` standard. #### Response Schema The response contains the following fields: documentLanguages array\ A list of supported source languages for cross-language scans. resultLanguages array\ A list of supported result languages for cross-language scans. #### Example Response A typical response from this endpoint: ```json { "documentLanguages": [ "da", "nl", "en", "...", "es" ], "resultLanguages": [ "sq", "bg", "my", "ca", "hr", "cs", "da", "...", "vi" ] } ``` ## Supported Languages for Cross-Language Scans [Section titled “Supported Languages for Cross-Language Scans”](#supported-languages-for-cross-language-scans) The following sections list the supported source and result languages for cross-language scans. These language codes are provided in the `ISO-639-1` standard. Tip We keep updating the list with new languages, so we recommend [loading the list at runtime](/reference/actions/miscellaneous/supported-cross-languages) rather than copying it to your code.
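Loading the list at runtime could look like the following Python sketch, which uses only the standard library. It is a hedged illustration: the function names are hypothetical, and the request is sent without an `Authorization` header on the assumption that none is needed, since the cURL example for this endpoint sends none.

```python
import json
from urllib.request import urlopen

CROSS_LANGUAGES_URL = (
    "https://api.copyleaks.com/v3/miscellaneous/allowed-cross-languages"
)


def fetch_cross_languages(url: str = CROSS_LANGUAGES_URL, timeout: float = 10.0) -> dict:
    """Download the current source and result language lists."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


def is_pair_supported(languages: dict, source: str, result: str) -> bool:
    """Check a source/result language pair against a fetched response."""
    return (
        source in languages.get("documentLanguages", [])
        and result in languages.get("resultLanguages", [])
    )
```

A service might call `fetch_cross_languages()` once at startup, keep the result in memory, and pass it to `is_pair_supported(languages, "en", "vi")` before submitting a cross-language scan.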
### Allowed Source Languages

The following languages can be used as the source language in a cross-language scan:

| Code | Language   | Code | Language |
| ---- | ---------- | ---- | -------- |
| da   | Danish     | fr   | French   |
| nl   | Dutch      | de   | German   |
| en   | English    | it   | Italian  |
| pt   | Portuguese | ru   | Russian  |
| es   | Spanish    |      |          |

### Allowed Result Languages

The following languages can be used as the result language in a cross-language scan:

| Code | Language   | Code | Language   |
| ---- | ---------- | ---- | ---------- |
| sq   | Albanian   | gl   | Galician   |
| bg   | Bulgarian  | ka   | Georgian   |
| ca   | Catalan    | de   | German     |
| hr   | Croatian   | el   | Greek      |
| cs   | Czech      | hi   | Hindi      |
| da   | Danish     | hu   | Hungarian  |
| nl   | Dutch      | id   | Indonesian |
| en   | English    | it   | Italian    |
| fi   | Finnish    | lv   | Latvian    |
| fr   | French     | lt   | Lithuanian |
| mk   | Macedonian | my   | Burmese    |
| fa   | Persian    | pl   | Polish     |
| pt   | Portuguese | ro   | Romanian   |
| ru   | Russian    | sr   | Serbian    |
| sk   | Slovak     | sl   | Slovenian  |
| es   | Spanish    | sv   | Swedish    |
| tr   | Turkish    | uk   | Ukrainian  |
| ur   | Urdu       | vi   | Vietnamese |

---

# Supported File Types

> File formats accepted by the API

Authenticity

GET **https\://api.copyleaks.com/v3/miscellaneous/supported-file-types**

Get a list of the supported file types.

## Request

* HTTP

  ```http
  GET https://api.copyleaks.com/v3/miscellaneous/supported-file-types
  ```

* cURL

  ```bash
  curl --request GET \
    --url https://api.copyleaks.com/v3/miscellaneous/supported-file-types
  ```

## Response

* 200 OK

  The command was executed.

#### Response Schema

The response contains the following fields:

textual array\
A list of supported file extensions for textual content scans.

ocr array\
A list of supported file extensions for OCR (image-based) scans.

#### Example Response

A typical response from this endpoint:

```json
{
  "textual": [
    "pdf",
    "docx",
    "doc",
    "txt",
    "rtf",
    "xml",
    "pptx",
    "ppt",
    "odt",
    "chm",
    "epub",
    "odp",
    "ppsx",
    "pages",
    "xlsx",
    "xls",
    "csv",
    "LaTeX"
  ],
  "ocr": [
    "gif",
    "png",
    "bmp",
    "jpg",
    "jpeg"
  ]
}
```

---

# Actions Overview

> Explore the Copyleaks API actions for managing scans, detecting AI-generated content, moderating text, and more.

Actions

Explore the Copyleaks API actions to manage your content integrity and authenticity needs. Each action provides specific functionality to help you integrate our services effectively.

* [Account](/reference/actions/account/overview): Manage your account and login.
* [Authenticity](/reference/actions/scans/overview): Submit scans, check status, and manage results.
* [AI Detector](/reference/actions/writer-detector/overview): Detect AI-generated text and source code.
* [Writing](/reference/actions/writing-assistant/overview): Check grammar and get writing feedback.
* [Moderation](/reference/actions/text-moderation/overview): Moderate text for harmful content.

---

# Get Repository Information

> Get repository information such as credit consumption, metadata values and current status.

Authenticity

GET **https\://api.copyleaks.com/v3/repositories/repository/{repositoryId}/info**

Get repository information such as credit consumption, metadata values and current status. A “Super Admin” or “Admin” role is required.

Authentication Required

You need to log in with a user and API key to access this method. Add this HTTP header to your request: **Authorization: Bearer < Your-Login-Token >**

Need Help? Not sure how to generate your login token? Read **[here](/reference/actions/account/login/)**.

## Request

### Path Parameters

repositoryId string required

The ID of the repository to get information for. The repository ID can be found in the [Copyleaks Admin Dashboard](https://admin.copyleaks.com/repositories).

### Headers

```http
Authorization: Bearer YOUR_LOGIN_TOKEN
```

## Responses

* 200 OK

  The command was executed.

  #### Example Response

  A typical response from this endpoint:

  ```json
  {
    "id": "private-data-hub-id",
    "name": "Private Data Hub Name",
    "description": "Your Description",
    "permission": 4,
    "status": 0,
    "maxCredits": 1000,
    "currentCredits": 1000,
    "maskingPolicy": 0,
    "creationTime": "2024-09-09T10:43:52"
  }
  ```

* 400 Bad Request

  Bad Request.

  #### Example Response

  A typical response from this endpoint:

  ```json
  {
    "repositoryId": [
      "The field repositoryId must match the regular expression '^[_A-Za-z0-9]*$'."
    ]
  }
  ```

* 401 Unauthorized

  Authorization has been denied for this request.

  #### Example Response

  A typical response from this endpoint:

  ```json
  {
    "type": "https://tools.ietf.org/html/rfc9110#section-15.5.2",
    "title": "Unauthorized",
    "status": 401,
    "traceId": "00-ef0db7690ced98431ac97782051edc77-2c4194d74ae6c08b-00"
  }
  ```

* 403 Forbidden

  Your organization role does not permit you to perform this request. This operation requires the "Super Admin" or "Admin" role.

## Examples

* HTTP

  ```http
  GET https://api.copyleaks.com/v3/repositories/repository/my-repo-123/info
  Authorization: Bearer YOUR_LOGIN_TOKEN
  ```

* cURL

  ```bash
  curl --request GET \
    --url https://api.copyleaks.com/v3/repositories/repository/my-repo-123/info \
    --header 'Authorization: Bearer YOUR_LOGIN_TOKEN'
  ```

---

# Private Cloud Hub Actions

> Manage your Private Cloud Hubs.

The Copyleaks Private Cloud Hub API helps you maintain your data hubs.

## Endpoints

[GET v3/repositories/repository/{repositoryId}/info](/reference/actions/private-cloud-hub/info)
[Get information about your Private Cloud Hubs.](/reference/actions/private-cloud-hub/info)

---

# Get Credit Balance

> Get your current credit balance.

Authenticity

GET **https\://api.copyleaks.com/v3/scans/credits**

Get your current credit balance. Each credit allows you to scan up to 250 words.

Authentication Required

You need to log in with a user and API key to access this method. Add this HTTP header to your request: **Authorization: Bearer < Your-Login-Token >**

Need Help? Not sure how to generate your login token? Read **[here](/reference/actions/account/login/)**.

## Request

### Headers

```http
Authorization: Bearer YOUR_LOGIN_TOKEN
```

## Responses

* 200 OK

  The command was executed.

  #### Response Schema

  The response contains the following fields:

  Amount integer

  The number of credits available.

  #### Example Response

  A typical response from this endpoint:

  ```json
  {
    "Amount": 100
  }
  ```

* 401 Unauthorized

  Authorization has been denied for this request.

  #### Example Response

  A typical response from this endpoint:

  ```json
  {
    "type": "https://tools.ietf.org/html/rfc9110#section-15.5.2",
    "title": "Unauthorized",
    "status": 401,
    "traceId": "00-ef0db7690ced98431ac97782051edc77-2c4194d74ae6c08b-00"
  }
  ```

## Examples

* HTTP

  ```http
  GET https://api.copyleaks.com/v3/scans/credits
  Authorization: Bearer YOUR_LOGIN_TOKEN
  ```

* cURL

  ```bash
  curl --request GET \
    --url https://api.copyleaks.com/v3/scans/credits \
    --header 'Authorization: Bearer YOUR_LOGIN_TOKEN'
  ```

---

# Delete Scans

> Delete scans from the Copyleaks API.

Authenticity

PATCH **https\://api.copyleaks.com/v3.1/scans/delete**

Delete scans from the Copyleaks API. Only completed scans can be deleted.
All of the scan results, metadata, and information will be removed. Deletion is performed in the background and can take a few minutes.

Authentication Required

You need to log in with a user and API key to access this method. Add this HTTP header to your request: **Authorization: Bearer < Your-Login-Token >**

Need Help? Not sure how to generate your login token? Read **[here](/reference/actions/account/login/)**.

## Request

### Headers

```http
Content-Type: application/json
Authorization: Bearer YOUR_LOGIN_TOKEN
```

### Request Body

The request body is a JSON object containing the scans to delete.

scans array\
