Identical Matches Detection
This document provides an overview of configuring the Copyleaks API to detect only identical text matches while filtering out paraphrased and minor changed content, focusing specifically on exact duplications.
📖 Introduction
Section titled “📖 Introduction”The Identical Matches Only configuration is designed for teams that need to focus specifically on exact text duplications without the complexity of similarity detection. This approach is particularly useful for detecting direct copying and verbatim plagiarism.
By focusing exclusively on identical matches, teams can efficiently identify the most straightforward cases of content duplication while reducing false positives and ambiguous similarity detections that might require additional review time.
🔧 Configuration
Section titled “🔧 Configuration”To detect only identical matches, you must disable the settings for paraphrased content and minor changes in your API request.
- Disable Minor Changes: Set
properties.filters.minorChangesEnabled
tofalse
. - Disable Paraphrased Content: Set
properties.filters.relatedMeaningEnabled
tofalse
. - Enable Internet Scanning: Set
properties.scanning.internet
totrue
to scan against online sources.
Example JSON Configuration
Section titled “Example JSON Configuration”Here is an example of the properties
object configured for an identical-only scan against internet sources.
{ "properties": { "filters": { "minorChangesEnabled": false, "relatedMeaningEnabled": false }, "scanning": { "internet": true } }}
🚀 Next Steps
Section titled “🚀 Next Steps”After configuring your identical matches detection:
💬 Support
Section titled “💬 Support”Should you require any assistance, please contact Copyleaks Support or ask a question on StackOverflow with the copyleaks-api
tag.