Core Requirements
While you have the flexibility to choose a Scan ID that aligns with your internal system, there are a few limitations to keep in mind:- Character Length: Must be between 3 and 36 characters.
- Allowed Characters: The Scan ID can include lower case characters
a-z, digits0-9and special symbols!@$&-=_()';:., ~. We recommend using lower case letters, digits and dashes for simplicity.
Strategies for Naming Your Scan ID
The best approach is to create a structured Scan ID that embeds useful information. This allows you to easily identify a scan based on its scan ID and use advanced features like include or exclude specific groups of documents from a scan.Recommended Structure
A highly effective structure is:<ORGANIZATION_ID>-<AUTHOR_ID>-<DOCUMENT_ID>
Examples
tech-corp-employee456-q3-reportacme-university-student123-final-thesis
In plagiarism scans, this structure enables powerful filtering capabilities:
- Exclude by author: Use a pattern like
*-student123-*to prevent a student’s new submission from being checked against their previous work. - Include by organization: Use
acme-university-*to compare a document only against others from the same institution. - Focus on document types: Use
*-final-thesisto analyze all final theses submitted.
Handling ID Mismatches
If your internal system uses IDs that don’t meet Copyleaks’ requirements (e.g., they are too long or contain uppercase letters), the recommended solution is to generate a compliant Scan ID and maintain a mapping table on your end. This table will link your internal entity ID to the corresponding Copyleaks Scan ID, ensuring seamless integration.| Your Internal ID | Copyleaks Scan ID |
|---|---|
USER-9876-DOC-A | user9876-doca |
Submission_ABC_123 | submission-abc-123 |
Next Steps
Prevent Self-Plagiarism
Learn more about how to exclude previous submissions from the same student to prevent self-plagiarism.
Submit a File
See how to implement your Scan ID strategy when submitting a file for scanning.

