GET
/
v3
/
miscellaneous
/
ocr-languages-list
curl --request GET \
  --url https://api.copyleaks.com/v3/miscellaneous/ocr-languages-list
["af", "sq", "az", "...", "zu"]
curl --request GET \
  --url https://api.copyleaks.com/v3/miscellaneous/ocr-languages-list
["af", "sq", "az", "...", "zu"]
Get a list of the supported languages for OCR
This is not a list of supported languages for the API, but only for the OCR files scan

Response

200 OK - The supported language codes in ISO-639-1 standard.
["af", "sq", "az", "...", "zu"]

OCR Supported Languages

These are the language codes supported by our OCR scan in ISO-639-1 standard:
We keep updating the list with new languages so we recommend loading the list in runtime rather than copying it to your code.
CodeLanguageCodeLanguage
afAfrikaansamAmharic
arArabicazAzerbaijani
beBelarusianbgBulgarian
bnBengalibsBosnian
caCatalancebCebuano
coCorsicancsCzech
cyWelshdaDanish
deGermanelGreek
enEnglisheoEsperanto
esSpanishetEstonian
euBasquefaPersian
fiFinnishfrFrench
fyFrisiangaIrish
gdScottish GaelicglGalician
guGujaratihaHausa
hawHawaiianhiHindi
hmnHmonghrCroatian
htHaitian CreolehuHungarian
hyArmenianidIndonesian
igIgboisIcelandic
itItalianiwHebrew
jaJapanesejwJavanese
kaGeorgiankkKazakh
kmKhmerknKannada
koKoreankuKurdish
kyKyrgyzlaLatin
lbLuxembourgishloLao
ltLithuanianlvLatvian
maMarathimgMalagasy
miMaorimkMacedonian
mlMalayalammnMongolian
mrMarathimsMalay
mtMaltesemyBurmese
neNepalinlDutch
noNorwegiannyChichewa
plPolishpsPashto
ptPortugueseroRomanian
ruRussiansdSindhi
siSinhalaskSlovak
slSloveniansmSamoan
snShonasoSomali
sqAlbaniansrSerbian
stSesothosuSundanese
svSwedishswSwahili
taTamilteTelugu
tgTajikthThai
tlTagalogtrTurkish
ukUkrainianurUrdu
uzUzbekviVietnamese
xhXhosayiYiddish
yoYorubazh-CNChinese (Simplified)
zh-TWChinese (Traditional)zuZulu

Frequently asked questions

What are OCR supported languages used for?

They apply only to OCR scans, where Copyleaks extracts text from images and scanned documents. This is not the general language list for plagiarism or AI detection.

How do I get the current list of OCR languages?

Call GET https://api.copyleaks.com/v3/miscellaneous/ocr-languages-list. It is a public endpoint that needs no authentication. Copyleaks keeps adding languages, so load the list at runtime instead of hardcoding it.

What language code format does OCR use?

ISO-639-1 codes (for example en, fr, ar), with zh-CN for Simplified Chinese and zh-TW for Traditional Chinese.

Does OCR support non-Latin scripts like Arabic, Chinese, and Hindi?

Yes. The OCR engine supports 100+ languages, including Arabic (ar), Chinese (zh-CN, zh-TW), Hindi (hi), Japanese (ja), Korean (ko), and many more.