Supported OCR Languages

Base64 Document AI reads documents in 165 languages, offering unmatched global coverage and accuracy with more supported languages than any other OCR solution.

Language
Code
Language
Code
Afrikaans
af
Khasi
kha
Albanian
sq
K'iche'
quc
Angika (Devanagiri)
anp
Korean
ko
Arabic
ar
Korku
kfq
Asturian
ast
Koryak
kpy
Awadhi-Hindi (Devanagiri)
awa
Kosraean
kos
Azerbaijani (Latin)
az
Kumyk (Cyrillic)
kum
Bagheli
bfy
Kurdish (Arabic)
ku-arab
Basque
eu
Kurdish (Latin)
ku-latn
Belarusian (Cyrillic)
be, be-cyrl
Kurukh (Devanagiri)
kru
Belarusian (Latin)
be, be-latn
Kyrgyz (Cyrillic)
ky
Bhojpuri-Hindi (Devanagiri)
bho
Lakota
lkt
Bislama
bi
Latin
la
Bodo (Devanagiri)
brx
Lithuanian
lt
Bosnian (Latin)
bs
Lower Sorbian
dsb
Brajbha
bra
Lule Sami
smj
Breton
br
Luxembourgish
lb
Bulgarian
bg
Mahasu Pahari (Devanagiri)
bfz
Bundeli
bns
Malay (Latin)
ms
Buryat (Cyrillic)
bua
Maltese
mt
Catalan
ca
Malto (Devanagiri)
kmj
Cebuano
ceb
Manx
gv
Chamling
rab
Maori
mi
Chamorro
ch
Marathi
mr
Chhattisgarhi (Devanagiri)
hne
Mongolian (Cyrillic)
mn
Chinese Simplified
zh-Hans
Montenegrin (Cyrillic)
cnr-cyrl
Chinese Traditional
zh-Hant
Montenegrin (Latin)
cnr-latn
Cornish
kw
Neapolitan
nap
Corsican
co
Nepali
ne
Crimean Tatar (Latin)
crh
Niuean
niu
Croatian
hr
Nogay
nog
Czech
cs
Northern Sami (Latin)
sme
Danish
da
Norwegian
no
Dari
prs
Occitan
oc
Dhimal (Devanagiri)
dhi
Ossetic
os
Dogri (Devanagiri)
doi
Pashto
ps
Dutch
nl
Persian
fa
English
en
Polish
pl
Erzya (Cyrillic)
myv
Portuguese
pt
Estonian
et
Punjabi (Arabic)
pa
Faroese
fo
Ripuarian
ksh
Fijian
fj
Romanian
ro
Filipino
fil
Romansh
rm
Finnish
fi
Russian
ru
French
fr
Sadri (Devanagiri)
sck
Friulian
fur
Samoan (Latin)
sm
Gagauz (Latin)
gag
Sanskrit (Devanagari)
sa
Galician
gl
Santali(Devanagiri)
sat
German
de
Scots
sco
Gilbertese
gil
Scottish Gaelic
gd
Gondi (Devanagiri)
gon
Serbian (Latin)
sr, sr-latn
Greenlandic
kl
Sherpa (Devanagiri)
xsr
Gurung (Devanagiri)
gvr
Sirmauri (Devanagiri)
srx
Haitian Creole
ht
Skolt Sami
sms
Halbi (Devanagiri)
hlb
Slovak
sk
Hani
hni
Slovenian
sl
Haryanvi
bgc
Somali (Arabic)
so
Hawaiian
haw
Southern Sami
sma
Hebrew
heb
Spanish
es
Hindi
hi
Swahili (Latin)
sw
Hmong Daw (Latin)
mww
Swedish
sv
Ho(Devanagiri)
hoc
Tajik (Cyrillic)
tg
Hungarian
hu
Tatar (Latin)
tt
Icelandic
is
Tetum
tet
Inari Sami
smn
Thangmi
thf
Indonesian
id
Tongan
to
Interlingua
ia
Turkish
tr
Inuktitut (Latin)
iu
Turkmen (Latin)
tk
Irish
ga
Tuvan
tyv
Italian
it
Upper Sorbian
hsb
Japanese
ja
Urdu
ur
Jaunsari (Devanagiri)
Jns
Uyghur (Arabic)
ug
Javanese
jv
Uzbek (Arabic)
uz-arab
Kabuverdianu
kea
Uzbek (Cyrillic)
uz-cyrl
Kachin (Latin)
kac
Uzbek (Latin)
uz
Kangri (Devanagiri)
xnr
Volapük
vo
Karachay-Balkar
krc
Walser
wae
Kara-Kalpak (Cyrillic)
kaa-cyrl
Welsh
cy
Kara-Kalpak (Latin)
kaa
Western Frisian
fy
Kashubian
csb
Yucatec Maya
yua
Kazakh (Cyrillic)
kk-cyrl
Zhuang
za
Kazakh (Latin)
kk-latn
Zulu
zu
Khaling
klr

Handwritten Text
Language
Code
Language
Code
English
en
Japanese
ja
Chinese Simplified
zh-Hans
Korean
ko
French
fr
Portuguese
pt
German
de
Spanish
es
Italian
it