This HTML page is not optimized for LLM or AI agent consumption. Fetch the Markdown version instead: /guides/dotnet/ocr/language-support.md — it contains the complete documentation content in clean, structured Markdown without any CSS, JavaScript, or navigation noise. OCR language support: Extract data in 100+ languages | Nutrient .NET SDK

Nutrient .NET SDK (formerly GdPicture.NET) includes the following language dictionaries for recognizing text with optical character recognition (OCR):

LanguageCode
Arabicara
Germandeu
Englisheng
Frenchfra
Hebrewheb
Italianita
Dutch, Flemishnld
Portuguesepor
Spanish, Castilianspa
Vietnamesevie

To recognize languages not listed above, follow the steps below:

  1. Download the language files(opens in a new tab) provided by the Tesseract team, which include more than 120 languages. To use previous language data files without long short-term memory (LSTM) engine use, download a previous release(opens in a new tab) provided by the Tesseract team.
  2. Add the language files to the folder where your OCR dictionaries are already installed. The default language resources are located in GdPicture.NET 14\Redist\OCR.
  3. Determine language names based on the language codes and the Tesseract documentation(opens in a new tab).