Language detection

The Detect Language operation detects the language of each input text and outputs the corresponding ISO 639-1 (2-letter) language code.

Tomas Larsson avatar
Written by Tomas Larsson
Updated over a week ago

Keyword: detect language

Use the Detect Language operation to detects the language of the text. Useful for language filtering and as a pre-step to operations that require an input language field, for example in the multi-language analysis.

Step-by-step guide

1. Open the operation configuration window

Select the text field that you want to apply the operation to in the Schema workbench and click the "Add operation" button at the top of the workspace.
​

Search for "Detect language" or find the operation under "Text enrichment" and click it.
​

2. Name the output field

Under "Output field name", type the name of the field that the languages should be inserted into.

3. Specify the level of detection

Keep the "Exhaustive" on for the most accurate results, or turn it off for faster computation. The exhaustive option utilizes multiple language detectors with different algorithms, increasing accuracy but also computation time.
​

4. Apply the operation

Click "Apply" to run the operation. A field containing the language of each text is now added.
​

Did this answer your question?