Encyclopedia > L > Language identification
Language identification
If one is translating texts of unknown origin, the first order of business is to recognize the language of the text, also known as language identification which is a kind of text categorization. This can be done by comparing the compressibility of the text to the compressibility of texts in the known languages.
Information are taken from Wikipedia, the open encyclopedia, to which contribute many volunteers from around the whole world. Texts are available under the following conditions GNU Free Documentation License.
Encyklopedie (cz) Encyklopédia (sk) Enzyklopädie (de)