Written language identification
TextCat is an implementation of the text categorization algorithm presented in Cavnar, W. B. and J. M. Trenkle, "N-Gram-Based Text Categorization". TextCat uses this the technique to implement a written language identification. At the moment, it knows about 69 natural languages (counting Esperanto as a natural language).
Release | Stable | Testing |
---|---|---|
Fedora Rawhide | 1.10-14.fc35 | - |
Fedora 35 | 1.10-14.fc35 | - |
Fedora 34 | 1.10-13.fc34 | - |
EPEL 7 | 1.10-1.el7 | - |
You can contact the maintainers of this package via email at
textcat dash maintainers at fedoraproject dot org
.