Name: | textcat |
---|---|
Version: | 1.10 |
Release: | 1.el7 |
Architecture: | noarch |
Group: | Unspecified |
Size: | 359561 |
License: | LGPLv2+ |
RPM: | textcat-1.10-1.el7.noarch.rpm |
Source RPM: | textcat-1.10-1.el7.src.rpm |
Build Date: | Mon Nov 06 2017 |
Build Host: | x86-ol7-builder-01.us.oracle.com |
Vendor: | Oracle America |
URL: | http://www.let.rug.nl/~vannoord/TextCat/ |
Summary: | Written language identification |
Description: | TextCat is an implementation of the text categorization algorithm presented in Cavnar, W. B. and J. M. Trenkle, "N-Gram-Based Text Categorization". TextCat uses this the technique to implement a written language identification. At the moment, it knows about 69 natural languages (counting Esperanto as a natural language). |