ucto binary package in Ubuntu Trusty i386
Ucto can tokenize UTF-8 encoded text files (i.e. separate words from
punctuation, split sentences, generate n-grams), and offers several other
basic preprocessing steps (change case, count words/characters and reverse
lines) that make your text suited for further processing such as indexing,
part-of-speech tagging, or machine translation.
.
Ucto is a product of the ILK Research Group, Tilburg University (The
Netherlands).
.
If you are interested in machine parsing of UTF-8 encoded text files, e.g. to
do scientific research in natural language processing, ucto will likely be of
use to you.
Publishing history
Date | Status | Target | Component | Section | Priority | Phased updates | Version | ||
---|---|---|---|---|---|---|---|---|---|
2014-01-17 09:13:21 UTC | Published | Ubuntu Trusty i386 | release | universe | science | Extra | 0.5.3-3.1ubuntu1 | ||
|
|||||||||
Deleted | Ubuntu Trusty i386 | proposed | universe | science | Extra | 0.5.3-3.1ubuntu1 | |||
|
|||||||||
2014-01-17 09:14:10 UTC | Superseded | Ubuntu Trusty i386 | release | universe | science | Extra | 0.5.3-3.1build1 | ||
|
|||||||||
2014-01-18 12:10:11 UTC | Deleted | Ubuntu Trusty i386 | proposed | universe | science | Extra | 0.5.3-3.1build1 | ||
|
|||||||||
2013-12-31 02:35:52 UTC | Superseded | Ubuntu Trusty i386 | release | universe | science | Extra | 0.5.3-3.1 | ||
|