golang-gopkg-neurosnap-sentences.v1-dev binary package in Ubuntu Bionic amd64

 A golang package that converts a blob of text into a list of sentences.
 .
 This package attempts to support a multitude of languages:
 Czech, Danish, Dutch, English, Estonian, Finnish,
 French, German, Greek, Italian, Norwegian, Polish,
 Portuguese, Slovene, Spanish, Swedish, and Turkish.
 .
 An unsupervised multilingual sentence boundary detection library for golang.
 The goal of this library is to be able to break up any text into a list of
 sentences in multiple languages. The way the punkt system accomplishes this
 goal is through training the tokenizer with text in that given language.
 Once the likelihoods of abbreviations, collocations, and sentence starters
 are determined, finding sentence boundaries becomes easier.
 .
 There are many problems that arise when tokenizing text into sentences,
 the primary issue being abbreviations. The punkt system attempts to
 determine whether a word is an abbreviation, an end to a sentence, or even
 both through training the system with text in the given language.
 The punkt system incorporates both token- and type-based analysis on the
 text through two different phases of annotation.
 .
 Original research article:
 http://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=BAE5C34E5C3B9DC60DFC4D93B85D8BB1?doi=10.1.1.85.5017&rep=rep1&type=pdf

Publishing history

Date Status Target Pocket Component Section Priority Phased updates Version
  2017-11-03 08:31:32 UTC Published Ubuntu Bionic amd64 release universe devel Extra 1.0.6-1
  • Published
  • Copied from ubuntu bionic-proposed amd64 in Primary Archive for Ubuntu
  Deleted Ubuntu Bionic amd64 proposed universe devel Extra 1.0.6-1
  • Removal requested .
  • Deleted by Ubuntu Archive Robot

    moved to release

  • Published