Uses of Package
org.apache.lucene.analysis.icu.segmentation
Packages that use org.apache.lucene.analysis.icu.segmentation
Package
Description
Tokenizer that breaks text into words with the Unicode Text Segmentation algorithm.
-
Classes in org.apache.lucene.analysis.icu.segmentation used by org.apache.lucene.analysis.icu.segmentationClassDescriptionWraps RuleBasedBreakIterator, making object reuse convenient and emitting a rule status for emoji sequences.Wraps a char[] as CharacterIterator for processing with a BreakIteratorAn internal BreakIterator for multilingual text, following recommendations from: UAX #29: Unicode Text Segmentation.Breaks text into words according to UAX #29: Unicode Text Segmentation (http://www.unicode.org/reports/tr29/)Class that allows for tailored Unicode Text Segmentation on a per-writing system basis.An iterator that locates ISO 15924 script boundaries in text.