org.jbox.textCutter.EN
Class SimpleENCutter

java.lang.Object
  extended by org.jbox.textCutter.AbstractCutter
      extended by org.jbox.textCutter.EN.SimpleENCutter
All Implemented Interfaces:
Cutter

public class SimpleENCutter
extends AbstractCutter

A concrete Cutter implementation for English.

SimpleENCutter cuts English text into words. It uses an algorithm similar to the one in the Apache open source project "Lucene" to reduce each English word to its base form (stem).
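
The following is a minimal, self-contained sketch of the general idea (split text into words, then reduce each word to a base form). It is not the SimpleENCutter implementation: the class name ToyEnglishCutter and its stem helper are hypothetical, and the suffix rules are deliberately naive rather than the Porter-style stemming that Lucene provides.

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.List;
import java.util.Locale;

// Hypothetical stand-in for illustration only; NOT the library's SimpleENCutter.
class ToyEnglishCutter {

    // Splits on any run of non-ASCII-letter characters, lowercases each token,
    // and applies a naive suffix-stripping step.
    static Collection<String> cutSentenceToWord(String sentence) {
        List<String> words = new ArrayList<>();
        for (String token : sentence.split("[^A-Za-z]+")) {
            if (token.isEmpty()) {
                continue; // split() can yield an empty leading token
            }
            words.add(stem(token.toLowerCase(Locale.ROOT)));
        }
        return words;
    }

    // Assumed toy rules: handles only a few common suffixes and will
    // mis-stem many real words (for example, irregular plurals).
    static String stem(String word) {
        int len = word.length();
        if (len > 4 && word.endsWith("ies")) {
            return word.substring(0, len - 3) + "y";
        }
        if (len > 5 && word.endsWith("ing")) {
            return word.substring(0, len - 3);
        }
        if (len > 4 && word.endsWith("ed")) {
            return word.substring(0, len - 2);
        }
        if (len > 3 && word.endsWith("s") && !word.endsWith("ss")) {
            return word.substring(0, len - 1);
        }
        return word;
    }

    public static void main(String[] args) {
        // prints [the, cat, jump, over, wall]
        System.out.println(cutSentenceToWord("The cats jumped over walls"));
    }
}
```

The real class may tokenize and stem differently; the sketch only shows the shape of the transformation described above.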

Version:
1.0
Author:
YiBin.H
See Also:
CutterBox, LanguageFilter

Field Summary
 
Fields inherited from class org.jbox.textCutter.AbstractCutter
langFilter
 
Constructor Summary
SimpleENCutter()
           
 
Method Summary
 java.util.Collection<java.lang.String> cutSentenceToWord(java.lang.String sentence)
          Cut text into words.
 
Methods inherited from class org.jbox.textCutter.AbstractCutter
cutSentenceToWord, setUnicode
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

SimpleENCutter

public SimpleENCutter()
Method Detail

cutSentenceToWord

public java.util.Collection<java.lang.String> cutSentenceToWord(java.lang.String sentence)
Description copied from class: AbstractCutter
Cut text into words.

Specified by:
cutSentenceToWord in class AbstractCutter
Parameters:
sentence - text whose characters fall within the Unicode range handled by this Cutter.
Returns:
the words extracted from the text.
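
Because the method returns a Collection of String, a caller can iterate over the result directly. The sketch below counts word occurrences. To stay self-contained it declares a hypothetical SentenceCutter interface that mirrors only the documented method signature and uses a whitespace-splitting stand-in in place of the library class. With the library on the classpath, an instance created with new SimpleENCutter() would presumably be used in the same position.

```java
import java.util.Arrays;
import java.util.Collection;
import java.util.Map;
import java.util.TreeMap;

class CutterUsageSketch {

    // Hypothetical interface: mirrors the documented signature of
    // cutSentenceToWord, but is not part of org.jbox.textCutter.
    interface SentenceCutter {
        Collection<String> cutSentenceToWord(String sentence);
    }

    // Counts how often each returned word occurs; TreeMap keeps keys sorted.
    static Map<String, Integer> countWords(SentenceCutter cutter, String sentence) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String word : cutter.cutSentenceToWord(sentence)) {
            counts.merge(word, 1, Integer::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        // Stand-in cutter (assumption): plain whitespace split, no stemming.
        SentenceCutter standIn = s -> Arrays.asList(s.trim().split("\\s+"));
        // prints {be=2, not=1, or=1, to=2}
        System.out.println(countWords(standIn, "to be or not to be"));
    }
}
```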