Interface Summary | |
---|---|
Cutter | The root interface for text cutters. |
Class Summary | |
---|---|
AbstractCutter | An abstract class that defines the default behavior of Cutter. |
CutterBox | Container of Cutter. |
This package defines APIs for cutting text into words.
Note that there is no guarantee that the text passed in will actually be processed.
CutterBox uses NoiseFilter to filter out noise words. All noise words must be defined in files in the directory "DICT/NOISE/". For example, if the word "fool" needs to be filtered, add it to an existing file in "DICT/NOISE/", or put it in a new file there, such as "myNoise.txt". The word "fool" will then be ignored when text is cut. No separate call is needed to filter the text; CutterBox applies the filter when cutPage(Page) is called. The construction of CutterBox is like below:
Note that each word must be written on its own line. If two words are written on
one line, for example "fool fun", the line is treated as a single word.
Note also that the directory "DICT/NOISE/" must exist, even if no noise words are
defined.
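The file rules above (one word per line, a whole line treated as a single word, a required directory) can be illustrated with a small standalone sketch. This is not the package's NoiseFilter and does not reproduce its implementation: the class `NoiseDirectorySketch` and the method `loadNoiseWords` are hypothetical names. UTF-8 encoding, trimming of surrounding whitespace, and skipping blank lines are assumptions made only for the illustration.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.stream.Stream;

/** Hypothetical helper for illustration only; not part of this package. */
public class NoiseDirectorySketch {

    /**
     * Reads every regular file in noiseDir. Each non-blank line becomes
     * exactly one noise word, so a line such as "fool fun" is one entry.
     */
    public static Set<String> loadNoiseWords(Path noiseDir) throws IOException {
        // Mirror the documented requirement: the directory must exist,
        // even when it contains no noise words.
        if (!Files.isDirectory(noiseDir)) {
            throw new IOException("Noise directory is required: " + noiseDir);
        }
        Set<String> words = new HashSet<>();
        try (Stream<Path> files = Files.list(noiseDir)) {
            for (Path file : (Iterable<Path>) files::iterator) {
                if (!Files.isRegularFile(file)) {
                    continue;
                }
                // Assumption: files are UTF-8 encoded.
                for (String line : Files.readAllLines(file, StandardCharsets.UTF_8)) {
                    String word = line.trim(); // assumption: surrounding whitespace is ignored
                    if (!word.isEmpty()) {
                        words.add(word); // the whole line is a single word
                    }
                }
            }
        }
        return words;
    }

    public static void main(String[] args) throws IOException {
        // A temporary directory stands in for "DICT/NOISE/".
        Path dir = Files.createTempDirectory("noise");
        Files.write(dir.resolve("myNoise.txt"), List.of("fool", "fool fun"));

        Set<String> words = loadNoiseWords(dir);
        System.out.println(words.contains("fool"));     // true
        System.out.println(words.contains("fun"));      // false: "fool fun" is one entry
        System.out.println(words.contains("fool fun")); // true
    }
}
```

In this sketch an empty directory simply yields an empty set, while a missing directory is reported as an error, which matches the note that "DICT/NOISE/" is needed even when no noise word is defined.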