|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectorg.jbox.textCutter.CutterBox
public class CutterBox
Container of Cutter.
Cutter
,
NoiseFilter
,
LanguageFilter
Constructor Summary | |
---|---|
CutterBox()
Constructs a new CutterBox. |
Method Summary | |
---|---|
void |
addCutter(Cutter c)
Add a Cutter into CutterBox. |
static java.lang.String[] |
cutArticleToSentence(java.lang.String article)
Static method for splitting text into Sentences by "." or "ĄŁ" |
void |
cutPage(Page p)
Cut text of Page object into words, calculate the TF of
Word , and stored the words in the Page object. |
java.util.Collection<java.lang.String> |
cutText(java.lang.String text)
Cut text into words. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public CutterBox()
Method Detail |
---|
public void addCutter(Cutter c)
Cutter
into CutterBox.
c
- a Cutter
object.public void cutPage(Page p)
Page
object into words, calculate the TF of
Word
, and stored the words in the Page
object. All words
in the text defined in noise file will not be stored.
p
- Page
object contain the text to be cut.public java.util.Collection<java.lang.String> cutText(java.lang.String text)
text
- text to be cut.
public static java.lang.String[] cutArticleToSentence(java.lang.String article)
article
- text to be cut.
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |