|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectweka.core.Stopwords
public class Stopwords
Class that can test whether a given string is a stop word. Lowercases all words before the test.
The format for reading and writing is one word per line, lines starting with '#' are interpreted as comments and therefore skipped. The default stopwords are based on Rainbow. Accepts the following parameter: -i file
Constructor Summary | |
---|---|
Stopwords()
initializes the stopwords (based on Rainbow). |
Method Summary | |
---|---|
void |
add(java.lang.String word)
adds the given word to the stopword list (is automatically converted to lower case and trimmed) |
void |
clear()
removes all stopwords |
java.util.Enumeration |
elements()
Returns a sorted enumeration over all stored stopwords |
boolean |
is(java.lang.String word)
Returns true if the given string is a stop word. |
static boolean |
isStopword(java.lang.String str)
Returns true if the given string is a stop word. |
static void |
main(java.lang.String[] args)
Accepts the following parameter: -i file loads the stopwords from the given file -o file saves the stopwords to the given file -p outputs the current stopwords on stdout Any additional parameters are interpreted as words to test as stopwords. |
void |
read(java.io.BufferedReader reader)
Generates a new Stopwords object from the reader. |
void |
read(java.io.File file)
Generates a new Stopwords object from the given file |
void |
read(java.lang.String filename)
Generates a new Stopwords object from the given file |
boolean |
remove(java.lang.String word)
removes the word from the stopword list |
java.lang.String |
toString()
returns the current stopwords in a string |
void |
write(java.io.BufferedWriter writer)
Writes the current stopwords to the given writer. |
void |
write(java.io.File file)
Writes the current stopwords to the given file |
void |
write(java.lang.String filename)
Writes the current stopwords to the given file |
Methods inherited from class java.lang.Object |
---|
equals, getClass, hashCode, notify, notifyAll, wait, wait, wait |
Constructor Detail |
---|
public Stopwords()
Method Detail |
---|
public void clear()
public void add(java.lang.String word)
word
- the word to addpublic boolean remove(java.lang.String word)
word
- the word to remove
public boolean is(java.lang.String word)
word
- the word to test
public java.util.Enumeration elements()
public void read(java.lang.String filename) throws java.lang.Exception
filename
- the file to read the stopwords from
java.lang.Exception
- if reading failspublic void read(java.io.File file) throws java.lang.Exception
file
- the file to read the stopwords from
java.lang.Exception
- if reading failspublic void read(java.io.BufferedReader reader) throws java.lang.Exception
reader
- the reader to get the stopwords from
java.lang.Exception
- if reading failspublic void write(java.lang.String filename) throws java.lang.Exception
filename
- the file to write the stopwords to
java.lang.Exception
- if writing failspublic void write(java.io.File file) throws java.lang.Exception
file
- the file to write the stopwords to
java.lang.Exception
- if writing failspublic void write(java.io.BufferedWriter writer) throws java.lang.Exception
writer
- the writer to get the stopwords from
java.lang.Exception
- if writing failspublic java.lang.String toString()
toString
in class java.lang.Object
public static boolean isStopword(java.lang.String str)
str
- the word to test
public static void main(java.lang.String[] args) throws java.lang.Exception
args
- commandline parameters
java.lang.Exception
- if something goes wrong
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |