Mahout has an example of using naive bayes to classify 20 news group. but
how to just classify paragraphs (e.g. twitter message, movie review) in
text files such as:
Text files has content like:
text paragraph 1 class a
text paragraph 2 class b
text paragraph 3 class a
text paragraph 4 class b
............. ...
does it support n grams, stem, stop words, etc?
thanks for any suggestions.
how to just classify paragraphs (e.g. twitter message, movie review) in
text files such as:
Text files has content like:
text paragraph 1 class a
text paragraph 2 class b
text paragraph 3 class a
text paragraph 4 class b
............. ...
does it support n grams, stem, stop words, etc?
thanks for any suggestions.