Quantcast
Channel: Pentaho Community Forums
Viewing all articles
Browse latest Browse all 16689

Ratio of positive and negative text in learning dataset for text classification?

$
0
0
Hi,
I am using WEKA for the text classification task. I have data (few thousands articles) to classify in positive or negative class.
In learning dataset I have 200 articles (12 positive and 188 negative) and with this ratio the result is not good.
My question is:
"What ratio of positive and negative articles in learning dataset will be perfect for the accuracy?"
Any rule, suggestion etc.

Thanks in anticipation!
Regards/
Sardar

Viewing all articles
Browse latest Browse all 16689

Trending Articles