The website is in Maintenance mode. We are in the process of adding more features.
Any new bookmarks, comments, or user profiles made during this time will not be saved.

Machine Learning Resources

In what cases (and why) does using Binary Occurrence instead of TF-IDF makes more sense? 

Bookmark this question

If the vocabulary size is small, and the binary occurrence of given words are strong features in differentiating between document classes, simply using dummy variables to indicate the presence of given words could be a viable and less computationally intensive approach to text classification.

Leave your Comments and Suggestions below:

Please Login or Sign Up to leave a comment

Partner Ad  

Find out all the ways
that you can

Explore Questions by Topics

Partner Ad

Learn Data Science with Travis - your AI-powered tutor |