• + 0 comments

    I'm using a bag of words model in sklearn, and my problem seems to be that transforming the bag of words from a sparse matrix to a dataframe takes to long. But CountVectorizer can only create sparse matrices, so I don't see a way around this, nor do I know of a simpler model for modeling text. Any suggestions?