User

japerk

Jacob Perkins
Website:

http://streamhacker.com

Institution:

Weotta

Bio:

NLTK contributor, author of "Python Text Processing with NLTK 2.0 Cookbook", and co-founder/CTO of Weotta.

Uploaded datasets

3 datasets
  • Brown Simplified Tags Part-of-Speech Tagger for Python NLTK

    Free Download — This data download is a pre-trained model for a Bayesian classifier. If you do not have experience with Python NLTK, you may not be interested in this data set. A 98.1% accurate simplified tags part-of-speech tagger trained on the brown corpus. It requires Python and NLTK 2.0 and is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported ...
  • Chinese Part-of-Speech Tagger for Python NLTK

    Free Download — This data download is a pre-trained model for a Bayesian classifier. If you do not have experience with Python NLTK, you may not be interested in this data set. A 98.3% accurate Chinese part-of-speech tagger trained on the sinica_treebank corpus. It requires Python & NLTK 2.0 and is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License: ...
  • Fast Treebank Part-of-Speech Tagger for Python NLTK

    Free Download — This data download is a pre-trained model for a Bayesian classifier. If you do not have experience with Python NLTK, you may not be interested in this data set. A 99.3% accurate part-of-speech tagger trained on the treebank corpus. It is many times faster than the default NLTK tagger and is a fraction of the size (which means less loading time and lower memory ...