User
japerk
Jacob Perkins
Institution: Weotta
Bio: NLTK contributor, author of "Python Text Processing with NLTK 2.0 Cookbook", and co-founder/CTO of Weotta.
Uploaded datasets
3 datasets
Brown Simplified Tags Part-of-Speech Tagger for Python NLTK
Free Download: This download is a pre-trained model for a Bayesian classifier. If you do not have experience with Python NLTK, you may not be interested in this data set. A 98.1% accurate simplified-tags part-of-speech tagger trained on the Brown corpus. It requires Python and NLTK 2.0 and is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported ...
Chinese Part-of-Speech Tagger for Python NLTK
Free Download: This download is a pre-trained model for a Bayesian classifier. If you do not have experience with Python NLTK, you may not be interested in this data set. A 98.3% accurate Chinese part-of-speech tagger trained on the sinica_treebank corpus. It requires Python and NLTK 2.0 and is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License: ...
Fast Treebank Part-of-Speech Tagger for Python NLTK
Free Download: This download is a pre-trained model for a Bayesian classifier. If you do not have experience with Python NLTK, you may not be interested in this data set. A 99.3% accurate part-of-speech tagger trained on the treebank corpus. It is many times faster than the default NLTK tagger and is a fraction of the size (which means less loading time and lower memory ...