Dataset

Brown Simplifed Tags Part-of-Speech Tagger for Python NLTK

Added By japerk

This data download is a pre-trained model for a Bayesian classifier. If you do not have experience with Python NLTK, you may not be interested in this data set.

A 98.1% accurate simplified tags part-of-speech tagger trained on the brown corpus. It requires Python and NLTK 2.0 and is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License: http://creativecommons.org/licenses/by-nc-sa/3.0/. In addition to punctuation tags, it produces the following tags:

ADJ
ADV
CNJ
DET
EX
FW
MOD
N
NIL
NP
NUM
P
PRO
TO
UH
V
VB+AT
VB+IN
VB+JJ
VB+PPO
VB+RP
VB+TO
VB+VB
VBG+TO
VBN+TO
VBZ
VD
VG
VN
WH