Brown Simplifed Tags Part-of-Speech Tagger for Python NLTK
Overview
This data download is a pre-trained model for a Bayesian classifier. If you do not have experience with Python NLTK, you may not be interested in this data set.
A 98.1% accurate simplified tags part-of-speech tagger trained on the brown corpus. It requires Python and NLTK 2.0 and is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License: http://creativecommons.org/licenses/by-nc-sa/3.0/. In addition to punctuation tags, it produces the following tags:
ADJ
ADV
CNJ
DET
EX
FW
MOD
N
NIL
NP
NUM
P
PRO
TO
UH
V
VB+AT
VB+IN
VB+JJ
VB+PPO
VB+RP
VB+TO
VB+VB
VBG+TO
VBN+TO
VBZ
VD
VG
VN
WH
Application Gallery
Do you have an application, visualization or otherwise great use of this data?
Submit it now, and be featured here!
Infochimps Platform
Use this data on the Infochimps Big Data Platform to unlock:
- Advanced analytical capabilities
- Hosting for customer databases
- Access to tools such as Hadoop, Pig, and R
- …and more to come!
Tags
Stats
| Added by: | japerk | |
|---|---|---|
| Link: | ||
| Created: | 10 months ago | |
| Updated: | 26 days ago | |
Share
