Email Data Sets
Overview
Because of privacy concerns, large and realistic email corpora are very hard to obtain. Here you can find a few email data sets, as well as a data set of newsgroup text, annotated with personal name spans. The email corpora given here were extracted from the Enron corpus, which was made public by the Federal Energy Regulatory Commission. As a second type of informal text, we also annotated a collection of newsgroup postings.
Stats
| Sources: | |
|---|---|
| Added by: | Infochimps |
| Collection: | Pete Skomoroch's Bookmarks |
| Link: | http://www.cs.cmu.edu/~einat/datasets.html |
| Created: | about 3 years ago |
| Updated: | 11 months ago |
