[Wikitech-l] page counters
Overview
This presents a kind of ‘what pages are visited’ statistics. It is applied to a squid access-log stream and redirected to profiling agent (webstatscollector) then the hourly snapshots are written in very trivial format. This can be used to both noticing strange activities, as well as spotting trends (specific events show up really nicely), let it be a movie premiere, a national holiday or any scandal.
A normal snapshot contains ~3.5M page titles and extracted is over 100MB. Entries inside are grouped by project, and in semi-alphabetic order.
Application Gallery
Do you have an application, visualization or otherwise great use of this data?
Submit it now, and be featured here!
Visit Source
Infochimps Platform
Use this data on the Infochimps Big Data Platform to unlock:
- Advanced analytical capabilities
- Hosting for customer databases
- Access to tools such as Hadoop, Pig, and R
- …and more to come!
Learn More »
Tags
Stats
| Sources: | ||
|---|---|---|
| Added by: | Infochimps | |
| Collection: | Pete Skomoroch's Bookmarks | |
| Link: | http://lists.wikimedia.org/pipermail/wikitech-l/200[ ... ]December/035435.html | |
| Created: | about 3 years ago | |
| Updated: | 11 months ago | |
Share
