Dataset
Wikipedia XML Data
This data set contains a complete copy of all Wikimedia wikis, in the form of wikitext source and metadata embedded in XML as provided by the Wikimedia Foundation.
The data set will be updated monthly, and the snapshots from the three previous months will always remain available for use. Previous snapshots will be listed in the text of this description.
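
Because the wikitext and metadata are embedded in XML, a snapshot can be read page by page with a streaming parser instead of being loaded whole. The following is a minimal sketch, not part of the data set itself. It assumes the files follow the MediaWiki XML export layout (`<page>` elements containing `<title>` and `<revision>`/`<text>`); the function names `iter_pages` and `local_name`, the inline sample document, and the namespace URI in that sample are illustrative assumptions.

```python
import io
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def iter_pages(stream):
    """Yield (title, wikitext) pairs from a MediaWiki-style XML export stream.

    Assumes each <page> has a <title> and at least one <revision>/<text>.
    If a page carries several revisions, the text of the last one is returned.
    """
    for _event, elem in ET.iterparse(stream, events=("end",)):
        if local_name(elem.tag) != "page":
            continue
        title = None
        text = None
        for child in elem.iter():
            name = local_name(child.tag)
            if name == "title":
                title = child.text
            elif name == "text":
                text = child.text or ""
        yield title, text
        # Drop the page's child elements so memory use stays small.
        elem.clear()


# Tiny hand-written sample standing in for a real dump file (assumed layout).
SAMPLE = b"""<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.10/">
  <page>
    <title>Example</title>
    <revision>
      <id>1</id>
      <text>Hello [[world]]</text>
    </revision>
  </page>
</mediawiki>"""

for page_title, wikitext in iter_pages(io.BytesIO(SAMPLE)):
    print(page_title, len(wikitext))
```

The sketch ignores the namespace URI rather than matching it, so it should not depend on a particular export schema version. For a real snapshot, pass an open binary file object in place of the `io.BytesIO` sample; if the files are compressed, a decompressing file object such as the one returned by `bz2.open` can be passed in the same way.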