This page links to detailed information about Edits datasets in the Data Lake.
In comparison to the traffic ones, those datasets are not continuously updated. They are regularly updated by fully re-importing/re-building them, creating a new
snapshot notion is key when querying the Edits datasets, since inclufing multiple snapshots doesn't sense for most queries. As of 2017-04, snapshots are provided monthly.
Mediawiki raw data
Those are copy of mediawiki MySQL tables
- Mediawiki user history -- Dataset providing reconstructed history events of mediawiki users
- Mediawiki page history -- Dataset providing reconstructed history events of mediawiki pages
- Mediawiki history -- Fully denormalized dataset containing user, page and revision processed data
- Metrics -- Dataset providing precomputed metrics over edits data