Dumps/SQL-XML Dumps
Documentation for end users of the XML/SQL dumps can be found on meta. If you're a Toolforge user and want to use the dumps, check out Help:Shared storage for information on where to find the files.
Current Info
- The SQL/XML dumps are managed by the file mediawiki_sql_xml_dumps.py, which creates a number of DAGs in the test-k8s instance of Airflow.
- These DAGs can be monitored and managed here: https://airflow-test-k8s.wikimedia.org/home?tags=dumps-xml-sql
- For current dumps issues, see the Dumps-generation project in Phabricator.
- For information about the WikiTeam initiative to upload these dumps to the Internet Archive, see the Nova Resource:Dumps project.
Older Info
- For information about the initiative to upload these dumps to the Internet Archive, see the Nova Resource:Dumps project.
- For historical information about the dumps, see Dumps/History.
- For a list of various information sources about the dumps, see Dumps/Other information sources.
Setup
Current architecture
See Dumps/Airflow for details of the architecture.
Rerunning dumps
See Dumps/Rerunning a job if you need to rerun a wiki/job.