You are browsing a read-only backup copy of Wikitech. The primary site can be found at wikitech.wikimedia.org
Analytics is the systematic computational analysis of data or statistics, for the purposes of discovery, interpretation, and communication of meaningful patterns.
These Wikitech pages below the Analytics path are intended to be reference documentation for users of these systems.
The Data Engineering team has responsibility for managing the Analytics Cluster and the Data Lake, so some pages regarding custer operations and data governance etc. will be found under that path.
The Analytics Cluster comprises a number of different systems geared to help researchers, data scientists, machine learning engineers and other authorized parties to access the data lake.
If you believe that you need access to the cluster, please refer to Analytics/Data access
Many of these datasets are managed by the Data Engineering team with pipelines deployed to production and monitored.