You are browsing a read-only backup copy of Wikitech. The primary site can be found at wikitech.wikimedia.org

Analytics/Data Lake/Schemas/Metric results: Difference between revisions

From Wikitech-static
Jump to navigation Jump to search
imported>Joal
(Update for first internal productionisation.)
imported>MarcoAurelio
m (Bot: Fixing double redirect to Analytics/Data Lake/Edits/Metrics)
 
(One intermediate revision by one other user not shown)
Line 1: Line 1:
=Overview=
#REDIRECT [[Analytics/Data Lake/Edits/Metrics]]
 
This table stores metric computed over the [[Analytics/Data Lake/Schemas/Mediawiki history|denormalized mediawiki history]] dataset. It is [https://cwiki.apache.org/confluence/display/Hive/Tutorial#Tutorial-DataUnits partitioned] by wiki_db and metric name to facilitate using its data outside of Hive, namely for display in Dashiki.
 
=Schema=
<syntaxhighlight>
 
col_name data_type comment
dt                  string              The date of this measurement, as YYYY-MM-DD
value              bigint              The measurement   
snapshot            string              Versioning information to keep multiple datasets (YYYY-MM for regular labs imports)
metric              string              The metric being computed to measure
wiki_db            string              The wiki this measurement pertains to
# Partition Information
# col_name            data_type          comment           
snapshot            string              Versioning information to keep multiple datasets (YYYY-MM for regular labs imports)
metric              string              The metric being computed to measure
wiki_db            string              The wiki this measurement pertains to
 
</syntaxhighlight>

Latest revision as of 19:01, 13 July 2017