You are browsing a read-only backup copy of Wikitech. The live site can be found at

Metrics Platform/Demos

From Wikitech-static
< Metrics Platform
Revision as of 22:10, 9 June 2021 by imported>Ldelench (added demo recording, added some notes, moved to previous sessions)
Jump to navigation Jump to search

Metrics Platform Demos


Biweekly demo time to show Metrics Platform developments/works in progress and gather feedback.

Recordings and Presenters

Note: You'll need to be signed in with your WMF account to view these videos.

Next session


Recording Link: [to be posted upon completion of demo]

Topic Type Objectives Presenter Links
topic TBD type TBD objectives TBD presenter TBD links TBD

Previous sessions


Recording Link

Topic Type Objectives Presenter Links
Metrics Schema Query Testing Demo Demonstrate how to generate queries in Superset using the new Metrics Platform Schema. Jason L. Sample Superset Data
Instrumentation Workflows Demo Showcase the potential instrumentation workflows and solicit feedback. Desiree A.
What is Next FAQ We will be focused on completing centralization of repos and handling of bespoke datasets.


  • Next steps/followup items:
  1. Setting up time with Product Analytics to start walking through this, giving them access to test it and generate feedback. This is not a "hard commit" but we do need something we can poke holes in, versus talking about intangibles.
  2. Talking about sanitization by fields with data engineering & adding to our backlog.
  3. Drafting documentation & communications.
  4. Looking at potential other events we would want to add as samples. We have a short list that Mikhail has helped inform, and will also touch base with Maya on this.
  • With bespoke data, is it possible to add new content to JSON blobs or bespoke dimensions map without changing the schema?

Yes - that's the goal. Must have column, must have type, type can't change, but we want the values to change & make it structured. So in the string case, it's really a piece of structured data that's been serialized into a string. To work on it as a piece of structured data, you would un-serialize the string.
In cases with dimensions and measures, where you've chosen a type in advance, there's flexibility with adding new keys and changing properties. There's an essential amount of freedom to vary the content. With bespoke data we would have standard fields, and a "flexibility area" which allows us to do things engineering-wise that we can't do today. All of Metrics Platform's events, even though they're structured differently and we have different quality controls/processes in place, we fundamentally have the same shape/same schema. That allows us to do nice things with data integration, and allows for a sandbox for "non-standardized" data.
In other words: changes to what bespoke data that instruments collect will not require event schema changes.

  • Lauren to add remaining Q&A by end of week.