You are browsing a read-only backup copy of Wikitech. The live site can be found at wikitech.wikimedia.org

Difference between revisions of "Metrics Platform/Demos"

From Wikitech-static
Jump to navigation Jump to search
imported>DAbad
m
imported>Ldelench
(added demo recording, added some notes, moved to previous sessions)
Line 17: Line 17:
<div style="border:solid 1px #000000; padding:1em; background:#FBEEE6; font-size:1.2em; margin-top:1em;">
<div style="border:solid 1px #000000; padding:1em; background:#FBEEE6; font-size:1.2em; margin-top:1em;">
==Next session==
==Next session==
=== '''2021-06-15''' ===
=== '''2021-06-23''' ===
Recording Link: [to be posted upon completion of demo]
Recording Link: [to be posted upon completion of demo]
 
{| class="wikitable sortable"
''Presenters:''
|+Agenda
* [Name] - [thing to be demo'd]
!Topic
!Type
!Objectives
!Presenter
!Links
|-
|topic TBD
|type TBD
|objectives TBD
|presenter TBD
|links TBD
|-
|
|
|
|
|
|-
|
|
|
|
|
|}




Line 30: Line 53:
==Previous sessions==
==Previous sessions==


=== '''To come''' ===
=== '''2021-06-09''' ===
[https://drive.google.com/file/d/12D99wJ1p9tF92323TLXxgy8c5wHzygwF/view Recording Link]
{| class="wikitable sortable"
|+Agenda
!Topic
!Type
!Objectives
!Presenter
!Links
|-
|Metrics Schema Query Testing
|Demo
|Demonstrate how to generate queries in Superset using the new Metrics Platform Schema.
|Jason L.
|[https://superset.wikimedia.org/superset/sqllab/?savedQueryId=256 Sample Superset Data]
|-
|Instrumentation Workflows
|Demo
|Showcase the potential instrumentation workflows and solicit feedback.
|Desiree A.
|
|-
|What is Next
|FAQ
|We will be focused on completing centralization of repos and handling of bespoke datasets.
|
|
|}
==== Notes/Q&A ====
* Next steps/followup items:
# Setting up time with Product Analytics to start walking through this, giving them access to test it and generate feedback. This is not a "hard commit" but we do need something we can poke holes in, versus talking about intangibles. 
# Talking about sanitization by fields with data engineering & adding to our backlog.
# Drafting documentation & communications.
# Looking at potential other events we would want to add as samples. We have a short list that Mikhail has helped inform, and will also touch base with Maya on this.


* With bespoke data, is it possible to add new content to JSON blobs or bespoke dimensions map without changing the schema?
Yes - that's the goal. Must have column, must have type, type can't change, but we want the values to change & make it structured. So in the string case, it's really a piece of structured data that's been serialized into a string. To work on it as a piece of structured data, you would un-serialize the string.
<br>
In cases with dimensions and measures, where you've chosen a type in advance, there's flexibility with adding new keys and changing properties. There's an essential amount of freedom to vary the content. With bespoke data we would have standard fields, and a "flexibility area" which allows us to do things engineering-wise that we can't do today. All of Metrics Platform's events, even though they're structured differently and we have different quality controls/processes in place, we fundamentally have the same shape/same schema. That allows us to do nice things with data integration, and allows for a sandbox for "non-standardized" data.
<br>
In other words: changes to what bespoke data that instruments collect will not require event schema changes.
* Lauren to add remaining Q&A by end of week.
</div>
</div>
</div>
</div>

Revision as of 22:10, 9 June 2021

Metrics Platform Demos

About

Biweekly demo time to show Metrics Platform developments/works in progress and gather feedback.

Recordings and Presenters

Note: You'll need to be signed in with your WMF account to view these videos.

Next session

2021-06-23

Recording Link: [to be posted upon completion of demo]

Agenda
Topic Type Objectives Presenter Links
topic TBD type TBD objectives TBD presenter TBD links TBD


Previous sessions

2021-06-09

Recording Link

Agenda
Topic Type Objectives Presenter Links
Metrics Schema Query Testing Demo Demonstrate how to generate queries in Superset using the new Metrics Platform Schema. Jason L. Sample Superset Data
Instrumentation Workflows Demo Showcase the potential instrumentation workflows and solicit feedback. Desiree A.
What is Next FAQ We will be focused on completing centralization of repos and handling of bespoke datasets.

Notes/Q&A

  • Next steps/followup items:
  1. Setting up time with Product Analytics to start walking through this, giving them access to test it and generate feedback. This is not a "hard commit" but we do need something we can poke holes in, versus talking about intangibles.
  2. Talking about sanitization by fields with data engineering & adding to our backlog.
  3. Drafting documentation & communications.
  4. Looking at potential other events we would want to add as samples. We have a short list that Mikhail has helped inform, and will also touch base with Maya on this.
  • With bespoke data, is it possible to add new content to JSON blobs or bespoke dimensions map without changing the schema?

Yes - that's the goal. Must have column, must have type, type can't change, but we want the values to change & make it structured. So in the string case, it's really a piece of structured data that's been serialized into a string. To work on it as a piece of structured data, you would un-serialize the string.
In cases with dimensions and measures, where you've chosen a type in advance, there's flexibility with adding new keys and changing properties. There's an essential amount of freedom to vary the content. With bespoke data we would have standard fields, and a "flexibility area" which allows us to do things engineering-wise that we can't do today. All of Metrics Platform's events, even though they're structured differently and we have different quality controls/processes in place, we fundamentally have the same shape/same schema. That allows us to do nice things with data integration, and allows for a sandbox for "non-standardized" data.
In other words: changes to what bespoke data that instruments collect will not require event schema changes.

  • Lauren to add remaining Q&A by end of week.