Wikimedia Product/Data dictionary
Definitions for core and other essential metrics have been moved to a separate Data Glossary. For documentation of the datasets behind these derived tables, and the pipelines that generate that data, see the Data Platform docs on Wikitech. |
Druid Data Tables in Superset/Turnilo
edit- edit_hourly
- Table contains edits data, aggregated hourly.
- mediawiki_history_reduced
- A light version of mediawiki_history. Table only contains data of revisions which is not deleted by page deletion.
- virtualpageviews_hourly
- Table contains virtual pageviews data, aggregated hourly.
- pageviews_hourly
- Table contains pageviews data, aggregated hourly.
- pageviews_daily
- Table contains pageviews data, aggregated daily.
- unique_devices_per_domain_daily
- Table contains unique devices counts per domain, aggregated daily.
- unique_devices_per_domain_monthly
- Table contains unique devices counts per domain, aggregated monthly.
- unique_devices_per_project_family_daily
- Table contains unique devices counts per project family, aggregated daily.
- unique_devices_per_project_family_monthly
- Table contains unique devices counts per project family, aggregated monthly.
- mediawiki_geoeditors_monthly
- Table contains private data of editors counts by country region, aggregated monthly.
Hive Tables in Superset
edit- session_length_daily
- Table contains session length and session counts data, aggregated daily.
- content_interactions
- Table contains interactions data, aggregated monthly.
- active_editors
- Table contains active editors data, aggregated monthly.
- content_edit_daily
- Table contains edit topic data, aggregated daily.
- content_pv
- Table contains pageview topic data, aggregated daily.
References
editReconcile datasets in Superset with Key Product Metrics, documents differences between data available for exploration in Superset and our monthly Key Product Metrics