Wikimedia Product/Data dictionary/virtualpageviews_hourly
This page describes the data set virtualpageviews_hourly, which is stored as a Druid datasource and can be accessed via Superset or Turnilo. virtualpageviews_hourly on Druid is loaded directly from wmf.virtualpageview_hourly on Hive, which in turn is extracted from event.VirtualPageView.
Schema
Field name | data type | description | data example | source schema | source field |
---|---|---|---|---|---|
project | string | Project name from hostname | hu.wikipedia | event.VirtualPageView | event.source_url |
agent_type | string | Agent accessing the pages, can be spider or user | user | event.VirtualPageView | IF(useragent.is_bot, 'spider', 'user') |
ua_browser_family | string | Name of web browser (if not using an official Wikipedia mobile app), extracted from the client device's User-Agent | Chrome | event.VirtualPageView | user_agent_map['browser_family'] |
ua_device_family | string | Client device family (e.g. brand of manufacturer, product name), extracted from the client device's User-Agent if provided | Mac | event.VirtualPageView | user_agent_map['device_family'] |
city | string | City of the accessing agents (maxmind GeoIP database) | Springfield | event.VirtualPageView | geocoded_data['city'] |
subdivision | string | Subdivision of the accessing agents (maxmind GeoIP database) | Ohio | event.VirtualPageView | geocoded_data['subdivision'] |
ua_wmf_app_version | string | Version of official Wikipedia mobile app (for iOS, Android, and KaiOS), extracted from the client device's User-Agent | - | event.VirtualPageView | user_agent_map['wmf_app_version'] |
country | string | Country (text) of the accessing agents (maxmind GeoIP database) | Hungary | event.VirtualPageView | geocoded_data['country'] |
country_code | string | Country iso code of the accessing agents (maxmind GeoIP database) | HU | event.VirtualPageView | geocoded_data['country_code'] |
ua_os_major | string | Major version of the operating system, extracted from the client device's User-Agent | 10 | event.VirtualPageView | user_agent_map['os_major'] |
continent | string | Continent of the accessing agents (maxmind GeoIP database) | Europe | event.VirtualPageView | geocoded_data['continent'] |
ua_os_family | string | Operating System family used by the client device, extracted from the User-Agent | Windows 10 | event.VirtualPageView | user_agent_map['os_family'] |
language_variant | string | Language variant from path (not set if present in project name) | 2017-es_Eurov%C3%ADzi%C3%B3s_Dalfesztiv%C3%A1l | event.VirtualPageView | event.source_url |
ua_os_minor | string | Minor version of the operating system, extracted from the client device's User-Agent | 13 | event.VirtualPageView | user_agent_map['os_minor'] |
referer_class | string | Always internal (virtualpageviews are always shown in wiki pages) | internal | | 'internal' |
access_method | string | Always desktop (virtualpageviews are a desktop-only feature for now) | desktop | event.VirtualPageView | event.access_method, 'desktop' |
ua_browser_major | string | Major version of the client browser, extracted from the client device's User-Agent | 64 | event.VirtualPageView | user_agent_map['browser_major'] |
view_count | bigint | Number of views | 1 | event.VirtualPageView | count(*) |
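
The source-field expressions for agent_type and view_count in the table above can be illustrated with a small Python sketch. This is not the actual load job: the input records, the `is_bot` key, and the choice of grouping dimensions (project, agent_type, country_code) are assumptions made for illustration only. It mirrors `IF(useragent.is_bot, 'spider', 'user')` and treats view_count as a `count(*)` over events that share the same dimension values.

```python
from collections import Counter


def agent_type(is_bot: bool) -> str:
    """Mirror of the table's IF(useragent.is_bot, 'spider', 'user') expression."""
    return "spider" if is_bot else "user"


def aggregate_views(events):
    """Count events per dimension tuple, analogous to count(*) with GROUP BY.

    The grouping key used here (project, agent_type, country_code) is a
    hypothetical subset of the dimensions listed in the schema.
    """
    counts = Counter()
    for event in events:
        key = (event["project"], agent_type(event["is_bot"]), event["country_code"])
        counts[key] += 1
    return counts


# Hypothetical toy events; real rows come from event.VirtualPageView.
events = [
    {"project": "hu.wikipedia", "is_bot": False, "country_code": "HU"},
    {"project": "hu.wikipedia", "is_bot": False, "country_code": "HU"},
    {"project": "hu.wikipedia", "is_bot": True, "country_code": "HU"},
]

view_counts = aggregate_views(events)
for dimensions, view_count in sorted(view_counts.items()):
    print(dimensions, view_count)
```

Each distinct combination of dimension values becomes one row, and its view_count is the number of underlying events with that combination.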