Topic on Extension talk:WikibaseLexeme/RDF mapping

Number of statements in data nodes

2
Lucas Werkmeister (WMDE) (talkcontribs)

FYI, in T195387 I added support for writing the number of statements to the page_props table, which means it will probably also end up in the query service. Since the granularity of page data is the page, not the entity, I decided to count all the statements of the page there, including the statements on forms and senses. This might be a bit awkward on the query service, especially if we merge data nodes and entities as we currently do for items. Given a lexeme like

wd:L64723 wdt:P2 wd:Q3;
          ontolex:lexicalForm wd:L64723-F1;
          ontolex:sense wd:L64723-FS1.
wd:L64723-F1 wdt:P2 wd:Q4.
wd:L64723-S1 wdt:P2 wd:Q5.

you would get

wd:L64723 wikibase:statements 3.

instead of, as might be expected,

wd:L64723 wikibase:statements 1.
wd:L64723-F1 wikibase:statements 1.
wd:L64723-S1 wikibase:statements 1.

Do you think that’s acceptable?

Smalyshev (WMF) (talkcontribs)

Given that we don't have other pagedata on forms, I think it is. Also, these markers are most useable as kind of gauge of item quality and such. Since the real "item" here is the Lexeme, and Form & Sense are just sub-structures that exist only in context of Lexeme and only have IDs for technical reasons, I think it is ok.

There is the use case where you may want to see Forms/Senses without statements, but it's easy to check for with SPARQL.

Reply to "Number of statements in data nodes"