Help:New filters for edit review/Quality and Intent Filters

Note: When you edit this page, you agree to release your contribution under CC0. See the public domain help pages for more information.

New filters for edit review introduces two filter groups—Contribution Quality and User Intent—that work differently from other edit-review filters. The filters in these groups offer probabilistic predictions about, respectively, whether or not edits are likely to contain problems and whether the users who made them were acting in good faith.

Additionally, the language-agnostic revert risk model, enabled in 2024, provides a prediction about how likely an edit is to require reverting.

Knowing a bit about how these unique tools work will help you use them more effectively.

These filters are available only on certain wikis.

Based on machine learning

The predictions that make the Quality and Intent filters possible are calculated by ORES, a machine learning program trained on a large set of edits previously scored by human editors. Machine learning is a powerful technology that lets machines replicate some limited aspects of human judgement.

The Quality and Intent filters are available only on wikis where the ORES “damaging” and “good faith” models are supported. The “damaging” model powers the Quality predictions, while the “good faith” model powers the Intent predictions.
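If you are curious about the raw numbers behind these filters, the same predictions can be requested directly. Below is a minimal sketch, assuming the public ORES v3 scoring API (Wikimedia is migrating ORES to Lift Wing, so the endpoint and response layout may change); the wiki context (“enwiki”) and the revision ID are placeholder values.

import requests

# Minimal sketch: fetch the "damaging" and "goodfaith" probabilities that power
# the Quality and Intent filters. Assumes the ORES v3 API; "enwiki" and the
# revision ID are placeholders, and the service is being migrated to Lift Wing.
ORES_URL = "https://ores.wikimedia.org/v3/scores/enwiki"

def score_revision(rev_id):
    resp = requests.get(
        ORES_URL,
        params={"models": "damaging|goodfaith", "revids": rev_id},
        timeout=10,
    )
    resp.raise_for_status()
    scores = resp.json()["enwiki"]["scores"][str(rev_id)]
    return {
        # Probability the edit is damaging / was made in good faith.
        "damaging": scores["damaging"]["score"]["probability"]["true"],
        "goodfaith": scores["goodfaith"]["score"]["probability"]["true"],
    }

print(score_revision(123456789))  # placeholder revision ID

The filter levels described on this page are, in effect, thresholds applied to probabilities like these.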

Enabling ORES requires volunteers to score edits on the relevant wiki. This page explains the process and how you can get it started on your wiki.

The language-agnostic revert risk model supports all language Wikipedias and does not require manual training by volunteers.
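For comparison, here is a similarly hedged sketch for the revert risk model, assuming the Lift Wing inference endpoint currently documented for it; the exact path, payload fields and response layout may differ, so check the model’s documentation before relying on them.

import requests

# Minimal sketch, assuming the Lift Wing endpoint documented for the
# language-agnostic revert risk model; path and payload are assumptions.
LIFTWING_URL = (
    "https://api.wikimedia.org/service/lw/inference/v1/"
    "models/revertrisk-language-agnostic:predict"
)

def revert_risk(rev_id, lang="en"):
    resp = requests.post(
        LIFTWING_URL,
        json={"rev_id": rev_id, "lang": lang},
        headers={"User-Agent": "patrol-example/0.1 (example only)"},
        timeout=10,
    )
    resp.raise_for_status()
    # Probability that the edit will need to be reverted.
    return resp.json()["output"]["probabilities"]["true"]

print(revert_risk(123456789))  # placeholder revision ID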

Choosing the right tool

Looking at the Quality, Intent, and Revert Risk filters, you may notice something different about them. Unlike filters in other groups, the various options don’t target different edit properties. Instead, many of them target the same property, but offer different levels of accuracy.

Why would anyone choose to use a tool that's less accurate? Because such accuracy can come at a cost.

Get more accurate results (higher ‘precision’)

 
This conceptual diagram illustrates how the Quality filters relate to one another on many wikis (performance varies).
As you can see, the Very likely have problems filter captures results composed almost entirely of problem edits (high precision). But it captures only a small portion of all problem edits (low recall). Notice how everything in Very likely have problems (and Likely have problems) is also included in the broader May have problems, which provides high recall but low precision (because it returns a high percentage of problem-free edits).
You may be surprised to see that May have problems overlaps with Very likely good. Both filters cover the indeterminate zone between problem and problem-free edits in order to catch more of their targets (broader recall).
For space reasons, the diagram doesn't accurately reflect scale.

The more “accurate” filters on the menu return a higher percentage of correct versus incorrect predictions and, consequently, fewer false positives. (In the lingo of pattern recognition, these filters have a higher “precision”.) They achieve this accuracy by being narrower, stricter. When searching, they set a higher bar for probability. The downside of this is that they return a smaller percentage of their target.

Example: The Very likely have problems filter is the most accurate of the Quality filters. Performance varies from wiki to wiki, but on English Wikipedia its predictions are right more than 90% of the time. The tradeoff is that this filter finds only about 10% of all the problem edits in a given set, because it passes over problems that are harder to detect. The problems this filter finds will often include obvious vandalism.
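To make “precision” and “recall” concrete, here is a toy calculation with made-up counts chosen to roughly match the figures above; the numbers are purely illustrative, not measured wiki statistics.

# Toy example: counts invented to roughly match the figures quoted above.
flagged = 100              # edits returned by "Very likely have problems"
true_problems_flagged = 92 # flagged edits that really do have problems
all_problem_edits = 900    # problem edits in the whole set being reviewed

precision = true_problems_flagged / flagged          # ~0.9: few false positives
recall = true_problems_flagged / all_problem_edits   # ~0.1: most problems missed
print(precision, recall)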

Find more of your target (higher ‘recall’)

If your priority is finding all or most of your target, then you’ll want a broader, less accurate filter. These find more of what they’re looking for by setting the bar for probability lower. The tradeoff here is that they return more false positives. (In technical parlance, these filters have higher “recall”, defined as the percentage of the stuff you’re looking for that your query actually finds.)

Example: The May have problems filter is the broadest Quality filter. Performance varies on different wikis, but on English Wikipedia it catches about 82% of problem edits. On the downside, this filter is right only about 15% of the time.
If 15% doesn’t sound very helpful, consider that problem edits actually occur at a rate of fewer than 5 in 100—or 5%. So 15% is a 3x boost over random. And of course, patrollers don’t sample randomly; they’re skilled at using various tools and clues to increase their hit rates. Combined with those techniques, May have problems provides a significant edge.
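The “3x boost” works out like this (again, just arithmetic on the rounded figures quoted above):

base_rate = 0.05        # roughly 5 in 100 edits have problems
filter_precision = 0.15 # "May have problems" is right about 15% of the time
print(filter_precision / base_rate)  # 3.0: three times better than random sampling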

(As noted above, ORES performs differently on different wikis, which means that some are less subject to the tradeoffs just discussed than others. On Polish Wikipedia, for example, the Likely have problems filter captures 91% of problem edits, compared to 34% with the corresponding filter on English Wikipedia. Because of this, Polish Wikipedia does not need—or have—a broader May have problems filter.)

Get the best of both worlds (with highlighting)

 
You can get the best of both worlds by filtering broadly but highlighting using more accurate functions. Here, the user casts a wide net for damage by checking the broadest Quality filter, May have problems. At the same time, she identifies the worst or most obvious problems by highlighting (but not filtering with) Likely have problems, Very likely have problems and Likely bad faith.

The filtering system is designed to let users get around the tradeoffs described above. You can do this by filtering broadly while highlighting the information that matters most.

To use this strategy, it’s helpful to understand that the more accurate filters, like Very likely have problems, return results that are a subset of the less accurate filters, such as May have problems. In other words, all Very likely have problems results are also included in the broader May have problems. (The diagram above illustrates this concept.)

Example: Find almost all damage while emphasizing the worst/most likely:
  1. Start with the default settings loaded.
  2. Check the broadest Quality filter, May have problems.
  3. At the same time, highlight—without checking the filter boxes—Likely have problems, in yellow, and Very likely have problems, in red.
Because you are using the broadest Quality filter, your results will include most problem edits (high “recall”). But by visually scanning for the yellow, red and orange (i.e., blended red + yellow) bands, you will easily be able to pick out the most likely problem edits and address them first. (Find help on using highlights without filtering.)

Find the good (and reward it)

 
This reviewer wants to thank new users who are making positive contributions. The Very likely good filter isolates problem-free edits with 99% accuracy. Filtering for Newcomers and Learners limits the search to these two experience levels, while applying a green highlight to Newcomers (only) enables the reviewer to distinguish at a glance between the two levels.

Good faith is easy to find, literally! So are good edits.

The Very likely good faith filter and the Very likely good (Quality) filter give you new ways to find and encourage users who are working to improve the wikis. For example, you might use the Very likely good filter in combination with the Newcomers filter to thank new users for their good work.

Example: Thank good-faith new users
  1. Clear the filters by clicking the Trashcan. Then select the Page edits and Human (not bot) filters.
  2. Check the Quality filter Very likely good.
  3. Check the User Registration and Experience filters Newcomers and Learners (this has the hidden effect of limiting your results to registered users).
  4. Highlight the Newcomers filter, in green.
All edits in your results will be good edits by Newcomers (users with fewer than 10 edits and 4 days of activity) and Learners (users with fewer than 500 edits and 30 days of activity). The green highlight lets you easily distinguish between the two.
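For patrollers who prefer scripts to the on-wiki interface, the idea above can be approximated programmatically. This is only a rough sketch: it uses the standard MediaWiki recentchanges API plus the ORES request shown earlier, it does not reproduce the Newcomers/Learners experience-level filters, and the 0.95 cutoff is an arbitrary stand-in for “very likely good faith”.

import requests

# Rough scripted analogue of the example above (a sketch, not the on-wiki filters):
# list recent human page edits, then keep the ones ORES scores as very likely
# made in good faith. Experience-level filtering is not reproduced here.
API = "https://en.wikipedia.org/w/api.php"
ORES = "https://ores.wikimedia.org/v3/scores/enwiki"

changes = requests.get(API, params={
    "action": "query", "list": "recentchanges", "format": "json",
    "rctype": "edit", "rcshow": "!bot",
    "rcprop": "ids|user|title", "rclimit": 25,
}, timeout=10).json()["query"]["recentchanges"]

rev_ids = "|".join(str(c["revid"]) for c in changes)
scores = requests.get(ORES, params={
    "models": "goodfaith", "revids": rev_ids,
}, timeout=10).json()["enwiki"]["scores"]

for c in changes:
    score = scores[str(c["revid"])].get("goodfaith", {}).get("score")
    if score and score["probability"]["true"] > 0.95:  # arbitrary cutoff
        print(c["user"], "-", c["title"], round(score["probability"]["true"], 2))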

Good is everywhere!

The “good” filters mentioned above are both accurate and broad, meaning they aren’t subject to the tradeoffs described in the previous section (they combine high “precision” with high “recall”). These filters are correct about 99% of the time and find well over 90% of their targets. How can they do that?

The happy answer is that the “good” filters perform so well because good is more common than bad. That is, good edits and good faith are much, much more plentiful than their opposites—and therefore easier to find. It may surprise some patrollers to hear this, but on English Wikipedia, for example, one out of every 20 edits has problems, and only about half of those problematic edits are intentional vandalism.[1]

List of filters

On wikis that use the Quality and Intent filters, some filter levels may not exist because prediction quality is better there. The better ORES performs on a wiki, the fewer filter levels that wiki needs.

Contribution quality predictions

Very likely good
Very likely to find edits with almost no problems.
May have problems
Finds heavily flawed or damaging edits, but with low accuracy.
Likely have problems
Finds problem edits with medium accuracy (on wikis where ORES performs especially well, this level finds most problem edits with high accuracy).
Very likely have problems
Very likely to find severely flawed or damaging edits.

User intent predictions

Very likely good faith
Very likely to find edits made in good faith.
May be bad faith
Finds most bad-faith edits, but with low accuracy.
Likely bad faith
Finds bad-faith edits with medium accuracy.

Revert risk

Filter levels TBD. Uses the language-agnostic revert risk model.

Notes

  1. These figures come from research that went into training the “damaging” and “good faith” ORES models on English Wikipedia. That’s to say, when volunteers scored a large, randomly drawn set of test edits, this is what they found.