I would like it if the CopyPatrol had tools that understood about CC sources (it would be nice if the people did too) and other random editors who think they can spot copyright violation.
I work with a lot of material released under CC-BY, and I get tired of the accusations of copyvio (including speedy deletion of my new articles) despite my clear attribution within the article that it is drawing from an identified CC-BY source. I follow the rules but others don't even seem to understand those rules.
The first common misunderstanding is that an assertion of copyright over a webpage does override a CC notice. If an organisation is to license its content under CC-BY, then it must be the copyright holder to do so. There is no incompatibility between a copyright notice and a CC license; it's to be expected. It is not a case of "find a copyright statement and declare a copyvio", but look for the CC license.
In particular, our tools need to look beyond the individual webpage which does not contain a CC-BY statement, but to the copyright notice on the bottom of the page links to a copyright statement which CC-BYs all (or most) pages within the website. As an example, suppose I were to copy content (appropriately attributed) from this webpage onto Wikipedia:
As you can see, there is nothing on this page that says it is CC-BY, but the link at the bottom of the page labelled Copyright links to this page:
https://www.qld.gov.au/legal/copyright
which says that the whole website (except where explicitly excluded) is CC-BY-4.0.
We also need easier ways to attribute suitably licensed CC-material, particularly when there are large CC-BY websites (such as the one above - the primary portal for the Queensland Government). I don't mind doing a "big" attribution statement when I created an article largely from a CC-BY source, e.g,
https://en.wikipedia.org/wiki/Junction_Park_State_School
but if I just want a few sentences from that source, and then a few sentences from another CC-BY source etc, then the attribution starts to become very heavyweight (could end up with more bytes of attribution than the text taken from those sources!) and indeed such "end of article" attribution cannot indicate which portions of the article were derived from it. I think it would be much better if the cite templates took a parameter to signal the licensing of a source where this is relevant and displayed the little "CC-BY" icon (or whichever license icon is applicable). Alternatively we need better attribution templates. Currently I have to write my own attribution templates as in the case above, or manually write an attribution as in:
https://en.wikipedia.org/wiki/New_Mapoon,_Queensland
because the existing templates are inadequate. Another piece of the problem is that some people believe you must use one of the existing templates to avoid the accusation of copyright violation.
CC-BY material is a great way to get a lot of content onto Wikipedia. In the example above, we were using such content to fill knowledge gaps on Wikipedia about Indigenous Australian communities (several articles were expanded using that Qld Govt website). Let's encourage the use of CC-BY content instead of the current practice of making it as unpleasant and as difficult as possible.
Now, above I am discussing CC-BY content. But if the content is CC0 or PD, then I don't have to make attribution at all. But again we have no clear policies and proceses in place about how to do this. I often write biographies based on obituaries that appeared in the pre-1955 Australian newspapers (which are both out-of-copyright in Australia and extensively digitised), e.g. I might want to write a biography based on this newspaper article:
https://trove.nla.gov.au/newspaper/article/53048297
Do I need to do anything more than cite it? Is there anywhere I should be asserting {{PD-Australia}}? Again, if the cite templates included the capability for me to add the {{PD-Australia}} to an appropriate field, there would be a clear assertion by me that this is public domain material and hence not a copyvio. At the moment, I left at the untender mercies of any random editor who decides to call it a copyright violation.