Manual:Pywikibot/djvutext.py

Wikimedia Git repository has this file: scripts/djvutext.py

djvutext.py extracts the OCR text from a DjVu page image and uploads it to the corresponding page in the Page namespace on Wikisource. For the bot to work, the DjVu file must be in the folder Pywikibot runs from, and it must be identical to the DjVu file used for the index page on Wikisource.
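
The location requirement can be checked before starting the bot. The following is a minimal sketch, not part of djvutext.py: the helper name `djvu_is_local` and the file name `Example.djvu` are placeholders, and the current working directory is assumed to be the folder Pywikibot runs from. It cannot verify that the local file is identical to the one used on Wikisource.

```python
from pathlib import Path
from typing import Optional


def djvu_is_local(filename: str, folder: Optional[Path] = None) -> bool:
    """Return True if `filename` is a regular file in `folder`.

    `folder` defaults to the current working directory, which is assumed
    here to be the folder Pywikibot is run from.
    """
    base = folder if folder is not None else Path.cwd()
    return (base / filename).is_file()


if __name__ == "__main__":
    # "Example.djvu" is a placeholder file name.
    print(djvu_is_local("Example.djvu"))
```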

On a blank page, the script leaves the page empty and sets the pagequality level to 0 (without text) and the user to the name of your bot, e.g. <pagequality level="0" user="DougBot" />.
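
To illustrate the format of that tag, here is a small sketch that builds it for a given bot name. The function name `blank_page_tag` is hypothetical and this is not how djvutext.py itself produces the tag. It only reproduces the example string shown above.

```python
def blank_page_tag(bot_user: str) -> str:
    """Build the pagequality tag for a page without text (level 0)."""
    return f'<pagequality level="0" user="{bot_user}" />'


print(blank_page_tag("DougBot"))
# prints: <pagequality level="0" user="DougBot" />
```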

Parameters

The following parameters are supported:

`-ask`	Ask for confirmation before uploading each page. (Default: ask when overwriting pages)
`-djvu:...`	Filename of the local djvu file (i.e. in your PWB folder)
`-index:...`	Name of the index page on Wikisource
`-pages:<start>-<end>`	Page range to upload; `<end>` is optional (e.g. "-pages:1-"), and `<start>` is optional if `<end>` is given
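
One way to read the `-pages` value is sketched below. This is an illustration under stated assumptions, not the script's actual parser: the function name `parse_pages` is hypothetical, a missing `<start>` is assumed to mean page 1, and a missing `<end>` is returned as `None` (no upper bound). The manual does not confirm either default, and the sketch accepts only the hyphenated form.

```python
from typing import Optional, Tuple


def parse_pages(value: str) -> Tuple[int, Optional[int]]:
    """Split '<start>-<end>' into a (start, end) pair.

    Assumptions (not confirmed by the manual): a missing start means
    page 1, and a missing end is returned as None.
    """
    start, sep, end = value.partition("-")
    if not sep:
        raise ValueError("expected the form <start>-<end>")
    if not start and not end:
        raise ValueError("at least one of <start> and <end> is required")
    # int() raises ValueError for non-numeric input.
    return (int(start) if start else 1, int(end) if end else None)


print(parse_pages("1-"))   # (1, None)
print(parse_pages("-10"))  # (1, 10)
print(parse_pages("3-7"))  # (3, 7)
```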

All other parameters will be regarded as part of the title of a single page, and the bot will only work on that single page.

Core only parameters

`-summary:...`	Custom edit summary. Use quotes if the edit summary contains spaces.
`-force`	Overwrite existing text (optional; default: False)
`-always`	Do not ask for confirmation of any change
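
The sketch below assembles a complete command line from these parameters and shows why a summary containing spaces needs quoting. The file name `Example.djvu`, the index name, the page range, and the summary text are all placeholders. The quoting follows POSIX shell rules as implemented by `shlex.join` (Python 3.8 or later), and other shells may need different quoting.

```python
import shlex

# Placeholder values; replace them with your own file, index, and summary.
args = [
    "python", "pwb.py", "djvutext",
    "-djvu:Example.djvu",
    "-index:Example.djvu",
    "-pages:1-10",
    "-summary:Add OCR text",  # contains spaces, so it must be quoted
    "-force",
]

# shlex.join quotes only the arguments that need it.
command = shlex.join(args)
print(command)
# prints: python pwb.py djvutext -djvu:Example.djvu -index:Example.djvu -pages:1-10 '-summary:Add OCR text' -force
```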