Manual:Pywikibot/djvutext.py
The Wikimedia Git repository has this file: scripts/djvutext.py
djvutext.py extracts the OCR text from a DjVu page image and uploads it to the corresponding page in the Page namespace on Wikisource. For the bot to work, the DjVu file must be in the folder Pywikibot runs from, and it must be identical to the DjVu file used for the index page on Wikisource.
For a blank page, the script leaves the page blank and sets the pagequality level to 0 (without text) and the user to the name of your bot (e.g. <pagequality level="0" user="DougBot" />).
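As an illustration of the markup described above, the following minimal sketch builds the tag for a blank page. The function name `blank_page_markup` is hypothetical and is not part of Pywikibot; the sketch also assumes the bot name contains no double quotes.

```python
def blank_page_markup(bot_user: str) -> str:
    """Return the pagequality tag for a page without text (level 0).

    Hypothetical helper for illustration only; it mirrors the example
    tag shown above and is not taken from the script's source.
    """
    return f'<pagequality level="0" user="{bot_user}" />'


print(blank_page_markup("DougBot"))
```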
Parameters
The following parameters are supported:
-ask | Ask for confirmation before uploading each page. (Default: ask when overwriting pages)
-djvu:... | Filename of the local DjVu file (i.e. in your PWB folder)
-index:... | Name of the index page on Wikisource
-pages:<start>-<end> | Page range to upload; <end> is optional (e.g. "-pages:1-"), and <start> is optional if <end> is stated
All other parameters will be regarded as part of the title of a single page, and the bot will only work on that single page.
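The -pages rules can be sketched as a small parser. This is not the script's own implementation: the function name `parse_page_range` is hypothetical, and the defaults (a missing <start> means page 1, a missing <end> means the last page of the DjVu file) are assumptions that the text above does not spell out.

```python
def parse_page_range(value: str, last_page: int) -> range:
    """Interpret the text after "-pages:" as an inclusive page range.

    Assumptions (not confirmed by the manual): a missing <start> means
    page 1, a missing <end> means `last_page`, the number of pages in
    the DjVu file.
    """
    start_text, sep, end_text = value.partition("-")
    if not sep:
        raise ValueError(f"expected <start>-<end>, got {value!r}")
    if not start_text and not end_text:
        raise ValueError("at least one of <start> or <end> is required")

    start = int(start_text) if start_text else 1
    end = int(end_text) if end_text else last_page
    if not 1 <= start <= end <= last_page:
        raise ValueError(f"range {start}-{end} is outside 1-{last_page}")

    # range() excludes its stop value, so add 1 to include <end>.
    return range(start, end + 1)


# Open-ended range on a hypothetical 5-page file.
print(list(parse_page_range("1-", last_page=5)))
```

Rejecting a bare "-" follows the rule that <start> may only be omitted when <end> is given.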
Core only parameters
-summary:... | Custom edit summary. Use quotes if the edit summary contains spaces.
-force | Overwrite existing text; optional, default False
-always | Do not ask for confirmation of any change
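To show how the parameters combine, the sketch below assembles a command line and prints it with shell quoting. It assumes the core version started through the `pwb.py` wrapper. The file name, index name and summary are placeholders, and `build_command` is a hypothetical helper. Whether -index expects a namespace prefix is not stated here, so the value is only illustrative.

```python
import shlex


def build_command(djvu: str, index: str, pages: str,
                  summary: str = "") -> list[str]:
    """Assemble an argument list for a djvutext.py run (illustrative only)."""
    argv = [
        "python", "pwb.py", "djvutext",
        f"-djvu:{djvu}",
        f"-index:{index}",
        f"-pages:{pages}",
    ]
    if summary:
        # Kept as one list element; shlex.join adds the quotes that a
        # summary containing spaces needs on the command line.
        argv.append(f"-summary:{summary}")
    return argv


# Placeholder values, not real files or pages.
argv = build_command("Example.djvu", "Example.djvu", "1-10",
                     summary="Upload OCR text")
print(shlex.join(argv))
```

Building the arguments as a list and quoting only when printing avoids splitting a summary with spaces into several arguments.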