Help:Extension:Wikisource/Wikimedia OCR

The Wikimedia OCR feature of the Wikisource extension adds an interface to the main editing toolbar when editing in the Page namespace, for quickly extracting text from the page image and adding it to the page body text box. OCR stands for Optical Character Recognition, a process that converts text in a photographic image into editable text so that it can be added to a wiki.

To use this feature, click the 'Extract text' button at the right-hand side of the main editing toolbar. This runs the OCR process and inserts the resulting text into the page body field of the editing form, replacing any text already there. An 'undo' button is shown at the top of the text field, with which you can return the field to its previous state if you wish.

In its basic form, that is all there is to Wikimedia OCR, but a few advanced features can be useful in some circumstances. They are available from the dropdown menu to the right of the main 'Extract text' button. With them you can choose a different OCR engine, set a list of languages to help the software recognize words, or select a smaller area of the page from which to extract text. These are explained below. Apart from the OCR engine selection, all of these options are reached through the 'Advanced options' menu item, which opens a new tab.

OCR engines

There are currently three OCR engines available: Tesseract, Google and Transkribus. Tesseract is an open-source tool that runs in-house and supports a wide range of languages and other options. Google OCR is a proprietary service that also supports many languages, but with fewer options. Transkribus is supported by READ-COOP, an EU cooperative, and has partnered with the Wikimedia Foundation to provide a limited number of free credits in support of the Wikisource Loves Manuscripts project.

Which engine works best depends on the nature of the image being processed.

To switch engines, select the relevant radio button in the dropdown menu. Your choice will be remembered for your current device, and can be changed at any time.

Languages

Clicking the 'Advanced options' menu item opens a new tab with a transcription form containing a field for selecting the language or languages that are used in the page of text being extracted. This is useful because the OCR engines can be much more accurate when they know what languages to expect.

Note that not all languages are supported by all engines, and if you change the engine then the list of available languages will change too.

If your language is not in the list, you can leave the Languages field empty and the OCR engine will attempt to extract what text it can. Results vary, but it is worth trying.
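The relationship between engine and language list can be pictured with a small sketch. This is not how the extension is implemented: the function names are hypothetical, and the language sets below are placeholders chosen for illustration, not the engines' real support lists.

```python
# Placeholder data: NOT the real language lists of these engines.
SUPPORTED = {
    "tesseract": {"en", "de", "fr", "la"},
    "google": {"en", "de", "fr"},
    "transkribus": {"en", "de"},
}


def available_languages(engine: str) -> list[str]:
    """Return the sorted language codes offered for the given engine."""
    return sorted(SUPPORTED[engine])


def keep_supported(engine: str, selected: list[str]) -> list[str]:
    """Drop selected languages the engine does not offer, preserving order.

    An empty result corresponds to leaving the Languages field empty,
    in which case the engine simply extracts what text it can.
    """
    offered = SUPPORTED[engine]
    return [lang for lang in selected if lang in offered]
```

Under these placeholder sets, switching from Tesseract to Google with Latin and English selected would leave only English in the selection.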

Crop area

To extract text from only part of an image (for example, a single column of a newspaper page), you can select a crop area. Do this by first clicking the crop button, and then clicking and dragging over the page image to draw a rectangle. The image can be zoomed and panned, and the crop rectangle moved and resized as required. Buttons above the image switch between moving and cropping. Once you've selected the desired area, click 'Extract area' and the text for only that area will be shown in the right-side text box.
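As a rough illustration of what drawing a rectangle amounts to, the sketch below turns the start and end points of a drag into a crop area that stays inside the image. The page does not say how the tool represents a crop area internally, so the `CropArea` type, the `crop_from_drag` function and the pixel-coordinate convention are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CropArea:
    """Hypothetical crop rectangle in image pixels, origin at top-left."""
    x: int
    y: int
    width: int
    height: int


def _clamp(value: int, low: int, high: int) -> int:
    """Restrict value to the inclusive range [low, high]."""
    return max(low, min(value, high))


def crop_from_drag(start, end, image_width, image_height):
    """Build a crop rectangle from the two corner points of a drag.

    The drag may go in any direction and may leave the image;
    both points are clamped to the image bounds first.
    """
    x0 = _clamp(start[0], 0, image_width)
    x1 = _clamp(end[0], 0, image_width)
    y0 = _clamp(start[1], 0, image_height)
    y1 = _clamp(end[1], 0, image_height)
    left, right = sorted((x0, x1))
    top, bottom = sorted((y0, y1))
    return CropArea(left, top, right - left, bottom - top)
```

Sorting the clamped coordinates means a drag from bottom-right to top-left yields the same rectangle as the reverse drag.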

Returning from Advanced options

After using the advanced options form to extract text, it's necessary to copy and paste the resulting text back into the body field of the page editing form. To make this a bit quicker, a 'Copy to clipboard' button is provided.

First-time use

The first time you open a page for editing, a pulsating blue dot is shown on the 'Extract text' button. Clicking this dot or either of the buttons will open a popup explaining what this feature is. After this popup is dismissed, it will not be shown again (on the same device).

Issues

If you encounter any issues with using Wikimedia OCR, please report them on Phabricator, under the Wikisource OCR tag.
