As root, run:
ln -s "/usr/share/applications/ExifTool/exiftool" "/usr/local/bin/exiftool"
This links exiftool into /usr/local/bin, which synOCR also uses by default.
Interesting that "exiftool ok" appears in the middle of the log.
-----------------------------------
| ==> installation info <== |
-----------------------------------
synOCR-user: synOCR
synOCR-user is admin: yes
synOCR-version: 1.2.0
Architecture: aarch64
DSM-build: 42218
Device: rs819 (2356825277)
current Profil: default
DB-version: 5
used image (created): jbarlow83/ocrmypdf:latest (2022-02-08T09:23:30)
used ocr-parameter (raw): -srd -l deu+eng
ocropt_array: -srd -l deu+eng
search prefix:
replace search prefix: yes
renaming syntax: §y-§m-§d_§tag_§tit
Symbol for tag marking: #
Document split pattern:
Date search method: use standard search via RegEx
source for filedate: ocr
ignored dates by search: 2021-02-29;2020-11-31
Docker test: OK
DSM notify to user: admin
Loglevel: normal
max. count of logfiles: 10
Source directory: /volume1/ScanInbox/ocr/in/
Target directory: /volume1/ScanInbox/ocr/out/
BackUp directory: /volume1/ScanInbox/ocr/backup/
rotate backupfiles after: (purge backup deactivated)
----------------------------------
| ==> Funktionsaufrufe <== |
----------------------------------
ERROR - with pdfinfo / exiftool - $pagecount was set to 0
PROCESSING: ➜ img20220316_13390741.pdf (Sat Mar 19 15:22:38 CET 2022)
temp. target file: /tmp/tmp.UZ2QK72Nxf/img20220316_13390741.pdf
➜ OCRmyPDF-LOG:
reading file from standard input
Start processing 2 pages concurrently
2 page is facing ⇧, confidence 10.43 - no change
1 page is facing ⇧, confidence 9.24 - no change
Postprocessing...
Some input metadata could not be copied because it is not permitted in PDF/A. You may wish to examine the output PDF's XMP metadata.
Optimize ratio: 1.14 savings: 12.6%
Output sent to stdout
← OCRmyPDF-LOG-END
target file (OK): /tmp/tmp.UZ2QK72Nxf/img20220316_13390741.pdf
➜ search tags and date:
no tags defined
Using date format: 1 (1 = dd mm [yy]yy; 2 = [yy]yy mm dd; 3 = mm dd [yy]yy)
Using date format: 2 (1 = dd mm [yy]yy; 2 = [yy]yy mm dd; 3 = mm dd [yy]yy)
Using date format: 3 (1 = dd mm [yy]yy; 2 = [yy]yy mm dd; 3 = mm dd [yy]yy)
Date not found in OCR text - use file date:
day: 16
month:03
year: 2022
➜ renaming:
apply renaming syntax ➜ 2022-03-16__img20220316_13390741
➜ edit metadata (exiftool ok) 1 image files updated
target file: 2022-03-16__img20220316_13390741.pdf
➜ Adapt file date (Source: Source file [OCR selected but not found])
➜ move source file to: /volume1/ScanInbox/ocr/backup/img20220316_13390741 (1).pdf
./synOCR.sh: line 1779: /dev/ttyS1: Permission denied
ERROR at line 1779: echo 2 > /dev/ttyS1
INFO: (PushBullet-TOKEN not set)
Stats:
➜ runtime last file: 00:01:33
➜ pagecount last file: 0
➜ file count profile : (profile default) - 13 PDF's / 0 Pages processed up to now
➜ file count total: 13 PDF's / 0 Pages processed up to now
➜ delete tmp-files …
➜ purge logfiles:
➜ purge backup deactivated
-----------------------------------
| ==> synOCR ENDE <== |
-----------------------------------
NAME
exiftool - Read and write meta information in files
SYNOPSIS
Reading
exiftool [*OPTIONS*] [-*TAG*...] [--*TAG*...] *FILE*...
Writing
exiftool [*OPTIONS*] -*TAG*[+-<]=[*VALUE*]... *FILE*...
Copying
exiftool [*OPTIONS*] -tagsFromFile *SRCFILE* [-[*DSTTAG*<]*SRCTAG*...]
*FILE*...
Other
exiftool [ -ver | -list[w|f|r|wf|g[*NUM*]|d|x] ]
For specific examples, see the EXAMPLES sections below.
This documentation is displayed if exiftool is run without an input
*FILE* when one is expected.
DESCRIPTION
input="/volume1/…"
exiftool -"*Count*" "${input}" | awk -F' ' '{print $NF}'
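The page count lookup that the log reports as failing ("$pagecount was set to 0") can be sketched roughly as follows. This is a minimal sketch, not synOCR's actual code: `count_pages` is a hypothetical helper name, and the `pdfinfo` branch assumes the usual `Pages:` line in its output. The `exiftool` branch reuses the call shown above.

```shell
#!/bin/bash
# Sketch: determine the page count of a PDF, preferring pdfinfo and
# falling back to exiftool. count_pages is a hypothetical helper name.
count_pages() {
    local file="$1" pages=""
    if command -v pdfinfo >/dev/null 2>&1; then
        # pdfinfo prints a line such as "Pages:          2"
        pages=$(pdfinfo "$file" 2>/dev/null | awk '/^Pages:/ {print $2; exit}')
    fi
    if [ -z "$pages" ] && command -v exiftool >/dev/null 2>&1; then
        # Same call as in the post above; use the last field of the first line
        pages=$(exiftool -"*Count*" "$file" 2>/dev/null | awk 'NR==1 {print $NF}')
    fi
    # Fall back to 0 when neither tool produced a plain number
    case "$pages" in
        ''|*[!0-9]*) pages=0 ;;
    esac
    echo "$pages"
}
```

Called as `count_pages "${input}"`, it prints 0 when neither tool is installed or the output cannot be parsed, which would match the symptom in the log.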
See "pages counted with pdfinfo".
----------------------------------
| ==> Funktionsaufrufe <== |
----------------------------------
show files in INPUT with transcoded special characters
img20220316_13390741.pdf$
(pages counted with pdfinfo)
ERROR - with pdfinfo / exiftool - $pagecount was set to 0
PROCESSING: ➜ img20220316_13390741.pdf (Sat Mar 19 16:43:59 CET 2022)
temp. target file: /tmp/tmp.E594Kc2gcv/img20220316_13390741.pdf
[runtime up to now: 00:00:00]
Unfortunately the script aborts immediately with an error, but I'm sure that's my fault. The error:
#!/bin/bash
if [ $(which pdfinfo) ]; then
echo "pdfinfo"
elif [ $(which exiftool) ]; then
echo "exiftool"
fi
Unfortunately that doesn't get me any further. Can you spot anything?
pdfinfo.sh: line 4: syntax error near unexpected token `elif'
'dfinfo.sh: line 4: ` elif [ $(which exiftool) ]; then
Could it be that your script was saved with Windows line endings?
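The garbled `'dfinfo.sh` prefix in the error output is a typical sign of a carriage return at the end of a script line: the terminal jumps back to column 0 and the closing quote overwrites the first character. A minimal sketch for checking and fixing this, assuming standard `grep`, `tr` and `mktemp` are available; `has_crlf` and `strip_crlf` are hypothetical helper names:

```shell
#!/bin/bash
# Sketch: detect and remove Windows (CRLF) line endings in a script.
# has_crlf and strip_crlf are hypothetical helper names.

# Succeeds if the file contains at least one carriage return.
has_crlf() {
    local cr
    cr=$(printf '\r')
    grep -q "$cr" "$1"
}

# Removes every carriage return. The file is rewritten in place so that
# its permissions (e.g. the executable bit) are kept.
strip_crlf() {
    local tmp
    tmp=$(mktemp) || return 1
    tr -d '\r' < "$1" > "$tmp" && cat "$tmp" > "$1"
    local rc=$?
    rm -f "$tmp"
    return "$rc"
}
```

For the script from this thread that would be `has_crlf pdfinfo.sh && strip_crlf pdfinfo.sh`. Where `dos2unix` is installed, it does the same job.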
rule_06:
tagname: Vorgang_k.A.
tagname_RegEx: ((?<=(^Nr[:|;][\s]{7}){1})|(?<=(^Nr[:|;][\s]{8}){1})|(?<=(^Nr[:|;][\s]{9}){1})|(?<=(^Nr[:|;][\s]{1}){1}))[0-9]+
targetfolder: /Nr/
condition: all
subrules:
- searchstring: ((?<=(^Nr[:|;][\s]{7}){1})|(?<=(^Nr[:|;][\s]{8}){1})|(?<=(^Nr[:|;][\s]{9}){1})|(?<=(^Nr[:|;][\s]{1}){1}))[0-9]+
searchtyp: contains
isRegEx: true
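To see what the expression in rule_06 actually accepts, it can be tried against sample lines outside synOCR. The sketch below assumes GNU grep with PCRE support (`-P`); whether synOCR evaluates the rule the same way is an assumption, and `extract_nr` is a hypothetical helper name. Two things to keep in mind when reading the pattern: inside a bracket expression `|` is a literal character, so `[:|;]` matches `:`, `|` or `;`, and the four lookbehinds only allow exactly 1, 7, 8 or 9 whitespace characters between `Nr:` at the start of a line and the number.

```shell
#!/bin/bash
# Sketch: try the tagname_RegEx from rule_06 against sample text.
# Assumption: GNU grep with PCRE support (-P) is available.
pattern='((?<=(^Nr[:|;][\s]{7}){1})|(?<=(^Nr[:|;][\s]{8}){1})|(?<=(^Nr[:|;][\s]{9}){1})|(?<=(^Nr[:|;][\s]{1}){1}))[0-9]+'

# Reads text on stdin and prints only the matched number(s), if any.
# Returns non-zero when nothing matches, like grep itself.
extract_nr() {
    grep -oP "$pattern"
}
```

For example, `printf 'Nr: 4711\n' | extract_nr` can be compared with a line that has two spaces after `Nr:`, which none of the four lookbehinds accepts. Running the OCR text of a failing scan through the same function shows quickly whether the spacing in the document is what the rule expects.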
-----------------------------------
| ==> installation info <== |
-----------------------------------
synOCR-user: root
synOCR-user is admin: no
synOCR-version: 1.1.902
Architecture: x86_64
DSM-build: 42218
Device: 1618plus (0533678875)
current Profil: SWE_Nummer
DB-version: 4
used image (created): jbarlow83/ocrmypdf:latest (2022-02-08T09:05:13)
used ocr-parameter: -srd -l deu
replace search prefix: yes
renaming syntax: §tag
Symbol for tag marking: #
Document split pattern:
source for filedate: source
ignored dates by search:
Docker Test: OK
Loglevel: normal
Application Directory: /usr/syno/synoman/webman/3rdparty/synOCR
Source directory: /volume1/SWE/Test_Ordner_by_IT/OCR/
Target directory: /volume1/SWE/Test_Ordner_by_IT/
BackUp directory was created [/volume1/%2525252525252525252525252525252525252525252525252525252525252525252525253CPATH%2525252525252525252525252525252525252525252525252525252525252525252525253E/_BACKUP/]
----------------------------------
| ==> Funktionsaufrufe <== |
----------------------------------
PROCESSING: ➜ Scan_9832033.pdf (Tue Mar 22 08:36:01 CET 2022)
temp. target file: /tmp/tmp.uVW29jBCdH/Scan_9832033.pdf
➜ OCRmyPDF-LOG:
reading file from standard input
1 page is facing ⇧, confidence 3.26 - no change
Postprocessing...
Optimize ratio: 2.62 savings: 61.8%
Output sent to stdout
← OCRmyPDF-LOG-END
target file (OK):
➜ search tags and date:
source for tags is yaml based tag rule file [/volume1/SWE/Test_Ordner_by_IT/OCR/_TagConfig_[profile_SWE_Nummer].txt]
validate the integrity of yaml-file:
syntax error in row ERROR at line 652 [value must be only "content" OR "filename"]
syntax error in row ERROR at line 670 [value must be only "true" OR "false"]
search by tag rule: "rule_06" ➜
➜ condition: all
➜ tag: Vorgang_k.A.
➜ destination: /Nr/
➜ RegEx for tag: ((?<=(^Nr[:|;][\s]{7}){1})|(?<=(^Nr[:|;][\s]{8}){1})|(?<=(^Nr[:|;][\s]{9}){1})|(?<=(^Nr[:|;][\s]{1}){1}))[0-9]+
[value for source is empty - "content" is used]
[value for casesensitive is empty - "false" is used]
>>> Rule is not satisfied
rename tag is: ""
Using date format: 1 (1 = dd mm [yy]yy; 2 = [yy]yy mm dd; 3 = mm dd [yy]yy)
Using date format: 2 (1 = dd mm [yy]yy; 2 = [yy]yy mm dd; 3 = mm dd [yy]yy)
Using date format: 3 (1 = dd mm [yy]yy; 2 = [yy]yy mm dd; 3 = mm dd [yy]yy)
Date not found in OCR text - use file date:
day: 22
month:03
year: 2022
➜ renaming:
apply renaming syntax ➜
➜ edit metadata FAILED! - exiftool not found! Please install it over cphub.net if you need it
File name already exists! Add counter (3)
target file: (3).pdf
➜ Adapt file date (Source: Source file)
➜ move source file to: /volume1/%2525252525252525252525252525252525252525252525252525252525252525252525253CPATH%2525252525252525252525252525252525252525252525252525252525252525252525253E/_BACKUP/Scan_9832033.pdf
INFO: (notification dosn't work at DSM7 without i18n …)
INFO: (PushBullet-TOKEN not set)
Stats:
➜ runtime last file: 00:00:38
➜ pagecount last file: 1
➜ file count profile : (profile SWE_Nummer) - 79 PDF's / 204 Pages processed up to now
➜ file count total: 87 PDF's / 204 Pages processed up to now
➜ delete tmp-files …
-----------------------------------
| ==> synOCR ENDE <== |
-----------------------------------
The runtime for the last document is shown at the end of the log. Maybe you can do without the parameters r and d (i.e. -s instead of -srd).
r looks for the correct page orientation and tries to rotate the page accordingly.
d straightens skewed scans.
… on peak days we reach the maximum (capacity) of 4,300 images/PDFs and are considering buying the successor model.
… be called with start.
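The suggestion to go from -srd to -s can be sketched as a small helper that removes individual letters from the first short-option cluster of the OCR parameter string. In OCRmyPDF, -s is --skip-text, -r is --rotate-pages and -d is --deskew. `drop_flags` is a hypothetical helper name, and the sketch assumes the cluster is the first word of the string, as in the logs above.

```shell
#!/bin/bash
# Sketch: remove selected letters from the first short-option cluster.
# drop_flags is a hypothetical helper name.
# $1: option string, e.g. "-srd -l deu+eng"
# $2: flag letters to drop, e.g. "rd"
drop_flags() {
    local first rest
    first=${1%% *}          # first word, e.g. "-srd"
    rest=${1#"$first"}      # remainder, e.g. " -l deu+eng"
    case "$first" in
        # Only touch single-dash clusters, never long options like --deskew
        -[!-]*) first="-$(printf '%s' "${first#-}" | tr -d "$2")" ;;
    esac
    # Drop the cluster entirely if no letter is left
    [ "$first" = "-" ] && first=""
    printf '%s%s\n' "$first" "$rest"
}
```

With the parameters from the first log, `drop_flags "-srd -l deu+eng" rd` leaves the language setting untouched and only shortens the cluster. Whether skipping rotation and deskewing is acceptable depends on how clean the scans are.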