Gotenberg URL = PAPERLESS_TIKA_GOTENBERG_ENDPOINT: http://gotenberg:3000Richte dich nach der Anleitung von Paperless. Das betrifft dieGotenberg URL
,Gotenberg commands
und dasTika Image
. Was du noch gekürzt hast, kann noch falsch sein. Gerade Gotenberg und Tika ist ja für Office notwendig.
gotenberg:
image: gotenberg/gotenberg:latest
restart: unless-stopped
environment:
CHROMIUM_DISABLE_ROUTES: 1
command:
- 'gotenberg'
- '--chromium-disable-javascript=true'
- '--chromium-allow-list=file:///tmp/.*'
- '--api-timeout=60s'
networks:
- internal
PAPERLESS_TIKA_ENABLED: 1
PAPERLESS_TIKA_GOTENBERG_ENDPOINT: http://gotenberg:3000
PAPERLESS_TIKA_ENDPOINT: http://tika:9998
2023/06/23 17:18:30 stdout 1:M 23 Jun 2023 15:18:30.012 * Ready to accept connections
2023/06/23 17:18:30 stdout 1:M 23 Jun 2023 15:18:30.012 * DB loaded from disk: 0.000 seconds
2023/06/23 17:18:30 stdout 1:M 23 Jun 2023 15:18:30.012 # Done loading RDB, keys loaded: 0, keys expired: 0.
2023/06/23 17:18:30 stdout 1:M 23 Jun 2023 15:18:30.012 * RDB memory usage when created 0.77 Mb
2023/06/23 17:18:30 stdout 1:M 23 Jun 2023 15:18:30.012 * RDB age 336 seconds
2023/06/23 17:18:30 stdout 1:M 23 Jun 2023 15:18:30.012 * Loading RDB produced by version 6.2.12
2023/06/23 17:18:30 stdout 1:M 23 Jun 2023 15:18:30.010 # WARNING Memory overcommit must be enabled! Without it, a background save or replication may fail under low memory condition. Being disabled, it can can also cause failures without low memory condition, see https://github.com/jemalloc/jemalloc/issues/1328. To fix this issue add 'vm.overcommit_memory = 1' to /etc/sysctl.conf and then reboot or run the command 'sysctl vm.overcommit_memory=1' for this to take effect.
2023/06/23 17:18:30 stdout 1:M 23 Jun 2023 15:18:30.010 # Server initialized
2023/06/23 17:18:30 stdout 1:M 23 Jun 2023 15:18:30.010 # WARNING: The TCP backlog setting of 511 cannot be enforced because /proc/sys/net/core/somaxconn is set to the lower value of 128.
2023/06/23 17:18:30 stdout
2023/06/23 17:18:30 stdout `-.__.-'
2023/06/23 17:18:30 stdout `-._ _.-'
2023/06/23 17:18:30 stdout `-._ `-.__.-' _.-'
2023/06/23 17:18:30 stdout `-._ `-._`-.__.-'_.-' _.-'
2023/06/23 17:18:30 stdout | `-._`-._ _.-'_.-' |
2023/06/23 17:18:30 stdout |`-._`-._ `-.__.-' _.-'_.-'|
2023/06/23 17:18:30 stdout `-._ `-._`-.__.-'_.-' _.-'
2023/06/23 17:18:30 stdout | `-._`-._ _.-'_.-' | https://redis.io
2023/06/23 17:18:30 stdout |`-._`-._ `-.__.-' _.-'_.-'|
2023/06/23 17:18:30 stdout `-._ `-._ `-./ _.-' _.-'
2023/06/23 17:18:30 stdout | `-._ `._ / _.-' | PID: 1
2023/06/23 17:18:30 stdout |`-._`-...-` __...-.``-._|'` _.-'| Port: 6379
2023/06/23 17:18:30 stdout ( ' , .-` | `, ) Running in standalone mode
2023/06/23 17:18:30 stdout .-`` .-```. ```\/ _.,_ ''-._
2023/06/23 17:18:30 stdout _.-`` `. `_. ''-._ Redis 6.2.12 (00000000/0) 64 bit
2023/06/23 17:18:30 stdout _.-``__ ''-._
2023/06/23 17:18:30 stdout _._
2023/06/23 17:18:30 stdout 1:M 23 Jun 2023 15:18:30.009 * monotonic clock: POSIX clock_gettime
2023/06/23 17:18:30 stdout 1:C 23 Jun 2023 15:18:30.008 # Warning: no config file specified, using the default config. In order to specify a config file use redis-server /path/to/redis.conf
2023/06/23 17:18:30 stdout 1:C 23 Jun 2023 15:18:30.008 # Redis version=6.2.12, bits=64, commit=00000000, modified=0, pid=1, just started
2023/06/23 17:18:30 stdout 1:C 23 Jun 2023 15:18:30.008 # oO0OoO0OoO0Oo Redis is starting oO0OoO0OoO0Oo
2023/06/23 17:19:16 stdout Failed to connect to redis using environment variable PAPERLESS_REDIS.
2023/06/23 17:19:11 stdout Waiting 5s
2023/06/23 17:19:11 stdout Error: Error -5 connecting to broker:6379. No address associated with hostname..
2023/06/23 17:19:11 stdout Redis ping #4 failed.
2023/06/23 17:19:06 stdout Waiting 5s
2023/06/23 17:19:06 stdout Error: Error -5 connecting to broker:6379. No address associated with hostname..
2023/06/23 17:19:06 stdout Redis ping #3 failed.
2023/06/23 17:19:01 stdout Waiting 5s
2023/06/23 17:19:01 stdout Error: Error -5 connecting to broker:6379. No address associated with hostname..
2023/06/23 17:19:01 stdout Redis ping #2 failed.
2023/06/23 17:18:56 stdout Waiting 5s
2023/06/23 17:18:56 stdout Error: Error -5 connecting to broker:6379. No address associated with hostname..
2023/06/23 17:18:56 stdout Redis ping #1 failed.
2023/06/23 17:18:51 stdout Waiting 5s
2023/06/23 17:18:51 stdout Error: Error -5 connecting to broker:6379. No address associated with hostname..
2023/06/23 17:18:51 stdout Redis ping #0 failed.
2023/06/23 17:18:51 stdout Waiting for Redis...
2023/06/23 17:18:42 stdout Waiting for PostgreSQL to start...
2023/06/23 17:18:41 stdout Adjusting permissions of paperless files. This may take a while.
2023/06/23 17:18:41 stdout Creating directory /tmp/paperless
2023/06/23 17:18:41 stdout Package tesseract-ocr-deu already installed!
2023/06/23 17:18:41 stdout Package tesseract-ocr-eng already installed!
2023/06/23 17:18:41 stdout 0% [Working] 0% [Working] 0% [Working] 0% [Working] 20% [Working] Reading package lists... 0% Reading package lists... 0% Reading package lists... 0% Reading package lists... 85% Reading package lists... 98% Reading package lists... 98% Reading package lists... 99% Reading package lists... 99% Reading package lists... Done
2023/06/23 17:18:39 stdout Hit:3 http://deb.debian.org/debian-security bookworm-security InRelease
2023/06/23 17:18:39 stdout 0% [Working] Hit:2 http://deb.debian.org/debian bookworm-updates InRelease
2023/06/23 17:18:39 stdout 0% [Working] Hit:1 http://deb.debian.org/debian bookworm InRelease
2023/06/23 17:18:36 stdout Installing languages...
2023/06/23 17:18:35 stdout Paperless-ngx docker container starting...
{
"CapAdd" : null,
"CapDrop" : null,
"cmd" : "/usr/local/bin/paperless_cmd.sh",
"cpu_priority" : 50,
"enable_publish_all_ports" : false,
"enable_restart_policy" : false,
"enable_service_portal" : null,
"enabled" : true,
"env_variables" : [
{
"key" : "PATH",
"value" : "/usr/local/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin"
},
{
"key" : "LANG",
"value" : "C.UTF-8"
},
{
"key" : "GPG_KEY",
"value" : "XXX"
},
{
"key" : "PYTHON_VERSION",
"value" : "3.9.16"
},
{
"key" : "PYTHON_PIP_VERSION",
"value" : "22.0.4"
},
{
"key" : "PYTHON_SETUPTOOLS_VERSION",
"value" : "58.1.0"
},
{
"key" : "PYTHON_GET_PIP_URL",
"value" : "https://github.com/pypa/get-pip/raw/1a96dc5acd0303c4700e02655aefd3bc68c78958/public/get-pip.py"
},
{
"key" : "PYTHON_GET_PIP_SHA256",
"value" : "XXX"
},
{
"key" : "PAPERLESS_DBHOST",
"value" : "db"
},
{
"key" : "PAPERLESS_SECRET_KEY",
"value" : "XXX"
},
{
"key" : "PAPERLESS_REDIS",
"value" : "redis://broker:6379"
},
{
"key" : "USERMAP_GID",
"value" : "101"
},
{
"key" : "PAPERLESS_OCR_LANGUAGE",
"value" : "deu"
},
{
"key" : "PAPERLESS_URL",
"value" : "https://paperless.XXX.synology.me"
},
{
"key" : "PAPERLESS_OCR_LANGUAGES",
"value" : "eng deu"
},
{
"key" : "USERMAP_UID",
"value" : "1026"
},
{
"key" : "PAPERLESS_ALLOWED_HOSTS",
"value" : "localhost,paperless.XXX.synology.me"
},
{
"key" : "PAPERLESS_TIME_ZONE",
"value" : "Europe/Berlin"
},
{
"key" : "PAPERLESS_FILENAME_FORMAT",
"value" : "{created_year}/{correspondent}/{title}"
},
{
"key" : "PAPERLESS_DBPASS",
"value" : "XXX"
}
],
"exporting" : false,
"id" : "25983fd1f88dda7dcd8a892b00910018d36e0f8b88acdd22568d754d3c32ccda",
"image" : "paperlessngx/paperless-ngx:latest",
"is_ddsm" : false,
"is_package" : false,
"labels" : {
"com.docker.compose.config-hash" : "7e92544a6783cdbae5087c9e54eda5dda537bd30ae3c39059a70bf025ee5177a",
"com.docker.compose.container-number" : "1",
"com.docker.compose.depends_on" : "db:service_started,broker:service_started",
"com.docker.compose.image" : "sha256:1e0fca605df2034d1c2780bb227445531367d0e8324f548521f62ed3625222ac",
"com.docker.compose.oneoff" : "False",
"com.docker.compose.project" : "paperless",
"com.docker.compose.project.config_files" : "/data/compose/3/docker-compose.yml",
"com.docker.compose.project.working_dir" : "/data/compose/3",
"com.docker.compose.service" : "webserver",
"com.docker.compose.version" : "2.10.2",
"org.opencontainers.image.authors" : "paperless-ngx team <hello@paperless-ngx.com>",
"org.opencontainers.image.created" : "2023-02-17T04:19:44.474Z",
"org.opencontainers.image.description" : "A community-supported supercharged version of paperless: scan, index and archive all your physical documents",
"org.opencontainers.image.documentation" : "https://docs.paperless-ngx.com/",
"org.opencontainers.image.licenses" : "GPL-3.0",
"org.opencontainers.image.revision" : "6c658a676e9b5697a408355f964e526b17692ddb",
"org.opencontainers.image.source" : "https://github.com/paperless-ngx/paperless-ngx",
"org.opencontainers.image.title" : "paperless-ngx",
"org.opencontainers.image.url" : "https://github.com/paperless-ngx/paperless-ngx",
"org.opencontainers.image.version" : "1.13.0"
},
"links" : [],
"memory_limit" : 0,
"name" : "paperless",
"network" : [
{
"driver" : "bridge",
"name" : "paperless_internal"
}
],
"network_mode" : "paperless_internal",
"port_bindings" : [
{
"container_port" : 8000,
"host_port" : 8010,
"type" : "tcp"
}
],
"privileged" : false,
"services" : [],
"shortcut" : {
"enable_shortcut" : false,
"enable_status_page" : false,
"enable_web_page" : false,
"web_page_url" : ""
},
"use_host_network" : false,
"version" : 2,
"volume_bindings" : [
{
"host_volume_file" : "/homes/XXX/Paperless-Inbox",
"is_directory" : true,
"mount_point" : "/usr/src/paperless/consume",
"type" : "rw"
},
{
"host_volume_file" : "/docker/paperless/data",
"is_directory" : true,
"mount_point" : "/usr/src/paperless/data",
"type" : "rw"
},
{
"host_volume_file" : "/docker/paperless/media",
"is_directory" : true,
"mount_point" : "/usr/src/paperless/media",
"type" : "rw"
},
{
"host_volume_file" : "/docker/paperless/export",
"is_directory" : true,
"mount_point" : "/usr/src/paperless/export",
"type" : "rw"
}
]
}
{
"CapAdd" : null,
"CapDrop" : null,
"cmd" : "redis-server",
"cpu_priority" : 50,
"enable_publish_all_ports" : false,
"enable_restart_policy" : false,
"enable_service_portal" : null,
"enabled" : true,
"env_variables" : [
{
"key" : "PATH",
"value" : "/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin"
},
{
"key" : "GOSU_VERSION",
"value" : "1.16"
},
{
"key" : "REDIS_VERSION",
"value" : "6.2.11"
},
{
"key" : "REDIS_DOWNLOAD_URL",
"value" : "http://download.redis.io/releases/redis-6.2.11.tar.gz"
},
{
"key" : "REDIS_DOWNLOAD_SHA",
"value" : "8c75fb9cdd01849e92c23f30cb7fe205ea0032a38d11d46af191014e9acc3098"
}
],
"exporting" : false,
"id" : "573f83f91824dcd385aba3d5508121b22a090a0fcaef8972c38e980e45ab5f48",
"image" : "redis:6.2",
"is_ddsm" : false,
"is_package" : false,
"labels" : {
"com.docker.compose.config-hash" : "c421fa498044514bb27c4b26d25fa278d1df83e5da7e47dd68fa9b35e1bb0214",
"com.docker.compose.container-number" : "1",
"com.docker.compose.depends_on" : "",
"com.docker.compose.image" : "sha256:c250e371c3997705ad7f1725e30ccd26bb93b76e4a49be801bc28280c90ef310",
"com.docker.compose.oneoff" : "False",
"com.docker.compose.project" : "paperless",
"com.docker.compose.project.config_files" : "/data/compose/3/docker-compose.yml",
"com.docker.compose.project.working_dir" : "/data/compose/3",
"com.docker.compose.service" : "broker",
"com.docker.compose.version" : "2.10.2"
},
"links" : [],
"memory_limit" : 0,
"name" : "paperless-redis",
"network" : [
{
"driver" : "bridge",
"name" : "paperless_internal"
}
],
"network_mode" : "paperless_internal",
"port_bindings" : [],
"privileged" : false,
"shortcut" : {
"enable_shortcut" : false,
"enable_status_page" : false,
"enable_web_page" : false,
"web_page_url" : ""
},
"use_host_network" : false,
"version" : 2,
"volume_bindings" : [
{
"host_volume_file" : "/docker/paperless/redis",
"is_directory" : true,
"mount_point" : "/redis",
"type" : "rw"
}
]
}
Error while converting email to PDF: Client error '404 Not Found' for url 'http://gotenberg:3000/forms/chromium/convert/html'
For more information check: https://httpstatuses.com/404
PAPERLESS_TIKA_GOTENBERG_ENDPOINT: http://gotenberg:3000
PAPERLESS_TIKA_ENDPOINT: http://tika:9998
PAPERLESS_ENABLE_UPDATE_CHECK: 1
PAPERLESS_CONSUMER_POLLING_RETRY_COUNT: 2
PAPERLESS_WEBSERVER_WORKERS: 2
gotenberg:
image: gotenberg/gotenberg:7.7.2
restart: always
container_name: Paperless-NGX-GOTENBERG
ports:
- 3000:3000
command:
- "gotenberg"
- "--chromium-disable-routes=true"
tika:
image: ghcr.io/paperless-ngx/tika
container_name: Paperless-NGX-TIKA
ports:
- 9998:9998
restart: always
Hast du Parameter innerhalb der Paperless-ngx-Oberfläche gesetzt, beispielsweise, dass die Mails nicht älter sein dürfen als x Tage? Einstellungen → E-Mail-Regel bearbeiten → „Maximales Alter (Tage)“aber keine Nachrichten gefunden
[2024-01-07 16:23:43,963] [WARNING] [paperless.barcodes] Unsupported file format for barcode reader: application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
[2024-01-07 16:23:44,004] [INFO] [paperless.consumer] Consuming 2023-12-07 zu archivierende Dokumente per anno.xlsm
[2024-01-07 16:23:44,009] [DEBUG] [paperless.consumer] Detected mime type: application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
[2024-01-07 16:23:44,425] [DEBUG] [paperless.consumer] Parser: TikaDocumentParser
[2024-01-07 16:23:44,433] [DEBUG] [paperless.consumer] Parsing 2023-12-07 zu archivierende Dokumente per anno.xlsm...
[2024-01-07 16:23:44,434] [INFO] [paperless.parsing.tika] Sending /tmp/paperless/paperless-ngxqsutgg51/2023-12-07 zu archivierende Dokumente per anno.xlsm to Tika server
[2024-01-07 16:23:46,112] [INFO] [paperless.parsing.tika] Converting /tmp/paperless/paperless-ngxqsutgg51/2023-12-07 zu archivierende Dokumente per anno.xlsm to PDF as /tmp/paperless/paperless-ina7whdf/convert.pdf
[2024-01-07 16:23:46,226] [ERROR] [paperless.consumer] Error occurred while consuming document 2023-12-07 zu archivierende Dokumente per anno.xlsm: Error while converting document to PDF: Client error '400 Bad Request' for url 'http://gotenberg:3000/forms/libreoffice/convert'
For more information check: https://developer.mozilla.org/en-US/docs/Web/HTTP/Status/400
Traceback (most recent call last):
File "/usr/src/paperless/src/paperless_tika/parsers.py", line 109, in convert_to_pdf
response = route.run()
^^^^^^^^^^^
File "/usr/local/lib/python3.11/site-packages/gotenberg_client/_base.py", line 75, in run
resp.raise_for_status()
File "/usr/local/lib/python3.11/site-packages/httpx/_models.py", line 758, in raise_for_status
raise HTTPStatusError(message, request=request, response=self)
httpx.HTTPStatusError: Client error '400 Bad Request' for url 'http://gotenberg:3000/forms/libreoffice/convert'
For more information check: https://developer.mozilla.org/en-US/docs/Web/HTTP/Status/400
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/usr/local/lib/python3.11/site-packages/asgiref/sync.py", line 349, in main_wrap
raise exc_info[1]
File "/usr/src/paperless/src/documents/consumer.py", line 446, in try_consume_file
document_parser.parse(self.working_copy, mime_type, self.filename)
File "/usr/src/paperless/src/paperless_tika/parsers.py", line 84, in parse
self.archive_path = self.convert_to_pdf(document_path, file_name)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/usr/src/paperless/src/paperless_tika/parsers.py", line 116, in convert_to_pdf
raise ParseError(
documents.parsers.ParseError: Error while converting document to PDF: Client error '400 Bad Request' for url 'http://gotenberg:3000/forms/libreoffice/convert'
For more information check: https://developer.mozilla.org/en-US/docs/Web/HTTP/Status/400
version: "3.4"
services:
broker:
image: docker.io/library/redis:7
restart: unless-stopped
volumes:
- redisdata:/data
db:
image: docker.io/library/postgres:13
restart: unless-stopped
volumes:
- ../pgdata:/var/lib/postgresql/data
environment:
POSTGRES_DB: paperless
POSTGRES_USER: paperless
POSTGRES_PASSWORD: paperless
webserver:
image: ghcr.io/paperless-ngx/paperless-ngx:latest
restart: unless-stopped
depends_on:
- db
- broker
- gotenberg
- tika
ports:
- 8090:8000
healthcheck:
test: ["CMD", "curl", "-fs", "-S", "--max-time", "2", "http://localhost:8000"]
interval: 30s
timeout: 10s
retries: 5
volumes:
- ../data:/usr/src/paperless/data
- ../media:/usr/src/paperless/media
- ../export:/usr/src/paperless/export
- /volume1/scaninput:/usr/src/paperless/consume
env_file: docker-compose.env
environment:
PAPERLESS_REDIS: redis://broker:6379
PAPERLESS_DBHOST: db
PAPERLESS_TIKA_ENABLED: 1
PAPERLESS_TIKA_GOTENBERG_ENDPOINT: http://gotenberg:3000
PAPERLESS_TIKA_ENDPOINT: http://tika:9998
PAPERLESS_CONSUMER_ENABLE_BARCODES: true
gotenberg:
image: docker.io/gotenberg/gotenberg:latest
restart: unless-stopped
# The gotenberg chromium route is used to convert .eml files. We do not
# want to allow external content like tracking pixels or even javascript.
command:
- "gotenberg"
- "--chromium-disable-javascript=true"
- "--chromium-allow-list=file:///tmp/.*"
tika:
image: ghcr.io/paperless-ngx/tika:latest
restart: unless-stopped
volumes:
redisdata:
groups: cannot find name for group ID 3****
I have no name!@****
INFO [main] 20:47:44,074 org.eclipse.jetty.util.log Logging initialized @3373ms to org.eclipse.jetty.util.log.Slf4jLog
INFO [main] 20:47:44,161 org.eclipse.jetty.server.Server jetty-9.4.51.v20230217; built: 2023-02-17T08:19:37.309Z; git: b45c405e4544384de066f814ed42ae3dceacdd49; jvm 17.0.9+9-Ubuntu-122.04
INFO [main] 20:47:44,225 org.eclipse.jetty.server.AbstractConnector Started ServerConnector@2bc12da{HTTP/1.1, (http/1.1)}{0.0.0.0:9998}
INFO [main] 20:47:44,225 org.eclipse.jetty.server.Server Started @3526ms
WARN [main] 20:47:44,236 org.eclipse.jetty.server.handler.ContextHandler Empty contextPath
INFO [main] 20:47:44,256 org.eclipse.jetty.server.handler.ContextHandler Started o.e.j.s.h.ContextHandler@5f84abe8{/,null,AVAILABLE}
INFO [main] 20:47:44,263 org.apache.tika.server.core.TikaServerProcess Started Apache Tika server f3c1de14-bc5a-4ef0-8440-32584e631cb3 at http://0.0.0.0:9998/
INFO [qtp310212872-22] 20:52:36,519 org.apache.tika.server.core.resource.TikaResource /tika (application/vnd.openxmlformats-officedocument.spreadsheetml.sheet)
...da werde ich meine stundenlangen Versuche es hinzubekommen erst einmal einstellen.Bei mir funktioniert auch aktuell der Import von Office-Dokumenten nicht. Das muss in Zusammenhang mit dem Linux-Kernel auftreten, denn auf einem Linux-Server läuft es einwandfrei
Wenn du das Forum hilfreich findest oder uns unterstützen möchtest, dann gib uns doch einfach einen Kaffee aus.
Als Dankeschön schalten wir deinen Account werbefrei.
Du bist nicht hier, um Support für Adblocker zu erhalten. Dein Adblocker funktioniert bereits ;-)
Klar machen Adblocker einen guten Job, aber sie blockieren auch nützliche Funktionen.
Das Forum wird mit hohem technischen, zeitlichen und finanziellen Aufwand kostenfrei zur Verfügung gestellt. Wir zeigen keine offensive Werbung und bemühen uns um eine dezente Integration.
Bitte unterstütze dieses Forum, in dem du deinen Adblocker für diese Seite deaktivierst.
Du kannst uns auch über unseren Kaffeautomat einen Kaffe ausgeben oder ein PUR Abo abschließen und das Forum so werbefrei nutzen.
Vielen Dank für Deine Unterstützung!