Gotenberg URL = PAPERLESS_TIKA_GOTENBERG_ENDPOINT: http://gotenberg:3000Richte dich nach der Anleitung von Paperless. Das betrifft dieGotenberg URL
,Gotenberg commands
und dasTika Image
. Was du noch gekürzt hast, kann noch falsch sein. Gerade Gotenberg und Tika ist ja für Office notwendig.
gotenberg:
image: gotenberg/gotenberg:latest
restart: unless-stopped
environment:
CHROMIUM_DISABLE_ROUTES: 1
command:
- 'gotenberg'
- '--chromium-disable-javascript=true'
- '--chromium-allow-list=file:///tmp/.*'
- '--api-timeout=60s'
networks:
- internal
PAPERLESS_TIKA_ENABLED: 1
PAPERLESS_TIKA_GOTENBERG_ENDPOINT: http://gotenberg:3000
PAPERLESS_TIKA_ENDPOINT: http://tika:9998
2023/06/23 17:18:30 stdout 1:M 23 Jun 2023 15:18:30.012 * Ready to accept connections
2023/06/23 17:18:30 stdout 1:M 23 Jun 2023 15:18:30.012 * DB loaded from disk: 0.000 seconds
2023/06/23 17:18:30 stdout 1:M 23 Jun 2023 15:18:30.012 # Done loading RDB, keys loaded: 0, keys expired: 0.
2023/06/23 17:18:30 stdout 1:M 23 Jun 2023 15:18:30.012 * RDB memory usage when created 0.77 Mb
2023/06/23 17:18:30 stdout 1:M 23 Jun 2023 15:18:30.012 * RDB age 336 seconds
2023/06/23 17:18:30 stdout 1:M 23 Jun 2023 15:18:30.012 * Loading RDB produced by version 6.2.12
2023/06/23 17:18:30 stdout 1:M 23 Jun 2023 15:18:30.010 # WARNING Memory overcommit must be enabled! Without it, a background save or replication may fail under low memory condition. Being disabled, it can can also cause failures without low memory condition, see https://github.com/jemalloc/jemalloc/issues/1328. To fix this issue add 'vm.overcommit_memory = 1' to /etc/sysctl.conf and then reboot or run the command 'sysctl vm.overcommit_memory=1' for this to take effect.
2023/06/23 17:18:30 stdout 1:M 23 Jun 2023 15:18:30.010 # Server initialized
2023/06/23 17:18:30 stdout 1:M 23 Jun 2023 15:18:30.010 # WARNING: The TCP backlog setting of 511 cannot be enforced because /proc/sys/net/core/somaxconn is set to the lower value of 128.
2023/06/23 17:18:30 stdout
2023/06/23 17:18:30 stdout `-.__.-'
2023/06/23 17:18:30 stdout `-._ _.-'
2023/06/23 17:18:30 stdout `-._ `-.__.-' _.-'
2023/06/23 17:18:30 stdout `-._ `-._`-.__.-'_.-' _.-'
2023/06/23 17:18:30 stdout | `-._`-._ _.-'_.-' |
2023/06/23 17:18:30 stdout |`-._`-._ `-.__.-' _.-'_.-'|
2023/06/23 17:18:30 stdout `-._ `-._`-.__.-'_.-' _.-'
2023/06/23 17:18:30 stdout | `-._`-._ _.-'_.-' | https://redis.io
2023/06/23 17:18:30 stdout |`-._`-._ `-.__.-' _.-'_.-'|
2023/06/23 17:18:30 stdout `-._ `-._ `-./ _.-' _.-'
2023/06/23 17:18:30 stdout | `-._ `._ / _.-' | PID: 1
2023/06/23 17:18:30 stdout |`-._`-...-` __...-.``-._|'` _.-'| Port: 6379
2023/06/23 17:18:30 stdout ( ' , .-` | `, ) Running in standalone mode
2023/06/23 17:18:30 stdout .-`` .-```. ```\/ _.,_ ''-._
2023/06/23 17:18:30 stdout _.-`` `. `_. ''-._ Redis 6.2.12 (00000000/0) 64 bit
2023/06/23 17:18:30 stdout _.-``__ ''-._
2023/06/23 17:18:30 stdout _._
2023/06/23 17:18:30 stdout 1:M 23 Jun 2023 15:18:30.009 * monotonic clock: POSIX clock_gettime
2023/06/23 17:18:30 stdout 1:C 23 Jun 2023 15:18:30.008 # Warning: no config file specified, using the default config. In order to specify a config file use redis-server /path/to/redis.conf
2023/06/23 17:18:30 stdout 1:C 23 Jun 2023 15:18:30.008 # Redis version=6.2.12, bits=64, commit=00000000, modified=0, pid=1, just started
2023/06/23 17:18:30 stdout 1:C 23 Jun 2023 15:18:30.008 # oO0OoO0OoO0Oo Redis is starting oO0OoO0OoO0Oo
2023/06/23 17:19:16 stdout Failed to connect to redis using environment variable PAPERLESS_REDIS.
2023/06/23 17:19:11 stdout Waiting 5s
2023/06/23 17:19:11 stdout Error: Error -5 connecting to broker:6379. No address associated with hostname..
2023/06/23 17:19:11 stdout Redis ping #4 failed.
2023/06/23 17:19:06 stdout Waiting 5s
2023/06/23 17:19:06 stdout Error: Error -5 connecting to broker:6379. No address associated with hostname..
2023/06/23 17:19:06 stdout Redis ping #3 failed.
2023/06/23 17:19:01 stdout Waiting 5s
2023/06/23 17:19:01 stdout Error: Error -5 connecting to broker:6379. No address associated with hostname..
2023/06/23 17:19:01 stdout Redis ping #2 failed.
2023/06/23 17:18:56 stdout Waiting 5s
2023/06/23 17:18:56 stdout Error: Error -5 connecting to broker:6379. No address associated with hostname..
2023/06/23 17:18:56 stdout Redis ping #1 failed.
2023/06/23 17:18:51 stdout Waiting 5s
2023/06/23 17:18:51 stdout Error: Error -5 connecting to broker:6379. No address associated with hostname..
2023/06/23 17:18:51 stdout Redis ping #0 failed.
2023/06/23 17:18:51 stdout Waiting for Redis...
2023/06/23 17:18:42 stdout Waiting for PostgreSQL to start...
2023/06/23 17:18:41 stdout Adjusting permissions of paperless files. This may take a while.
2023/06/23 17:18:41 stdout Creating directory /tmp/paperless
2023/06/23 17:18:41 stdout Package tesseract-ocr-deu already installed!
2023/06/23 17:18:41 stdout Package tesseract-ocr-eng already installed!
2023/06/23 17:18:41 stdout 0% [Working] 0% [Working] 0% [Working] 0% [Working] 20% [Working] Reading package lists... 0% Reading package lists... 0% Reading package lists... 0% Reading package lists... 85% Reading package lists... 98% Reading package lists... 98% Reading package lists... 99% Reading package lists... 99% Reading package lists... Done
2023/06/23 17:18:39 stdout Hit:3 http://deb.debian.org/debian-security bookworm-security InRelease
2023/06/23 17:18:39 stdout 0% [Working] Hit:2 http://deb.debian.org/debian bookworm-updates InRelease
2023/06/23 17:18:39 stdout 0% [Working] Hit:1 http://deb.debian.org/debian bookworm InRelease
2023/06/23 17:18:36 stdout Installing languages...
2023/06/23 17:18:35 stdout Paperless-ngx docker container starting...
{
"CapAdd" : null,
"CapDrop" : null,
"cmd" : "/usr/local/bin/paperless_cmd.sh",
"cpu_priority" : 50,
"enable_publish_all_ports" : false,
"enable_restart_policy" : false,
"enable_service_portal" : null,
"enabled" : true,
"env_variables" : [
{
"key" : "PATH",
"value" : "/usr/local/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin"
},
{
"key" : "LANG",
"value" : "C.UTF-8"
},
{
"key" : "GPG_KEY",
"value" : "XXX"
},
{
"key" : "PYTHON_VERSION",
"value" : "3.9.16"
},
{
"key" : "PYTHON_PIP_VERSION",
"value" : "22.0.4"
},
{
"key" : "PYTHON_SETUPTOOLS_VERSION",
"value" : "58.1.0"
},
{
"key" : "PYTHON_GET_PIP_URL",
"value" : "https://github.com/pypa/get-pip/raw/1a96dc5acd0303c4700e02655aefd3bc68c78958/public/get-pip.py"
},
{
"key" : "PYTHON_GET_PIP_SHA256",
"value" : "XXX"
},
{
"key" : "PAPERLESS_DBHOST",
"value" : "db"
},
{
"key" : "PAPERLESS_SECRET_KEY",
"value" : "XXX"
},
{
"key" : "PAPERLESS_REDIS",
"value" : "redis://broker:6379"
},
{
"key" : "USERMAP_GID",
"value" : "101"
},
{
"key" : "PAPERLESS_OCR_LANGUAGE",
"value" : "deu"
},
{
"key" : "PAPERLESS_URL",
"value" : "https://paperless.XXX.synology.me"
},
{
"key" : "PAPERLESS_OCR_LANGUAGES",
"value" : "eng deu"
},
{
"key" : "USERMAP_UID",
"value" : "1026"
},
{
"key" : "PAPERLESS_ALLOWED_HOSTS",
"value" : "localhost,paperless.XXX.synology.me"
},
{
"key" : "PAPERLESS_TIME_ZONE",
"value" : "Europe/Berlin"
},
{
"key" : "PAPERLESS_FILENAME_FORMAT",
"value" : "{created_year}/{correspondent}/{title}"
},
{
"key" : "PAPERLESS_DBPASS",
"value" : "XXX"
}
],
"exporting" : false,
"id" : "25983fd1f88dda7dcd8a892b00910018d36e0f8b88acdd22568d754d3c32ccda",
"image" : "paperlessngx/paperless-ngx:latest",
"is_ddsm" : false,
"is_package" : false,
"labels" : {
"com.docker.compose.config-hash" : "7e92544a6783cdbae5087c9e54eda5dda537bd30ae3c39059a70bf025ee5177a",
"com.docker.compose.container-number" : "1",
"com.docker.compose.depends_on" : "db:service_started,broker:service_started",
"com.docker.compose.image" : "sha256:1e0fca605df2034d1c2780bb227445531367d0e8324f548521f62ed3625222ac",
"com.docker.compose.oneoff" : "False",
"com.docker.compose.project" : "paperless",
"com.docker.compose.project.config_files" : "/data/compose/3/docker-compose.yml",
"com.docker.compose.project.working_dir" : "/data/compose/3",
"com.docker.compose.service" : "webserver",
"com.docker.compose.version" : "2.10.2",
"org.opencontainers.image.authors" : "paperless-ngx team <hello@paperless-ngx.com>",
"org.opencontainers.image.created" : "2023-02-17T04:19:44.474Z",
"org.opencontainers.image.description" : "A community-supported supercharged version of paperless: scan, index and archive all your physical documents",
"org.opencontainers.image.documentation" : "https://docs.paperless-ngx.com/",
"org.opencontainers.image.licenses" : "GPL-3.0",
"org.opencontainers.image.revision" : "6c658a676e9b5697a408355f964e526b17692ddb",
"org.opencontainers.image.source" : "https://github.com/paperless-ngx/paperless-ngx",
"org.opencontainers.image.title" : "paperless-ngx",
"org.opencontainers.image.url" : "https://github.com/paperless-ngx/paperless-ngx",
"org.opencontainers.image.version" : "1.13.0"
},
"links" : [],
"memory_limit" : 0,
"name" : "paperless",
"network" : [
{
"driver" : "bridge",
"name" : "paperless_internal"
}
],
"network_mode" : "paperless_internal",
"port_bindings" : [
{
"container_port" : 8000,
"host_port" : 8010,
"type" : "tcp"
}
],
"privileged" : false,
"services" : [],
"shortcut" : {
"enable_shortcut" : false,
"enable_status_page" : false,
"enable_web_page" : false,
"web_page_url" : ""
},
"use_host_network" : false,
"version" : 2,
"volume_bindings" : [
{
"host_volume_file" : "/homes/XXX/Paperless-Inbox",
"is_directory" : true,
"mount_point" : "/usr/src/paperless/consume",
"type" : "rw"
},
{
"host_volume_file" : "/docker/paperless/data",
"is_directory" : true,
"mount_point" : "/usr/src/paperless/data",
"type" : "rw"
},
{
"host_volume_file" : "/docker/paperless/media",
"is_directory" : true,
"mount_point" : "/usr/src/paperless/media",
"type" : "rw"
},
{
"host_volume_file" : "/docker/paperless/export",
"is_directory" : true,
"mount_point" : "/usr/src/paperless/export",
"type" : "rw"
}
]
}
{
"CapAdd" : null,
"CapDrop" : null,
"cmd" : "redis-server",
"cpu_priority" : 50,
"enable_publish_all_ports" : false,
"enable_restart_policy" : false,
"enable_service_portal" : null,
"enabled" : true,
"env_variables" : [
{
"key" : "PATH",
"value" : "/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin"
},
{
"key" : "GOSU_VERSION",
"value" : "1.16"
},
{
"key" : "REDIS_VERSION",
"value" : "6.2.11"
},
{
"key" : "REDIS_DOWNLOAD_URL",
"value" : "http://download.redis.io/releases/redis-6.2.11.tar.gz"
},
{
"key" : "REDIS_DOWNLOAD_SHA",
"value" : "8c75fb9cdd01849e92c23f30cb7fe205ea0032a38d11d46af191014e9acc3098"
}
],
"exporting" : false,
"id" : "573f83f91824dcd385aba3d5508121b22a090a0fcaef8972c38e980e45ab5f48",
"image" : "redis:6.2",
"is_ddsm" : false,
"is_package" : false,
"labels" : {
"com.docker.compose.config-hash" : "c421fa498044514bb27c4b26d25fa278d1df83e5da7e47dd68fa9b35e1bb0214",
"com.docker.compose.container-number" : "1",
"com.docker.compose.depends_on" : "",
"com.docker.compose.image" : "sha256:c250e371c3997705ad7f1725e30ccd26bb93b76e4a49be801bc28280c90ef310",
"com.docker.compose.oneoff" : "False",
"com.docker.compose.project" : "paperless",
"com.docker.compose.project.config_files" : "/data/compose/3/docker-compose.yml",
"com.docker.compose.project.working_dir" : "/data/compose/3",
"com.docker.compose.service" : "broker",
"com.docker.compose.version" : "2.10.2"
},
"links" : [],
"memory_limit" : 0,
"name" : "paperless-redis",
"network" : [
{
"driver" : "bridge",
"name" : "paperless_internal"
}
],
"network_mode" : "paperless_internal",
"port_bindings" : [],
"privileged" : false,
"shortcut" : {
"enable_shortcut" : false,
"enable_status_page" : false,
"enable_web_page" : false,
"web_page_url" : ""
},
"use_host_network" : false,
"version" : 2,
"volume_bindings" : [
{
"host_volume_file" : "/docker/paperless/redis",
"is_directory" : true,
"mount_point" : "/redis",
"type" : "rw"
}
]
}
Error while converting email to PDF: Client error '404 Not Found' for url 'http://gotenberg:3000/forms/chromium/convert/html'
For more information check: https://httpstatuses.com/404
PAPERLESS_TIKA_GOTENBERG_ENDPOINT: http://gotenberg:3000
PAPERLESS_TIKA_ENDPOINT: http://tika:9998
PAPERLESS_ENABLE_UPDATE_CHECK: 1
PAPERLESS_CONSUMER_POLLING_RETRY_COUNT: 2
PAPERLESS_WEBSERVER_WORKERS: 2
gotenberg:
image: gotenberg/gotenberg:7.7.2
restart: always
container_name: Paperless-NGX-GOTENBERG
ports:
- 3000:3000
command:
- "gotenberg"
- "--chromium-disable-routes=true"
tika:
image: ghcr.io/paperless-ngx/tika
container_name: Paperless-NGX-TIKA
ports:
- 9998:9998
restart: always
Hast du Parameter innerhalb der Paperless-ngx-Oberfläche gesetzt, beispielsweise, dass die Mails nicht älter sein dürfen als x Tage? Einstellungen → E-Mail-Regel bearbeiten → „Maximales Alter (Tage)“aber keine Nachrichten gefunden
[2024-01-07 16:23:43,963] [WARNING] [paperless.barcodes] Unsupported file format for barcode reader: application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
[2024-01-07 16:23:44,004] [INFO] [paperless.consumer] Consuming 2023-12-07 zu archivierende Dokumente per anno.xlsm
[2024-01-07 16:23:44,009] [DEBUG] [paperless.consumer] Detected mime type: application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
[2024-01-07 16:23:44,425] [DEBUG] [paperless.consumer] Parser: TikaDocumentParser
[2024-01-07 16:23:44,433] [DEBUG] [paperless.consumer] Parsing 2023-12-07 zu archivierende Dokumente per anno.xlsm...
[2024-01-07 16:23:44,434] [INFO] [paperless.parsing.tika] Sending /tmp/paperless/paperless-ngxqsutgg51/2023-12-07 zu archivierende Dokumente per anno.xlsm to Tika server
[2024-01-07 16:23:46,112] [INFO] [paperless.parsing.tika] Converting /tmp/paperless/paperless-ngxqsutgg51/2023-12-07 zu archivierende Dokumente per anno.xlsm to PDF as /tmp/paperless/paperless-ina7whdf/convert.pdf
[2024-01-07 16:23:46,226] [ERROR] [paperless.consumer] Error occurred while consuming document 2023-12-07 zu archivierende Dokumente per anno.xlsm: Error while converting document to PDF: Client error '400 Bad Request' for url 'http://gotenberg:3000/forms/libreoffice/convert'
For more information check: https://developer.mozilla.org/en-US/docs/Web/HTTP/Status/400
Traceback (most recent call last):
File "/usr/src/paperless/src/paperless_tika/parsers.py", line 109, in convert_to_pdf
response = route.run()
^^^^^^^^^^^
File "/usr/local/lib/python3.11/site-packages/gotenberg_client/_base.py", line 75, in run
resp.raise_for_status()
File "/usr/local/lib/python3.11/site-packages/httpx/_models.py", line 758, in raise_for_status
raise HTTPStatusError(message, request=request, response=self)
httpx.HTTPStatusError: Client error '400 Bad Request' for url 'http://gotenberg:3000/forms/libreoffice/convert'
For more information check: https://developer.mozilla.org/en-US/docs/Web/HTTP/Status/400
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/usr/local/lib/python3.11/site-packages/asgiref/sync.py", line 349, in main_wrap
raise exc_info[1]
File "/usr/src/paperless/src/documents/consumer.py", line 446, in try_consume_file
document_parser.parse(self.working_copy, mime_type, self.filename)
File "/usr/src/paperless/src/paperless_tika/parsers.py", line 84, in parse
self.archive_path = self.convert_to_pdf(document_path, file_name)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/usr/src/paperless/src/paperless_tika/parsers.py", line 116, in convert_to_pdf
raise ParseError(
documents.parsers.ParseError: Error while converting document to PDF: Client error '400 Bad Request' for url 'http://gotenberg:3000/forms/libreoffice/convert'
For more information check: https://developer.mozilla.org/en-US/docs/Web/HTTP/Status/400
version: "3.4"
services:
broker:
image: docker.io/library/redis:7
restart: unless-stopped
volumes:
- redisdata:/data
db:
image: docker.io/library/postgres:13
restart: unless-stopped
volumes:
- ../pgdata:/var/lib/postgresql/data
environment:
POSTGRES_DB: paperless
POSTGRES_USER: paperless
POSTGRES_PASSWORD: paperless
webserver:
image: ghcr.io/paperless-ngx/paperless-ngx:latest
restart: unless-stopped
depends_on:
- db
- broker
- gotenberg
- tika
ports:
- 8090:8000
healthcheck:
test: ["CMD", "curl", "-fs", "-S", "--max-time", "2", "http://localhost:8000"]
interval: 30s
timeout: 10s
retries: 5
volumes:
- ../data:/usr/src/paperless/data
- ../media:/usr/src/paperless/media
- ../export:/usr/src/paperless/export
- /volume1/scaninput:/usr/src/paperless/consume
env_file: docker-compose.env
environment:
PAPERLESS_REDIS: redis://broker:6379
PAPERLESS_DBHOST: db
PAPERLESS_TIKA_ENABLED: 1
PAPERLESS_TIKA_GOTENBERG_ENDPOINT: http://gotenberg:3000
PAPERLESS_TIKA_ENDPOINT: http://tika:9998
PAPERLESS_CONSUMER_ENABLE_BARCODES: true
gotenberg:
image: docker.io/gotenberg/gotenberg:latest
restart: unless-stopped
# The gotenberg chromium route is used to convert .eml files. We do not
# want to allow external content like tracking pixels or even javascript.
command:
- "gotenberg"
- "--chromium-disable-javascript=true"
- "--chromium-allow-list=file:///tmp/.*"
tika:
image: ghcr.io/paperless-ngx/tika:latest
restart: unless-stopped
volumes:
redisdata:
groups: cannot find name for group ID 3****
I have no name!@****
INFO [main] 20:47:44,074 org.eclipse.jetty.util.log Logging initialized @3373ms to org.eclipse.jetty.util.log.Slf4jLog
INFO [main] 20:47:44,161 org.eclipse.jetty.server.Server jetty-9.4.51.v20230217; built: 2023-02-17T08:19:37.309Z; git: b45c405e4544384de066f814ed42ae3dceacdd49; jvm 17.0.9+9-Ubuntu-122.04
INFO [main] 20:47:44,225 org.eclipse.jetty.server.AbstractConnector Started ServerConnector@2bc12da{HTTP/1.1, (http/1.1)}{0.0.0.0:9998}
INFO [main] 20:47:44,225 org.eclipse.jetty.server.Server Started @3526ms
WARN [main] 20:47:44,236 org.eclipse.jetty.server.handler.ContextHandler Empty contextPath
INFO [main] 20:47:44,256 org.eclipse.jetty.server.handler.ContextHandler Started o.e.j.s.h.ContextHandler@5f84abe8{/,null,AVAILABLE}
INFO [main] 20:47:44,263 org.apache.tika.server.core.TikaServerProcess Started Apache Tika server f3c1de14-bc5a-4ef0-8440-32584e631cb3 at http://0.0.0.0:9998/
INFO [qtp310212872-22] 20:52:36,519 org.apache.tika.server.core.resource.TikaResource /tika (application/vnd.openxmlformats-officedocument.spreadsheetml.sheet)
...da werde ich meine stundenlangen Versuche es hinzubekommen erst einmal einstellen.Bei mir funktioniert auch aktuell der Import von Office-Dokumenten nicht. Das muss in Zusammenhang mit dem Linux-Kernel auftreten, denn auf einem Linux-Server läuft es einwandfrei
Wenn du das Forum hilfreich findest oder uns unterstützen möchtest, dann gib uns doch einfach einen Kaffee aus.
Als Dankeschön schalten wir deinen Account werbefrei.