[BUG] High search latency from ML encode-text and postgres #1042

Closed
opened 2026-02-05 00:10:44 +03:00 by OVERLORD · 3 comments

Originally created by @pl4nty on GitHub (Jul 2, 2023).

### The bug

When doing non-metadata searches in the web client, I'm seeing high average latencies of 1.5s, and some requests over 5s. I don't have enough traffic to compute a P90, though. Is this expected? Metadata searches are much faster.

Here's some tracing from a particularly slow request for `/search?q=test&clip=false`. I've been able to replicate these latencies with the demo instance too.

![image](https://github.com/immich-app/immich/assets/21111317/addc852f-dc29-47fb-a36a-c62199749bfa)

[Trace-91e851-2023-07-02 16 27 47.json.txt](https://github.com/immich-app/immich/files/11928835/Trace-91e851-2023-07-02.16.27.47.json.txt)

### The OS that Immich Server is running on

Official containers on an Oracle Linux 7 Kubernetes node with 4 arm64 cores and 24 GB RAM

### Version of Immich Server

v1.65.0

### Version of Immich Mobile App

N/A

### Platform with the issue

- [ ] Server
- [x] Web
- [ ] Mobile

### Your docker-compose.yml content

Official Helm chart: https://github.com/immich-app/immich-charts/tree/main/charts/immich
My Helm values: https://github.com/pl4nty/lab-infra/blob/main/kubernetes/oke/immich/immich.yaml

### Your .env content

N/A

### Reproduction steps

1. Search for text in the web client, e.g. `test` (a rough timing sketch follows below)
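
As a rough way to quantify the latency, here is a minimal timing sketch. The `/api/search` path, base URL, and `x-api-key` header are assumptions for illustration; adjust them to match your instance.

```python
# Rough timing loop for the search endpoint described above.
# BASE_URL, the /api/search path, and the x-api-key header are assumptions;
# adjust them to match your deployment.
import time

import requests

BASE_URL = "http://localhost:2283"  # assumed server address
API_KEY = "<your-api-key>"          # assumed auth method for scripted requests

def timed_search(query: str, clip: bool = False) -> float:
    """Return the wall-clock duration of one search request, in seconds."""
    start = time.perf_counter()
    resp = requests.get(
        f"{BASE_URL}/api/search",
        params={"q": query, "clip": str(clip).lower()},
        headers={"x-api-key": API_KEY},
        timeout=30,
    )
    resp.raise_for_status()
    return time.perf_counter() - start

if __name__ == "__main__":
    # The first call pays any model cold-start cost; later calls show steady state.
    for i in range(5):
        print(f"request {i}: {timed_search('test'):.2f}s")
```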

Additional information

All Immich containers except `proxy` are injected with OpenTelemetry autoinstrumentation. Bitnami Redis, official Typesense, and cloudnative-pg Postgres are used.

@mertalev commented on GitHub (Jul 2, 2023):

The first request will typically be slower than subsequent requests, since models are unloaded after idling for 300s. Beyond that, the bulk of the latency comes down to inference speed, i.e. CPU performance.

It's also normal for metadata searches to be much faster, since there's no live inference taking place: all of the tags have already been generated and indexed ahead of time.
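
For illustration, a minimal sketch of this idle-unload pattern, assuming a `load_fn` that loads the model and a 300s TTL; the class and names here are hypothetical, not Immich's actual implementation.

```python
# Minimal sketch of an idle-timeout model cache; names are hypothetical.
import threading
import time

class IdleUnloadingModel:
    def __init__(self, load_fn, idle_ttl: float = 300.0):
        self._load_fn = load_fn  # expensive: loads weights into memory
        self._idle_ttl = idle_ttl
        self._model = None
        self._last_used = 0.0
        self._lock = threading.Lock()

    def predict(self, inputs):
        with self._lock:
            if self._model is None:
                # Cold start: this reload is why the first request is slow.
                self._model = self._load_fn()
            self._last_used = time.monotonic()
            model = self._model
        return model(inputs)

    def maybe_unload(self):
        # Called periodically; frees the model after idle_ttl of inactivity.
        with self._lock:
            if self._model is not None and time.monotonic() - self._last_used > self._idle_ttl:
                self._model = None
```

A background thread (or the request path itself) would call `maybe_unload()` periodically; the trade-off is memory freed during idle periods versus a cold-start penalty on the next request.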

@pl4nty commented on GitHub (Jul 2, 2023):

Thanks. Would CPU architecture make a difference even though it's Python? The underlying cores are ARM, from a 3 GHz Ampere A1. My metrics aren't precise enough to catch a CPU usage spike, but I'll do some further testing.

I noticed `/encode-text` is included in #2574, so I'll look forward to testing with a GPU when it releases :)


@mertalev commented on GitHub (Jul 2, 2023):

ARM could have something to do with it, but it also seems this CPU isn't very powerful, going by [this](https://www.storagereview.com/review/oci-ampere-a1-compute-review).

CUDA support will be for all ML endpoints and models. Stay tuned :)

If you want to do more quantitative testing, you can use [Locust](https://locust.io) by following these steps (a minimal locustfile sketch follows the list):

  1. Clone the repo
  2. Navigate to the `machine-learning` folder
  3. Follow the instructions to install Poetry and dependencies
  4. Run `locust --host <HOST> --web-host localhost`, setting the host to the ML instance
  5. Open the web UI shown and begin swarming
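
A minimal locustfile along these lines; the `/encode-text` path comes from this thread, while the JSON body shape is an assumption, so check it against the repo's actual locustfile.

```python
# locustfile.py -- minimal sketch for load-testing the ML text encoder.
# The /encode-text path is from this thread; the request body shape is an
# assumption and may not match the actual API.
from locust import HttpUser, between, task

class EncodeTextUser(HttpUser):
    wait_time = between(1, 2)  # think time between tasks, in seconds

    @task
    def encode_text(self):
        self.client.post("/encode-text", json={"text": "test"})
```

Run it with e.g. `locust -f locustfile.py --host <HOST> --web-host localhost`, then open the web UI Locust prints (http://localhost:8089 by default) and start a swarm.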
Reference: immich-app/immich#1042