[PR #23392] fix(ml): retry OCR OrtSession with remaining providers #17523

New Issue

OVERLORD · 2026-02-05T16:23:19+03:00

OVERLORD commented

2026-02-05 16:23:19 +03:00

📋 Pull Request Information

Original PR: https://github.com/immich-app/immich/pull/23392
Author: @apetersson
Created: 10/31/2025
Status: 🔄 Open

Base: main ← Head: allow_ocr_fallback_to_cpu_coreml

📝 Commits (5)

021ba6e fix(ml): retry OCR OrtSession with remaining providers
f81654b fix(ml): test for retry OCR OrtSession with remaining providers
e13b3e6 fix(machine-learning): stabilize ORT fallback across models
03112cf fix(ml): clarify coreml fallback logging and mock providers in tests
bbb9be6 fix(ml): pass mypy with explicit NumPy arrays

📊 Changes

8 files changed (+548 additions, -41 deletions)

View changed files

📝 machine-learning/immich_ml/models/clip/textual.py (+31 -0)
📝 machine-learning/immich_ml/models/clip/visual.py (+28 -0)
📝 machine-learning/immich_ml/models/facial_recognition/detection.py (+15 -4)
📝 machine-learning/immich_ml/models/facial_recognition/recognition.py (+22 -6)
📝 machine-learning/immich_ml/models/ocr/detection.py (+21 -10)
📝 machine-learning/immich_ml/models/ocr/recognition.py (+20 -8)
📝 machine-learning/immich_ml/sessions/ort.py (+77 -11)
📝 machine-learning/test_main.py (+334 -2)

📄 Description

Description

Extend the OCR detection and recognition pipelines so that when the leading ONNX Runtime provider (e.g., CoreML) throws an ONNXRuntimeError, we drop just that provider, rebuild the OrtSession with the remaining providers in the existing preference order, and retry the inference. This keeps OCR tasks alive on CPUs while still attempting faster providers first.
Add regression coverage to prove the behavior by stubbing RapidOCR to fail on the first call and verifying we retry with ["CPUExecutionProvider"] and still return results.

Fixes #23391
https://github.com/immich-app/immich/issues/23391

How Has This Been Tested?

My workstation (Apple Silicon, macOS 15):

UV_CACHE_DIR=.uv-cache UV_HTTP_TIMEOUT=120 uv run --extra cpu --group dev pytest test_main.py::TestOcrFallback -q

Screenshots (if appropriate)

Checklist:

I have performed a self-review of my own code
I have made corresponding changes to the documentation if applicable
I have no unrelated changes in the PR.
I have confirmed that any new dependencies are strictly necessary.
I have written tests for new code (if applicable)
I have followed naming conventions/patterns in the surrounding code
All code in src/services/ uses repositories implementations for database calls, filesystem operations, etc.
All code in src/repositories/ is pretty basic/simple and does not have any immich specific logic (that belongs in src/services/)

Please describe to which degree, if any, an LLM was used in creating this pull request.

LLM (ChatGPT/GPT-5) assisted with brainstorming the fallback approach and drafting this description; all code and tests were written and validated hybrid with codex and by hand..

_{🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.}

## 📋 Pull Request Information **Original PR:** https://github.com/immich-app/immich/pull/23392 **Author:** [@apetersson](https://github.com/apetersson) **Created:** 10/31/2025 **Status:** 🔄 Open **Base:** `main` ← **Head:** `allow_ocr_fallback_to_cpu_coreml` --- ### 📝 Commits (5) - [`021ba6e`](https://github.com/immich-app/immich/commit/021ba6e7a7fa9fc19f4d2a45b4129ffc89ef09a8) fix(ml): retry OCR OrtSession with remaining providers - [`f81654b`](https://github.com/immich-app/immich/commit/f81654b058233550574da078a4d41630c411b421) fix(ml): test for retry OCR OrtSession with remaining providers - [`e13b3e6`](https://github.com/immich-app/immich/commit/e13b3e6a37f5f631f401bbfe447ec4f00c5cf4e9) fix(machine-learning): stabilize ORT fallback across models - [`03112cf`](https://github.com/immich-app/immich/commit/03112cf7e16dc76438b87b7d3e007669a6e48206) fix(ml): clarify coreml fallback logging and mock providers in tests - [`bbb9be6`](https://github.com/immich-app/immich/commit/bbb9be6cbc24daacd339edd67bae638edbe6e979) fix(ml): pass mypy with explicit NumPy arrays ### 📊 Changes **8 files changed** (+548 additions, -41 deletions) <details> <summary>View changed files</summary> 📝 `machine-learning/immich_ml/models/clip/textual.py` (+31 -0) 📝 `machine-learning/immich_ml/models/clip/visual.py` (+28 -0) 📝 `machine-learning/immich_ml/models/facial_recognition/detection.py` (+15 -4) 📝 `machine-learning/immich_ml/models/facial_recognition/recognition.py` (+22 -6) 📝 `machine-learning/immich_ml/models/ocr/detection.py` (+21 -10) 📝 `machine-learning/immich_ml/models/ocr/recognition.py` (+20 -8) 📝 `machine-learning/immich_ml/sessions/ort.py` (+77 -11) 📝 `machine-learning/test_main.py` (+334 -2) </details> ### 📄 Description ## Description - Extend the OCR detection and recognition pipelines so that when the leading ONNX Runtime provider (e.g., CoreML) throws an ONNXRuntimeError, we drop just that provider, rebuild the OrtSession with the remaining providers in the existing preference order, and retry the inference. This keeps OCR tasks alive on CPUs while still attempting faster providers first. - Add regression coverage to prove the behavior by stubbing RapidOCR to fail on the first call and verifying we retry with ["CPUExecutionProvider"] and still return results. Fixes #23391 https://github.com/immich-app/immich/issues/23391 ## How Has This Been Tested? My workstation (Apple Silicon, macOS 15): - [x] UV_CACHE_DIR=.uv-cache UV_HTTP_TIMEOUT=120 uv run --extra cpu --group dev pytest test_main.py::TestOcrFallback -q <details><summary><h2>Screenshots (if appropriate)</h2></summary>  </details> ## Checklist: - [x] I have performed a self-review of my own code - [ ] I have made corresponding changes to the documentation if applicable - [x] I have no unrelated changes in the PR. - [x] I have confirmed that any new dependencies are strictly necessary. - [x] I have written tests for new code (if applicable) - [x] I have followed naming conventions/patterns in the surrounding code - [ ] All code in src/services/ uses repositories implementations for database calls, filesystem operations, etc. - [ ] All code in src/repositories/ is pretty basic/simple and does not have any immich specific logic (that belongs in src/services/) ## Please describe to which degree, if any, an LLM was used in creating this pull request. LLM (ChatGPT/GPT-5) assisted with brainstorming the fallback approach and drafting this description; all code and tests were written and validated hybrid with codex and by hand.. --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>

OVERLORD added the pull-request label 2026-02-05 16:23:19 +03:00

Sign in to join this conversation.

Branches Tags

main

renovate/npm-svelte-vulnerability

release/next

chore/translations

feat/notification

refactor/zod-migration

csp-policy

uhthomas/fix-mobile-video-state

feat/library-offline-stats

fix/top-bar-z-search

fix/video-zooming

feat/checksum-algorithm-indicator

feat/library-offline-count

uhthomas/feat-mobile-search-results

uhthomas/fix-mobile-hero-height

fix/bring-back-globalkeys

fix/map-webgl-error

visual-review/pr-26535

claude/auto-screenshot-web-changes-Y7efI

feat/mobile-ocr

feat/custom-date-range

fix/mobile-video-aspect-ratio

push-vxwxqoulmxun

push-zlzxxyywnmtr

push-mvnsqpxklmnu

push-ztrmyrpuwvow

push-pvvtwywwqzvy

fix/ml-ocr-batch-size

push-okmnxsumoyzr

push-lvyturrtwkrq

feat/mobile-edit-3-mobile-sync-handling

push-rsywxvptwxuv

push-snrprxmlposz

fix/timeline-rtl

feat/integrity-checks-izzy

uhthomas/fix-mobile-search-results

renovate/flutter

update-pwa

uhthomas/feat-sort-smart-search

renovate/github-cqlabs-homebrew-dcm-1.x

chore/deduplicate-storage-template-example

fix/maintenance-reload

feat/video-player

feat/mobile-editing

feat/use-native-clients

refactor/remove-replace-with-upload

uhthomas/chore-mobile-maplibre

uhthomas/mobile-fix-asset-details-album-pop

feat/crawl-wrapper

feat/open-in-browser

push-skvzqoozqkpl

feat/edit-filters

fix/locale-settings-desc

push-xyozownmuwqp

postgres-socketio

feat/pg-queue

proposal/zod

refactor/asset-upload

renovate/connectivity_plus-7.x

better-project-structure

uhthomas/mobile-feat-asset-viewer-details

fix/ml-rocm-build

fix/25803

feat/asset-file-apis

midzelis/wip

push-zpwsovysllvn

push-nwxlpmyzkyrl

feature/bottom-buttons-order

sqlite_thumbs

fix-keep-correct-ios-shared-album-asset

fix-memory-generation-and-display

push-vpxwmwwxwnvw

fix-migration-width-height

revert/prettier-translations

shared-deep-link-handler

feat/thumbnail-native-clients

feat/platform-clients

fix/foreground-cloud-sync

filter-by-person

feat/csp

refactor/sidebar

fix/disable-editing

fix/view-timeline-deeplink

image-zoom-on-slow-connection

fix/merged-edited-assets

open-api-fix

feat/create-job-with-dto

use-toast-primary

feat/vitest-4

feat/ios-fastlane-match

match-signing

fix-update-time-update-timeline

feat/modal-routes

feat/panorama-tiles

feature/mobile-view-asset-owner

feat/system-settings

feature/show-activity-count

better-info-in-asset-viewer

fix/all-people-count

feat/location-favorites

feature/rearrange-buttons-2

fix/download-storage-template

feat/kb-shortcuts-mobile

fix/people-count

push-qolzzzzxrvvn

chore/originals-in-asset-files

feat/asset-size-columns

ben/tree-a11y

new-search-filter-ui

refactor/expectSelectedReadonly

refactor/mobile-grdb

push-qvuktpxmkknu

feat/mobile-native-local-sync

refactor/timeline_ops

fix/scrubber_end

feat/version.txt

feat/context-menus

feat/server-chunked-uploads

refactor/virtualsegment

refactor/rename_daymonth_groups

fix/restrict-android-bg-worker

feat/android-periodic-worker

fix-remote-sync-clean-up

refactor/timeline_move_ops

fix/timeline_split_selectable

feat/keyboard_actions_help_modal

feat/static_frontend

feat/notification-warnign-android

feat/plugins2

feat/plugins

test/create-workflow-token-action

fix/docs-force

debug/search-result-similarity

debug/cf-chunked-uploads

feat/eslint_rule

feat/search-filter-album/web

refactor/timeline_photostream

refactor/timelineasset_asset

feat/session-permissions

feat/timeline_photostream_assetnav

feat/timeline_minor_optimize

feat/timeline_perf_nocomp

feat/timeline_search_results_actions

feat/timeline_search_results_page

fix/timeline_padding

fix/timeline_search_reactivity_warnings

feat/timeline_scrollbar

feat/timeline_stream_withviewer

fix/timeline_back_forth_nav

refactor/timeline_photostream_component

fix/generated-files-checks

fix/locate-button-local

chore/base-image-mimalloc

refactor/timeline_assetlayout

refactor/timeline_selectable

refactor/timeline_aware_actions

refactor/timeline_monthsegment

feat/remove-old-pages

chore/deps-gradle

tmp_photostream

tmp/lcms

feat/mobile-dynamic-thumbnails

fix/mobile-finer-thumbnail-concurrency

refactor/timeline1

refactor/extract_photostream

refactor/rename_load_api

refactor/timeline2

refactor/timeline3

feat/multi-select-asset-viewer

feat-no-thumbhash-cache

refactor/asset_grid

feat/faster-access-checks

fix/18991

fix/19543

chore/temp-remove

fix/21419

feat/mobile-hdr-images

chore/update-mise-lockfile

feat/mise-server-checks

feat/mise-ci

feat/windows-2025

feat/dev_cli

refactor/mobile-migrate-clients

fix/map-theme

fix/require-checkbox

chore/use_swc

feat/efficient-thumbnail-decoding

refactor/mobile-thumbhash

refactor/mobile-thumbhash-new

feat/beta-background-upload

fix/beta-timeline-memories-setting

fix/failed-uploads-not-removed

feat/mobile-shared-album

feat/groups

drift-map-page

drift-auth-user-sync

fix/disable-memory

feat/add-to-album-action

edit-date-time-action

drift-people-page

sqlite-remove-isIn

chore/required-reviewers

refact/asset-manager

fix/folder-sort

pnpm

feat/widget-multiple-server-urls

chore/medium-tests-dbname

fix/web-no-iterator-find

fix/map-pan-interruption

track-livephotos

timeline_events

chore/oxlint-migration

feat/maintenance-worker

feat/dav

chore/demo-snapshot

refactor/server-side-dedupe

feat/integrity-checks

dev/recognition-eval

lighter_buckets_test

perf/postgres-queue

postgres-queue

focus_rings

refactor/web-stores-1

refactor/add-to-taken

feat/sort-places

vet

tmp/demo-snapshot-preview

fix/server-migration-file-extension

fix/asset-update-race-condition

rknn-toolkit-lite2

refactor/mobile-split-up-search-page

feature/Add-rocm-support-for-machine-learning

feat/rocm

chore/async-hash-file

feat/shared-link-view-count

feat/rotation

feat/graphql

feat/job-ids

feat/ignore-library-permission-error

feat/docker-compose-builder

feat/kysely-typeorm

mobile/onboarding

no-video-player

fix/server-qsv-output-format

chore/server-geodata-tweaks

mobile/native-video-player-no-hero

feat/xxhash

fix/docs-concurrency

feat/local-tileserver

refactor/exif-orientation

original-path-infix

refactor/mobile/login-form-1

feat/server-editor-endpoints

fix/server-qsv-vbr

fix-mobile-db-problems

feat/ml-armnn-conversion

feat/mobile/backup-with-album-info

feat/fast-initial-sync-1

chore/handle-output_dims

feat/unassign-faces

feat/shortcuts-on-asset-grid

feat/capacitor-mobile-app-poc

feat/server-nvenc-hw-decoding

fix/mobile-fetch-non-archive

web/automation-ui

feat/mobile-server-endpoint-save-dropdown

object-storage

feat/memories-animations

dev/metrics

ml/tflite

feat/ml-export-cli

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: immich-app/immich#17523