[Feature]: OCR #242

New Issue

OVERLORD · 2026-02-04T18:58:59+03:00

OVERLORD commented

2026-02-04 18:58:59 +03:00

Originally created by @akoyaxd on GitHub (Sep 2, 2022).

Feature detail

Additionally to object detection it would be awesome to have the images ocr'ed to search for Text inside the images and added to the metadata.

Platform

Server

Originally created by @akoyaxd on GitHub (Sep 2, 2022). ### Feature detail Additionally to object detection it would be awesome to have the images ocr'ed to search for Text inside the images and added to the metadata. ### Platform Server

OVERLORD added the nice to have label 2026-02-04 18:58:59 +03:00

OVERLORD closed this issue

2026-02-04 18:59:07 +03:00

OVERLORD commented

2026-02-04 18:59:18 +03:00

@palitu commented on GitHub (Sep 13, 2022):

is this something that can be completed by a webhook into an eco-system of ML containers?

Ie, on upload, a webhook is triggered, which is registered by one or more individual ML containers to do their thing, OCR, face detection, object detection. Whatever is actually wanted/needed by the individual.

@palitu commented on GitHub (Sep 13, 2022): is this something that can be completed by a webhook into an eco-system of ML containers? Ie, on upload, a webhook is triggered, which is registered by one or more individual ML containers to do their thing, OCR, face detection, object detection. Whatever is actually wanted/needed by the individual.

OVERLORD commented

2026-02-04 18:59:22 +03:00

@alextran1502 commented on GitHub (Dec 23, 2022):

This is nice but out of scope of the project

@alextran1502 commented on GitHub (Dec 23, 2022): This is nice but out of scope of the project

OVERLORD commented

2026-02-04 18:59:46 +03:00

@jasongwq commented on GitHub (Dec 28, 2022):

I am using PaddleOCR to implement ocr and support retrieval on the app

@jasongwq commented on GitHub (Dec 28, 2022): I am using [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR/blob/release/2.6/README.md) to implement ocr and support retrieval on the app

OVERLORD commented

2026-02-04 18:59:57 +03:00

@eagle470 commented on GitHub (Oct 13, 2023):

I am using PaddleOCR to implement ocr and support retrieval on the app

How?

@eagle470 commented on GitHub (Oct 13, 2023): > I am using [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR/blob/release/2.6/README.md) to implement ocr and support retrieval on the app How?

OVERLORD commented

2026-02-04 19:00:11 +03:00

@vb0 commented on GitHub (May 24, 2024):

Approaching this from a different angle: Google Photos android app saves locally¹ a fairly complete (and GB-large² for any sizeable number of assets) gphotos0.db which is a sqlite3 db with a lot of metadata for (all the) Google Photos assets from the account. There is a lot of data there, including of course the OCRed strings. If we had an endpoint, or a simple no matter how hackish workflow to ingest this into Immich it'll mean a lot for power users coming from Google Photos.

¹ albeit you'd generally need root to grab it, or just some Android emulator with enough stuff on it so you can install Google Photos, log in and let it sync the db, and then open the local disk and access it some way
² this is what you see as GBs taken by Google Photos even if you don't have anything locally, but many pictures online

@vb0 commented on GitHub (May 24, 2024): Approaching this from a different angle: Google Photos android app saves locally1 a fairly complete (and GB-large2 for any sizeable number of assets) gphotos0.db which is a sqlite3 db with a lot of metadata for (all the) Google Photos assets from the account. There is a lot of data there, including of course the OCRed strings. If we had an endpoint, or a simple no matter how hackish workflow to ingest this into Immich it'll mean a lot for power users coming from Google Photos. 1 albeit you'd generally need root to grab it, or just some Android emulator with enough stuff on it so you can install Google Photos, log in and let it sync the db, and then open the local disk and access it some way 2 this is what you see as GBs taken by Google Photos even if you don't have anything locally, but many pictures online

OVERLORD commented

2026-02-04 19:00:18 +03:00

@kingp0dd commented on GitHub (Nov 9, 2024):

+1
Really enjoyed this feature in Google photos

@kingp0dd commented on GitHub (Nov 9, 2024): +1 Really enjoyed this feature in Google photos

OVERLORD commented

2026-02-04 19:00:40 +03:00

@banjuer commented on GitHub (May 17, 2025):

I am using PaddleOCR to implement ocr and support retrieval on the app

hi, Could you make a tutorial? thanks a lot

@banjuer commented on GitHub (May 17, 2025): > I am using [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR/blob/release/2.6/README.md) to implement ocr and support retrieval on the app hi, Could you make a tutorial? thanks a lot

OVERLORD commented

2026-02-04 19:00:45 +03:00

@ragsmaroon commented on GitHub (Jun 14, 2025):

I'm really surprised that this isn't in the scope of the product, especially because something like memories are an unnecessary feature but I assume they were implemented to have parity with Google Photos. Meanwhile, searching screenshots for text is a feature has a lot of utility and one that I used quite frequently in gphotos, and its absence here really neuters Immich in comparison.

@ragsmaroon commented on GitHub (Jun 14, 2025): I'm really surprised that this isn't in the scope of the product, especially because something like memories are an unnecessary feature but I assume they were implemented to have parity with Google Photos. Meanwhile, searching screenshots for text is a feature has a lot of utility and one that I used quite frequently in gphotos, and its absence here really neuters Immich in comparison.

OVERLORD commented

2026-02-04 19:00:55 +03:00

@devkamiki commented on GitHub (Oct 17, 2025):

ente has ocr that also depends on machine learning, the model runs on the device locally instead of server iirc

@devkamiki commented on GitHub (Oct 17, 2025): ente has ocr that also depends on machine learning, the model runs on the device locally instead of server iirc

Sign in to join this conversation.

Branches Tags

main

feat/asset-file-apis

chore/translations

fix/web-switch-label-clickable

fix/web-people-hidden-state

renovate/typescript-projects

release/next

fix/timezones

fix/time-zone-upserts

midzelis/wip

push-zpwsovysllvn

push-nwxlpmyzkyrl

push-nvnkszuqwppm

renovate/github-actions

push-smstsuupsowp

refactor/adaptive_image

push-olwpzvrxnomt

push-lmxsupnmxspl

renovate/machine-learning

feat/web-chromecast-video-looping

feat/use-native-clients

renovate/flutter

fix/create-face-edited

fix/mobile-ios-mtls

docs/contributing

docs/mise-mobile

renovate/grafana-monorepo

feature/bottom-buttons-order

feat/immich-mobile-ui-showcase

refactor/consolidate-image-requests

renovate/connectivity_plus-7.x

renovate/major-vitest-monorepo

renovate/pypi-python-multipart-vulnerability

fix/mobile-people-query

sqlite_thumbs

feat/html-text

chore/no-macro-validation

refactor/purchase-store

uhthomas/mobile-fix-app-bar-fade

uhthomas/mobile-fix-asset-jump

feat/pano-ocr

feat/shared-link-login

fix/database-backup-db-names

fix-keep-correct-ios-shared-album-asset

fix-memory-generation-and-display

feat/verify-permissions

refactor/album-service-small-tests

fix/ml-rocm-build

fix/flipped-dimensions-mobile

push-vpxwmwwxwnvw

fix-migration-width-height

refactor/more-queries

revert/prettier-translations

refactor/asset-service-queries

fix/locale-settings-desc

chore/add-debug-log

feat/edit-filters

shared-deep-link-handler

feat/mobile-editing

feat/thumbnail-native-clients

feat/platform-clients

feat/integrity-checks-izzy

fix/foreground-cloud-sync

feat/dynamic-layout

filter-by-person

feat/csp

refactor/sidebar

fix/disable-editing

fix/view-timeline-deeplink

image-zoom-on-slow-connection

fix-consider-dar-for-video-dimension

fix/merged-edited-assets

perf/optimize-album-sort

open-api-fix

feat/create-job-with-dto

use-toast-primary

feat/vitest-4

feat/ios-fastlane-match

match-signing

fix-update-time-update-timeline

chore/translation-keys

feat/modal-routes

feat/panorama-tiles

feature/mobile-view-asset-owner

feat/system-settings

feature/show-activity-count

better-info-in-asset-viewer

fix/all-people-count

feat/location-favorites

feature/rearrange-buttons-2

fix/download-storage-template

feat/kb-shortcuts-mobile

fix/people-count

push-qolzzzzxrvvn

chore/originals-in-asset-files

feat/asset-size-columns

ben/tree-a11y

new-search-filter-ui

refactor/expectSelectedReadonly

refactor/mobile-grdb

push-qvuktpxmkknu

feat/mobile-native-local-sync

refactor/timeline_ops

fix/scrubber_end

feat/version.txt

feat/context-menus

feat/server-chunked-uploads

refactor/virtualsegment

refactor/rename_daymonth_groups

fix/restrict-android-bg-worker

feat/android-periodic-worker

fix-remote-sync-clean-up

refactor/timeline_move_ops

renovate/mapbox-mapbox-gl-rtl-text-0.x

fix/timeline_split_selectable

feat/keyboard_actions_help_modal

feat/static_frontend

feat/notification-warnign-android

feat/plugins2

feat/plugins

test/create-workflow-token-action

fix/docs-force

debug/search-result-similarity

debug/cf-chunked-uploads

feat/eslint_rule

feat/search-filter-album/web

refactor/timeline_photostream

refactor/timelineasset_asset

feat/session-permissions

feat/timeline_photostream_assetnav

feat/timeline_minor_optimize

feat/timeline_perf_nocomp

feat/timeline_search_results_actions

feat/timeline_search_results_page

fix/timeline_padding

fix/timeline_search_reactivity_warnings

feat/timeline_scrollbar

feat/timeline_stream_withviewer

fix/timeline_back_forth_nav

refactor/timeline_photostream_component

fix/generated-files-checks

fix/locate-button-local

chore/base-image-mimalloc

refactor/timeline_assetlayout

refactor/timeline_selectable

refactor/timeline_aware_actions

refactor/timeline_monthsegment

feat/remove-old-pages

chore/deps-gradle

tmp_photostream

tmp/lcms

feat/mobile-dynamic-thumbnails

fix/mobile-finer-thumbnail-concurrency

refactor/timeline1

refactor/extract_photostream

refactor/rename_load_api

refactor/timeline2

refactor/timeline3

feat/multi-select-asset-viewer

feat-no-thumbhash-cache

refactor/asset_grid

feat/faster-access-checks

fix/18991

fix/19543

chore/temp-remove

fix/21419

feat/mobile-hdr-images

chore/update-mise-lockfile

feat/mise-server-checks

feat/mise-ci

feat/windows-2025

feat/dev_cli

refactor/mobile-migrate-clients

fix/map-theme

fix/require-checkbox

chore/use_swc

feat/efficient-thumbnail-decoding

refactor/mobile-thumbhash

refactor/mobile-thumbhash-new

fix/mobile-uncached-zoom

feat/beta-background-upload

fix/beta-timeline-memories-setting

fix/failed-uploads-not-removed

feat/mobile-shared-album

feat/groups

drift-map-page

drift-auth-user-sync

fix/disable-memory

feat/add-to-album-action

edit-date-time-action

drift-people-page

sqlite-remove-isIn

feat/inline-storage-columns

chore/required-reviewers

refact/asset-manager

fix/folder-sort

pnpm

feat/widget-multiple-server-urls

chore/medium-tests-dbname

fix/web-no-iterator-find

fix/map-pan-interruption

track-livephotos

timeline_events

chore/oxlint-migration

feat/maintenance-worker

feat/dav

chore/demo-snapshot

refactor/server-side-dedupe

feat/integrity-checks

dev/recognition-eval

lighter_buckets_test

perf/postgres-queue

postgres-queue

focus_rings

refactor/web-stores-1

refactor/add-to-taken

feat/sort-places

feat/sidecar-asset-file

vet

tmp/demo-snapshot-preview

fix/server-migration-file-extension

refactor/mobile-v2

fix/asset-update-race-condition

rknn-toolkit-lite2

refactor/mobile-split-up-search-page

feature/Add-rocm-support-for-machine-learning

feat/rocm

chore/async-hash-file

feat/shared-link-view-count

feat/rotation

feat/graphql

feat/job-ids

feat/ignore-library-permission-error

feat/docker-compose-builder

feat/kysely-typeorm

mobile/onboarding

no-video-player

fix/server-qsv-output-format

chore/server-geodata-tweaks

mobile/native-video-player-no-hero

feat/xxhash

fix/docs-concurrency

feat/preload-ml-textual-model

feat/local-tileserver

refactor/exif-orientation

original-path-infix

refactor/mobile/login-form-1

feat/server-editor-endpoints

fix/server-qsv-vbr

fix-mobile-db-problems

feat/ml-armnn-conversion

feat/mobile/backup-with-album-info

feat/fast-initial-sync-1

chore/handle-output_dims

feat/server-more-robust-generation

feat/unassign-faces

feat/shortcuts-on-asset-grid

feat/background-upload

feat/capacitor-mobile-app-poc

feat/server-nvenc-hw-decoding

release/v1.105

fix/mobile-fetch-non-archive

feat/fine-grained-access-controls

web/automation-ui

feat/mobile-server-endpoint-save-dropdown

feat/blurhash-thumbnail

object-storage

feat/memories-animations

dev/metrics

ml/tflite

feat/ml-export-cli

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: immich-app/immich#242