Skip to content

Conversation

@marcelklehr
Copy link
Member

Summary

Adds a task processing task type for doing OCR

TODO

  • Ideas for more inputs?

Checklist

@marcelklehr marcelklehr added this to the Nextcloud 33 milestone Dec 8, 2025
@marcelklehr marcelklehr requested a review from a team as a code owner December 8, 2025 11:40
@marcelklehr marcelklehr added the 3. to review Waiting for reviews label Dec 8, 2025
@marcelklehr marcelklehr requested review from ArtificialOwl, CarlSchwan, icewind1991, julien-nc, kyteinsky and leftybournes and removed request for a team December 8, 2025 11:40
@janepie
Copy link
Member

janepie commented Dec 8, 2025

Looks good! We could add an input for the language to be extracted and have it default to automatic detection, or add that as optional input only for providers that make use of it. Both fine for me, wdyt @julien-nc @kyteinsky ?

@julien-nc
Copy link
Member

Not sure the OCR libraries take a "language" param to help them perform an optimal extraction. @marcelklehr Do they?
If so, I'm ok with adding an input field. It's also fine to let the providers add an optional one as not all the providers might support the param.

@marcelklehr
Copy link
Member Author

Not sure the OCR libraries take a "language" param to help them perform an optimal extraction. @marcelklehr Do they?

The latest models don't require a language input, but older libraries like tesseract may require this. I think an optional input is fine.

@kesselb kesselb disabled auto-merge December 8, 2025 16:45
@kesselb kesselb merged commit b7b4a3a into master Dec 8, 2025
173 of 179 checks passed
@kesselb kesselb deleted the feat/tasktype-ocr branch December 8, 2025 16:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants