Feature Request: Add OCR Backend Support for Local Document Processing #64

Open
20yuto20 opened this issue May 30, 2025 · 0 comments

@20yuto20

Summary

Propose adding an OCR (Optical Character Recognition) backend to enable local document text extraction capabilities within Docker Model Runner.

Motivation

  • Expand Docker Model Runner beyond text generation to include vision/document processing
  • Enable privacy-focused local OCR without cloud dependencies
  • Leverage existing model distribution and scheduling infrastructure

Proposed Implementation

  1. Create new OCR backend following existing patterns in pkg/inference/backends/
  2. Integrate with popular document AI models (e.g., LayoutLMv3, Donut)
  3. Support common image and document formats (PNG, JPEG, PDF)
  4. Expose OCR functionality through OpenAI-compatible API endpoints

Technical Considerations

  • Follow existing backend interface in pkg/inference/backends/llamacpp/llamacpp.go
  • Leverage model distribution system for OCR model downloads
  • Integrate with resource management for memory allocation
  • Support both CPU and GPU acceleration where available

Questions for Maintainers

  • Preferred document AI models?
  • API endpoint design preferences?
  • Model packaging/distribution strategy?

Comment

Being able to easily test document AI and OCR locally would be very helpful in my work!
