Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SPEC : Implement 1-step OCR & access to TTS #51

Open
carbontracking opened this issue Feb 9, 2018 · 0 comments
Open

SPEC : Implement 1-step OCR & access to TTS #51

carbontracking opened this issue Feb 9, 2018 · 0 comments
Labels

Comments

@carbontracking
Copy link
Owner

carbontracking commented Feb 9, 2018

Another version of UpScribers, instantly accessible

  1. Drag and drop an image onto the webpage
    • Or Copy / Paste : NOTE : You cannot copy paste a file, e.g. from File Explorer, but you can copy/paste a screenshot or a copy from within a graphics program.
  2. The image is uploaded to UpScribers
    • UpScribers may reduce the size of the image to pass the 4MB limit for AZURE OCR
    • May also perform cleaning of the image to improve the OCR-ability of the image (see textcleaner)
  3. Do OCR via AZURE and retrieve the co-ordinates of all the areas recognised
  4. Display the image fullscreen with the OCR'd areas highlighted by boxes.
    • the boxes may be linked to show the order
  5. Each OCR area can display the included text in a textarea and/or an option for TTS direct.

Minimal buttons for maximum result.

Inspiration

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant