-
Notifications
You must be signed in to change notification settings - Fork 0
UpEditor
Colm O'Gairbhith edited this page Jun 27, 2017
·
16 revisions
- This is the WYSIWYG editor that allows the creation of well-structured webpages that can be easily read by UpBrowser.
- A mechanism for defining rules may be needed, perhpas a simplified subset of the HTML5 DTD
- e.g. below a H1 you can only have the elements P/UL/OL/H2/IMG
- Long list of XML editors (many opensource)http://www.lexiconista.com/xonomy/web-based-xml-editors/
- See Doctored.js as and example of an XML Editor with possibility to check structure
- After editing by the user, the content is parsed/modifed
- Ensures that the html file is ready for the keyboard navigation provided by UpBrowser
- Text is parsed to ensure that individual phrases are identified
- A mechanism for defining rules may be needed, perhpas a simplified subset of the HTML5 DTD
- This editor integrates an OCR engine
- This can be used when copying text sections from image/pdf files via the clipbpoard.
- https://onlineocr.net/
- http://www.free-ocr.com/ (document layout analysis is not good, i.e. it loses the flow)
- https://www.microsoft.com/cognitive-services/en-us/computer-vision-api (This presenrves document layout, but loses images)
- Extra points for including crop=>OCR directly from the image view
- The WYSIWYG allow a subset of HTML, only providing HTML elements that are relevant in a non-visual context
- The rules for the semantic structure of the webpage/document are described by WebAIM here
- The editor also integrates a simplified SVG editor, similar to boxy.svg which is, I believe, based on svg-edit
- The editor should also integrate AsciiMath with display handled by MathJax javascript library
- The editor must display structure in a clear manner, perhaps by adding a H1/H2/H3/H4 indicating header level in the editor window
- From UpStore, to begin the transcription of an imported document/image
- From UpStore in a Create new scenario
- From UpBrowser, on double-clicking on any part of the webpage
- Accessible structure of all interfaces
- WYSIWYG editeur (scope réduit)
- Contrôler la structure du html généré.
- Hn ne peut contenir que ( (P|UL|OL|IMG), Hn+1)
- Indiquer la structure dans l'éditeur
- Intégrer la copier/coller d'une autre source texte (MS Word / Wikipedia) ou d'un image en utilisant OCR
- Nettoyer des sources "sales" (spécifiquement tout produit MS Office)
- Optionnel