generated from greek-learner-texts/text-repository-template
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* fix macrons etc.
* more fixesss
* even more fixes ugh
* add google sheet
* add vocab to html, restructure pages
1 parent d4bae91, commit 849b8e6
Showing 12 changed files with 5,846 additions and 335 deletions.
This file was deleted.
@@ -0,0 +1,3 @@
Token-level analysis such as lemmatisation or POS tagging can go here.

[Link to Google Sheet with lemmatisation in progress](https://docs.google.com/spreadsheets/d/1u0OokpNZQAOxcr0HQaTV1Sk9LifvG8mpWWqNjylh-xY/edit?usp=sharing).
Large diffs are not rendered by default.
This file was deleted.
@@ -0,0 +1,23 @@
Intermediate or helpful checkpoint files can go here.

`chambers_ocr.md` is the proofread OCR file.

`chambers_notes_ocr.md` has Chambers' textual notes. Proofreading is complete, but the ref tags need to be updated with GLTP ids for linking.

`greek-english-vocab.md` is the proofread vocabulary, in the process of being stripped of layout refs and tagged with shortcodes. For the original corrected vocab OCR, look in `orig/`.

Checklist:
- [x] Paragraph and line numbering appropriately represents the original core text and is in the GLTP format.
- [x] Spelling, accentuation, and punctuation of the core text have been updated to match the original text.
- [x] Notes file is OCR corrected.
- [x] Vocab file is OCR corrected.
- [ ] Vocab file is cleaned and marked with shortcodes for analytics and linking (maybe convert to spreadsheet).
- [ ] Core text file has word-level ids to support vocab linking.
- [ ] Create a Python script to generate vocab objects for linking.
- [ ] For each word in the core file, link the appropriate vocab object.
- [ ] For each note in the notes file, link to the appropriate line id.
- [ ] Come up with a nice way to render everything as a reader.
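
The unchecked vocab-object and word-linking items above could look roughly like the following. This is only a sketch under stated assumptions: the tab-separated `headword<TAB>gloss` input, the `v0001`-style vocab ids, the `1.1.w1`-style word ids, and the `lemmas` lookup (for example, exported from the lemmatisation sheet) are all hypothetical and are not the repository's actual formats.

```python
import unicodedata
from dataclasses import dataclass


@dataclass(frozen=True)
class VocabEntry:
    vocab_id: str   # hypothetical id scheme, e.g. "v0001"
    headword: str
    gloss: str


def nfc(text):
    """Compose accents and breathings so equal words compare equal."""
    return unicodedata.normalize("NFC", text)


def parse_vocab(lines):
    """Build VocabEntry objects from assumed 'headword<TAB>gloss' lines."""
    vocab = {}
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines between entries
        headword, gloss = line.split("\t", 1)
        headword = nfc(headword)
        vocab[headword] = VocabEntry(f"v{len(vocab) + 1:04d}", headword, gloss)
    return vocab


def link_words(tokens, lemmas, vocab):
    """Map each word id to a vocab id, or None when no entry matches.

    tokens: list of (word_id, surface_form) pairs from the core text.
    lemmas: dict from surface form to headword (assumed to exist already).
    """
    lemmas = {nfc(form): nfc(lemma) for form, lemma in lemmas.items()}
    links = {}
    for word_id, form in tokens:
        form = nfc(form)
        entry = vocab.get(lemmas.get(form, form))
        links[word_id] = entry.vocab_id if entry else None
    return links


# Illustrative data only.
vocab = parse_vocab(["λόγος\tword, speech", "", "λέγω\tI say"])
tokens = [("1.1.w1", "λέγει"), ("1.1.w2", "λόγον"), ("1.1.w3", "καί")]
lemmas = {"λέγει": "λέγω", "λόγον": "λόγος"}
links = link_words(tokens, lemmas, vocab)
```

Unmatched words map to `None` rather than raising, so a first pass can report which core-text words still lack a vocab entry.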

Some cleanup that still needs to be done (misc. list to track):
- [ ] Fix macrons on vocab.
- [ ] Validate vocab: all words have proper spelling, accents, and breathings.
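
One possible way to approach the macron and validation items is a small checker built on Python's standard `unicodedata` module. The rules below are a simplified sketch and an assumption about what "proper" means here: they handle single-word headwords only, would flag unaccented proclitics such as ὁ (which would need an allowlist), and do not handle crasis. The function names are hypothetical.

```python
import unicodedata

PSILI, DASIA = "\u0313", "\u0314"          # smooth and rough breathing
ACCENTS = {"\u0301", "\u0300", "\u0342"}   # acute, grave, circumflex
MACRON = "\u0304"
VOWELS = set("αεηιουω")
MACRON_VOWELS = set("αιυ")                 # only these vowels vary in length


def split_letters(word):
    """Return [(base_letter, [combining marks])] for a lowercased NFD word."""
    letters = []
    for ch in unicodedata.normalize("NFD", word.lower()):
        if unicodedata.combining(ch):
            if letters:
                letters[-1][1].append(ch)
        else:
            letters.append((ch, []))
    return letters


def check_headword(word):
    """Return a list of human-readable problems found in one headword."""
    letters = split_letters(word)
    if not letters:
        return ["empty entry"]
    problems = []

    breathing_at = [i for i, (_, marks) in enumerate(letters)
                    for m in marks if m in (PSILI, DASIA)]
    first = letters[0][0]
    if first in VOWELS:
        # On an initial diphthong the breathing sits on the second vowel.
        if len(breathing_at) != 1 or breathing_at[0] > 1:
            problems.append("missing or misplaced breathing")
    elif first == "ρ":
        if breathing_at != [0]:
            problems.append("initial rho needs a breathing")
    elif breathing_at:
        problems.append("unexpected breathing")

    accent_count = sum(m in ACCENTS for _, marks in letters for m in marks)
    if accent_count == 0:
        problems.append("no accent")
    elif accent_count > 1:
        problems.append("more than one accent")

    for base, marks in letters:
        if MACRON in marks and base not in MACRON_VOWELS:
            problems.append(f"macron on {base}")
    return problems


def normalize_headword(word):
    """Compose marks into NFC so the same word is always stored one way."""
    return unicodedata.normalize("NFC", word)
```

Running `check_headword` over every entry and printing only the non-empty results would give a worklist for manual correction; spelling itself still needs a human proofread or a reference lexicon.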
This file was deleted.