Skip to content

Commit

Permalink
Add vocabulary (#17)
Browse files Browse the repository at this point in the history
* fix macrons etc.

* more fixesss

* even more fixes ugh

* add google sheet

* add vocab to html, restructure pages
  • Loading branch information
sarahmccuan authored Feb 16, 2024
1 parent d4bae91 commit 849b8e6
Show file tree
Hide file tree
Showing 12 changed files with 5,846 additions and 335 deletions.
1 change: 0 additions & 1 deletion analysis/README

This file was deleted.

3 changes: 3 additions & 0 deletions analysis/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
Token-level analysis like lemmatisation or postagging can go here.

[Link to google sheet w/ lemmatization in progress](https://docs.google.com/spreadsheets/d/1u0OokpNZQAOxcr0HQaTV1Sk9LifvG8mpWWqNjylh-xY/edit?usp=sharing).
1,842 changes: 1,842 additions & 0 deletions docs/greek_english_vocab.html

Large diffs are not rendered by default.

322 changes: 322 additions & 0 deletions docs/greekwar.html

Large diffs are not rendered by default.

346 changes: 34 additions & 312 deletions docs/index.html

Large diffs are not rendered by default.

14 changes: 10 additions & 4 deletions docs/style.css
Original file line number Diff line number Diff line change
Expand Up @@ -18,7 +18,7 @@ h1,h2,h3,h4,h5,h6 {
body {
font-family: 'Noto Serif', serif;
display: flex;
justify-content: center;
/* justify-content: center; */
max-width: 38rem;
padding: 2rem;
margin: auto;
Expand Down Expand Up @@ -62,6 +62,7 @@ ul {
li {
font-size: 1.2em;
line-height: 1.5em;
padding-bottom: 1.2em;
}

.section {
Expand Down Expand Up @@ -133,15 +134,15 @@ li {
} */


.toc ul {
/* .toc ul {
list-style-type: none;
margin: 0;
padding: 0;
}
.toc li {
display: inline;
}
} */

.title-page {

Expand All @@ -167,4 +168,9 @@ li {
.jumpto {
margin-top:2em;
margin-bottom: 2em;
}
}

.dict-entry {
font-size: 1.1em;
font-weight: bold;
}
8 changes: 0 additions & 8 deletions drafts/README

This file was deleted.

23 changes: 23 additions & 0 deletions drafts/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,23 @@
Intermediate or helpful checkpoint files can go here.

`chambers_ocr.md` file is the proofread ocr file.

`chambers_notes_ocr.md` has Chambers' textual notes. Proofreading is complete but the ref tags need to be updated with GLTP ids for linking.

`greek-english-vocab.md` is proofread vocabulary, in process of being stripped of layout refs and tagged with shortcodes. For the original corrected vocab OCR look in `orig/`.

Checklist:
- [x] Paragraph & Line numbering appropriately represents the original core text and is in the GLTP format.
- [x] Spelling, accentuation, and punctuation of core text have been updated to match the original text.
- [x] Notes file is OCR corrected.
- [x] Vocab file is OCR corrected.
- [ ] Vocab file is cleaned and marked with shortcodes for analytics and linking (maybe convert to spreadsheet).
- [ ] Core text file has word-level ids to support vocab linking.
- [ ] Create a python script to generate vocab objects for linking.
- [ ] Foreach word in core file, link appropriate vocab object.
- [ ] Foreach note in notes file, link to appropriate line id.
- [ ] Come up with a nice way to render everything as a reader.

Some cleanup stuff that needs to be done (misc. list to track)
- [ ] Fix macrons on vocab
- [ ] Validate vocab - all words have proper spelling, accenting, breathings
9 changes: 0 additions & 9 deletions drafts/greek-english-vocab-intro.md

This file was deleted.

Loading

0 comments on commit 849b8e6

Please sign in to comment.