Adding Orthic #1

Open
dpflug opened this issue Feb 23, 2025 · 5 comments

Comments

@dpflug

dpflug commented Feb 23, 2025

Hi! Love the project.

Orthic has a dictionary at https://orthic.shorthand.fun/dictionary#dictionary. I should be able to scrape it, but I wanted to talk formatting first.

Could that notation be a method of handling positioning systems?

@Koloth
Owner

Koloth commented Feb 24, 2025

Huh, I looked at that page but somehow missed that it has it all in plain text! That should be very scrapeable, and provides it in all three styles.

In terms of positional information, the method (using "^", maybe others? I've only skimmed.) should be usable, but it overestimates the complexity, since the position should be considered a form of space character, which is currently ignored. I think once I have a few systems in the plot that use positional information, then I'll switch to that method of evaluation. I'll need to rework the bounds though.
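To illustrate the overestimate, here is a minimal sketch. It assumes complexity is approximated by counting the characters of a plain-text outline, which is a simplification and not necessarily how the project measures it. The function name `outline_length`, the marker set, and the sample string are all hypothetical.

```python
# Assumption: "^" is the only positional marker; other systems may use more.
POSITION_MARKERS = {"^"}


def outline_length(outline, count_position=True):
    """Count the characters in an outline, always skipping whitespace.

    With count_position=False, positional markers are skipped as well,
    i.e. they are treated like a form of space character.
    """
    total = 0
    for ch in outline:
        if ch.isspace():
            continue  # spaces are ignored in both modes
        if ch in POSITION_MARKERS and not count_position:
            continue  # treat the positional marker like a space
        total += 1
    return total


# Made-up outline, not a real Orthic dictionary entry.
sample = "ab ^c"
with_markers = outline_length(sample)
without_markers = outline_length(sample, count_position=False)
```

Counting the marker as a regular character adds one unit per positional shift, which is the overestimate described above.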

@Koloth
Owner

Koloth commented Feb 24, 2025

Forgot to ask: what questions do you have on formatting? I've gotten pretty good at handling pretty much any format lol, but a simple CSV would do. Perhaps three, one for each of the three styles?
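As a rough sketch of the "three CSVs" idea, something like the following could split entries by style. The entry layout (a `word` key plus one key per style), the `word,outline` column names, and the function name `to_csv_per_style` are assumptions for illustration; the real data may be shaped differently.

```python
import csv
import io

# The three style names come from the discussion; the keys are an assumed layout.
STYLES = ("full", "ordinary", "abbreviated")


def to_csv_per_style(entries, styles=STYLES):
    """Return a dict mapping each style to CSV text with word,outline rows."""
    out = {}
    for style in styles:
        buf = io.StringIO()
        writer = csv.writer(buf, lineterminator="\n")
        writer.writerow(["word", "outline"])
        for entry in entries:
            outline = entry.get(style)
            if outline:  # skip words that have no outline in this style
                writer.writerow([entry["word"], outline])
        out[style] = buf.getvalue()
    return out


# Placeholder entries; the outlines are made up, not real Orthic forms.
sample = [
    {"word": "example", "full": "example", "ordinary": "exampl", "abbreviated": "ex"},
    {"word": "other", "full": "other"},
]
csv_by_style = to_csv_per_style(sample)
```

Each value in `csv_by_style` can then be written to its own file, one per style.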

@dpflug
Author

dpflug commented Feb 24, 2025

I forgot the site has a repo on here. I've asked for permission to use the data: jeremy-w/orthic#30

It's just a yaml: https://github.com/jeremy-w/orthic/blob/master/_data/dictionary.yml

@Koloth
Owner

Koloth commented Feb 25, 2025

It'll be pushing the limits somewhat on dictionary size. Just eyeballing it, there seem to be only about 1.3K entries total across full, ordinary, and abbreviated, which means each of those styles likely has only a few hundred words. I'll give it a look though.

@Koloth
Owner

Koloth commented Feb 26, 2025

I tested an implementation, and the resulting graph is highly suspect. In particular, it does not even reflect the correct order among the three variants (it places ordinary as vastly shorter than abbreviated, which is almost identical to full). I'll keep poking at it: it is a somewhat strange dictionary, so my simplistic first implementation might not represent it properly. I think I'll leave it out for now. In any case, I want to wait until you get permission.

Image
