-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Some stuff not getting parsed #1
Comments
Hi! Yes, derivative or compound words are managed differently - their root will be linked to the parent, but themselves won't. |
I see, thank you for your response, I really appreciate it 🙂. Question 1Also, I'm having similar issues for English words whose immediate ancestor is Middle English or old French. Case in point: the word example (see wikitionary). It doesn't seem to go all the way back to the Latin root which is what I need. Question 2I had to deaccent the latin forms to get some etymological paths working properly (see the notebook I originally linked in the post) Here I have collapsed all subcategories of Latin into a column called see the word plethora for example. Before deaccenting After deaccenting Now it correctly finds the etyomlogical paths: Is there something that can be done about this? I feel like my solution is very brittle. 😅 |
I think it's because |
I have not done any Perl since, and tbh I ended up re-writing the entire code base in Python a while ago XD (it was as fast as the Perl parsing). |
I have been attempting to rewrite all the Perl code in Python for the past week 😅.
I.e., you no longer have the Python files?
This would be great. I would be happy to debug / contribute to the code base if I knew how lol. I could try and rewrite it in Python with some help. |
Hello, thank you very for the very useful code.
here is my somewhat messy python notebook.
I am running into some issues:
Issue #1
It seems the script is not properly recognizing compounds properly:
Here is the original wikitionary page
I would like to see the full etymological path like so:
litográfico -> litografía -> lito- -> λίθος (Ancient Greek)
litográfico -> -ico -> -icus (Latin)
But as you can see in the image it seems it can't find the parent.
Issue #2
Similarly, for the English word copious, I can't find its etymological path. Other cognates of copious seem to have successfully parsed : (
The text was updated successfully, but these errors were encountered: