Long vowels ᾱῑῡ
- bedwere
- Global Moderator
- Posts: 5103
- Joined: Fri Mar 07, 2008 10:23 pm
- Location: Didacopoli in California
- Contact:
Long vowels ᾱῑῡ
How do I get a list of the headwords with long ᾱ ῑ ῡ in LJS from Perseus? Thanks!
Corrections are welcome (especially for projects).
Blogger Profile My library at the Internet Archive
Meae editiones librorum. Αἱ ἐμαὶ ἐκδόσεις βίβλων.
Blogger Profile My library at the Internet Archive
Meae editiones librorum. Αἱ ἐμαὶ ἐκδόσεις βίβλων.
- ἑκηβόλος
- Textkit Zealot
- Posts: 969
- Joined: Wed Aug 07, 2013 10:19 am
- Contact:
Re: Long vowels ᾱῑῡ
Are you wanting to restrict the search to long vowels in initial position or anywhere in the headword?
τί δὲ ἀγαθὸν τῇ πομφόλυγι συνεστώσῃ ἢ κακὸν διαλυθείσῃ;
- jeidsath
- Textkit Zealot
- Posts: 5334
- Joined: Mon Dec 30, 2013 2:42 pm
- Location: Γαλεήπολις, Οὐισκόνσιν
Re: Long vowels ᾱῑῡ
Download the .txt file from here and do a file search:
https://archive.org/details/Lsj--LiddellScott
There are different unicode combining character encoding schemes, and to get vim search for ᾱ to work for me, I found that instead of searching for the version that my keyboard makes, I needed to copy an example of ᾱ from the file and search for that.
https://archive.org/details/Lsj--LiddellScott
There are different unicode combining character encoding schemes, and to get vim search for ᾱ to work for me, I found that instead of searching for the version that my keyboard makes, I needed to copy an example of ᾱ from the file and search for that.
“One might get one’s Greek from the very lips of Homer and Plato." "In which case they would certainly plough you for the Little-go. The German scholars have improved Greek so much.”
Joel Eidsath -- jeidsath@gmail.com
Joel Eidsath -- jeidsath@gmail.com
- bedwere
- Global Moderator
- Posts: 5103
- Joined: Fri Mar 07, 2008 10:23 pm
- Location: Didacopoli in California
- Contact:
Re: Long vowels ᾱῑῡ
Thank you, Joel. The text file you kindly provided seems to have combined characters, but this works using sed:
And it finds the long vowels anywhere.
Spoiler
Show
varda-lionel:echo "ῡ" | hexdump -C
00000000 cf 85 cc 84 0a |.....|
00000005
varda-lionel:sed -n '/\xcf\x85\xcc\x84/p' lsj.txt | less
varda-lionel:echo "ᾱ" | hexdump -C
00000000 ce b1 cc 84 0a |.....|
00000005
varda-lionel:sed -n '/\xce\xb1\xcc\x84/p' lsj.txt | less
varda-lionel:echo "ῑ" | hexdump -C
00000000 ce b9 cc 84 0a |.....|
00000005
varda-lionel:sed -n '/\xce\xb9\xcc\x84/p' lsj.txt | less
00000000 cf 85 cc 84 0a |.....|
00000005
varda-lionel:sed -n '/\xcf\x85\xcc\x84/p' lsj.txt | less
varda-lionel:echo "ᾱ" | hexdump -C
00000000 ce b1 cc 84 0a |.....|
00000005
varda-lionel:sed -n '/\xce\xb1\xcc\x84/p' lsj.txt | less
varda-lionel:echo "ῑ" | hexdump -C
00000000 ce b9 cc 84 0a |.....|
00000005
varda-lionel:sed -n '/\xce\xb9\xcc\x84/p' lsj.txt | less
Corrections are welcome (especially for projects).
Blogger Profile My library at the Internet Archive
Meae editiones librorum. Αἱ ἐμαὶ ἐκδόσεις βίβλων.
Blogger Profile My library at the Internet Archive
Meae editiones librorum. Αἱ ἐμαὶ ἐκδόσεις βίβλων.
- jeidsath
- Textkit Zealot
- Posts: 5334
- Joined: Mon Dec 30, 2013 2:42 pm
- Location: Γαλεήπολις, Οὐισκόνσιν
Re: Long vowels ᾱῑῡ
If you would only like to find it only on the headword line, notice that this is the format:
So you can use************************************************************
<headword>, <body>
<body cont.>
grep -A2 '************************************************************' | cut -d',' -f0
and pipe that into sed to get only the headwords.
You may find some entry inconsistencies however. And vowel length discussion is sometimes buried inside the entry instead of included in the headword.
“One might get one’s Greek from the very lips of Homer and Plato." "In which case they would certainly plough you for the Little-go. The German scholars have improved Greek so much.”
Joel Eidsath -- jeidsath@gmail.com
Joel Eidsath -- jeidsath@gmail.com
- bedwere
- Global Moderator
- Posts: 5103
- Joined: Fri Mar 07, 2008 10:23 pm
- Location: Didacopoli in California
- Contact:
Re: Long vowels ᾱῑῡ
I guess I had to add the name of the file, but it gives me an error:
grep -A2 '************************************************************' lsj.txt | cut -d',' -f0
cut: fields are numbered from 1
Try 'cut --help' for more information
Corrections are welcome (especially for projects).
Blogger Profile My library at the Internet Archive
Meae editiones librorum. Αἱ ἐμαὶ ἐκδόσεις βίβλων.
Blogger Profile My library at the Internet Archive
Meae editiones librorum. Αἱ ἐμαὶ ἐκδόσεις βίβλων.
- jeidsath
- Textkit Zealot
- Posts: 5334
- Joined: Mon Dec 30, 2013 2:42 pm
- Location: Γαλεήπολις, Οὐισκόνσιν
Re: Long vowels ᾱῑῡ
Oh, change that to '-f1'. Newer versions of coreutils correctly flag '-f0' as an error, which bites old engineers like me used to it getting silently accepted.
“One might get one’s Greek from the very lips of Homer and Plato." "In which case they would certainly plough you for the Little-go. The German scholars have improved Greek so much.”
Joel Eidsath -- jeidsath@gmail.com
Joel Eidsath -- jeidsath@gmail.com
- bedwere
- Global Moderator
- Posts: 5103
- Joined: Fri Mar 07, 2008 10:23 pm
- Location: Didacopoli in California
- Contact:
Re: Long vowels ᾱῑῡ
Thanks, Joel. This works best for me:
Spoiler
Show
grep -A2 '\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*' lsj.txt | sed -n '/\xcf\x85\xcc\x84/p'
grep -A2 '\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*' lsj.txt | sed -n '/\xce\xb1\xcc\x84/p'
grep -A2 '\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*' lsj.txt | sed -n '/\xce\xb9\xcc\x84/p'
grep -A2 '\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*' lsj.txt | sed -n '/\xce\xb1\xcc\x84/p'
grep -A2 '\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*' lsj.txt | sed -n '/\xce\xb9\xcc\x84/p'
Corrections are welcome (especially for projects).
Blogger Profile My library at the Internet Archive
Meae editiones librorum. Αἱ ἐμαὶ ἐκδόσεις βίβλων.
Blogger Profile My library at the Internet Archive
Meae editiones librorum. Αἱ ἐμαὶ ἐκδόσεις βίβλων.
- jeidsath
- Textkit Zealot
- Posts: 5334
- Joined: Mon Dec 30, 2013 2:42 pm
- Location: Γαλεήπολις, Οὐισκόνσιν
Re: Long vowels ᾱῑῡ
Sorry about that. I finally did it on a computer and fixed my code:
It would probably be useful to convert the whole file to unicode NFC if you're doing much searching with it.
It would likely be extremely useful work for you or I to look over these lists carefully and generate a ruleset for vowel length, equivalent to what people like Chandler did for accent.
Spoiler
Show
grep -F -A2 '******' lsj.txt | cut -d ',' -f1 | grep $'\xcf\x85\xcc\x84'
grep -F -A2 '******' lsj.txt | cut -d ',' -f1 | grep $'\xce\xb1\xcc\x84'
grep -F -A2 '******' lsj.txt | cut -d ',' -f1 | grep $'\xce\xb9\xcc\x84'
grep -F -A2 '******' lsj.txt | cut -d ',' -f1 | grep $'\xce\xb1\xcc\x84'
grep -F -A2 '******' lsj.txt | cut -d ',' -f1 | grep $'\xce\xb9\xcc\x84'
It would likely be extremely useful work for you or I to look over these lists carefully and generate a ruleset for vowel length, equivalent to what people like Chandler did for accent.
“One might get one’s Greek from the very lips of Homer and Plato." "In which case they would certainly plough you for the Little-go. The German scholars have improved Greek so much.”
Joel Eidsath -- jeidsath@gmail.com
Joel Eidsath -- jeidsath@gmail.com
- bedwere
- Global Moderator
- Posts: 5103
- Joined: Fri Mar 07, 2008 10:23 pm
- Location: Didacopoli in California
- Contact:
Re: Long vowels ᾱῑῡ
That works in bash. I'm not familiar with Chandler. Would you care to explain?
Corrections are welcome (especially for projects).
Blogger Profile My library at the Internet Archive
Meae editiones librorum. Αἱ ἐμαὶ ἐκδόσεις βίβλων.
Blogger Profile My library at the Internet Archive
Meae editiones librorum. Αἱ ἐμαὶ ἐκδόσεις βίβλων.