The texts from Perseus Library
-
- Textkit Neophyte
- Posts: 34
- Joined: Wed Jan 03, 2018 6:50 pm
The texts from Perseus Library
Perseus offers almost all of his texts for download in ( http://www.perseus.tufts.edu/hopper/opensource/download ), but is in HTML format. Does someone here know a easy way (maybe a software, a site etc...) to convert the HTML format to a readable format ?
- bedwere
- Global Moderator
- Posts: 5110
- Joined: Fri Mar 07, 2008 10:23 pm
- Location: Didacopoli in California
- Contact:
Re: The texts from Perseus Library
Maybe the easiest thing for you is to open an html file with your favorite browser and then copy and paste.
Corrections are welcome (especially for projects).
Blogger Profile My library at the Internet Archive
Meae editiones librorum. Αἱ ἐμαὶ ἐκδόσεις βίβλων.
Blogger Profile My library at the Internet Archive
Meae editiones librorum. Αἱ ἐμαὶ ἐκδόσεις βίβλων.
-
- Textkit Neophyte
- Posts: 34
- Joined: Wed Jan 03, 2018 6:50 pm
Re: The texts from Perseus Library
But If I do this, they open the HTML code, not the text... Ok, let leave the "easy" way, but I need a HTML editor or something like that?
- bedwere
- Global Moderator
- Posts: 5110
- Joined: Fri Mar 07, 2008 10:23 pm
- Location: Didacopoli in California
- Contact:
Re: The texts from Perseus Library
I think they are XML, not HTML files. Maybe someone else knows more.
Corrections are welcome (especially for projects).
Blogger Profile My library at the Internet Archive
Meae editiones librorum. Αἱ ἐμαὶ ἐκδόσεις βίβλων.
Blogger Profile My library at the Internet Archive
Meae editiones librorum. Αἱ ἐμαὶ ἐκδόσεις βίβλων.
- jeidsath
- Textkit Zealot
- Posts: 5342
- Joined: Mon Dec 30, 2013 2:42 pm
- Location: Γαλεήπολις, Οὐισκόνσιν
Re: The texts from Perseus Library
Yes, they are XML. You need to use a programming language with an XML parser Python, Java, etc., to get them to spit out the text. And even then, you'll have to do a fair amount of custom coding.
“One might get one’s Greek from the very lips of Homer and Plato." "In which case they would certainly plough you for the Little-go. The German scholars have improved Greek so much.”
Joel Eidsath -- jeidsath@gmail.com
Joel Eidsath -- jeidsath@gmail.com
- ἑκηβόλος
- Textkit Zealot
- Posts: 969
- Joined: Wed Aug 07, 2013 10:19 am
- Contact:
Re: The texts from Perseus Library
On a related issue...
The text of the very first section if Daphnis and Chloe can not be seen as a single section on Perseus:
It ought to be after the <p> in
Is the Perseus xml user editable? Can I actually do something about that?
The text of the very first section if Daphnis and Chloe can not be seen as a single section on Perseus:
The reason that it can't be seen is that there is a line missing in the xml code, viz.Longus 1.1.1-2 wrote:[p. 241] κάλλιστον ὧν εἶδον: εἰκόνα, γραφήν, ἱστορίαν ἔρωτος. Καλὸν μὲν καὶ τὸ ἄλσος, πολύδενδρον, ἀνθηρόν, κατάρρυτον: μία πηγὴ πάντα ἔτρεφε, καὶ τὰ ἄνθη καὶ τὰ δένδρα: ἀλλ̓ ἡ γραφὴ τερπνοτέρα καὶ τέχνην ἔχουσα περιττὴν καὶ τύχην ἐρωτικήν: ὥστε πολλοὶ καὶ τῶν ξένων κατὰ φήμην ᾔεσαν, τῶν μὲν Νυμφῶν ἱκέται, τῆς δὲ εἰκόνος θεαταί. [2] Γυναῖκες ἐπ̓ αὐτῆς τίκτουσαι καὶ ἄλλαι σπαργάνοις κοσμοῦσαι: παιδία ἐκκείμενα, ποίμνια τρέφοντα: ποιμένες ἀναιρούμενοι, νέοι συντιθέμενοι: λῃστῶν καταδρομ
Code: Select all
<milestone unit="section" n="1"/>
Code: Select all
<head>*p*r*o*o*i*m*i*o*n</head>
<p>
<pb id="p.241"/>
τί δὲ ἀγαθὸν τῇ πομφόλυγι συνεστώσῃ ἢ κακὸν διαλυθείσῃ;
- jeidsath
- Textkit Zealot
- Posts: 5342
- Joined: Mon Dec 30, 2013 2:42 pm
- Location: Γαλεήπολις, Οὐισκόνσιν
Re: The texts from Perseus Library
Email perseus_webmaster@tufts.edu to get things fixed. They've been working on a new viewer for a while now.
You could also submit a PR here, though I don't know their policy on accepting them: https://github.com/PerseusDL
You could also submit a PR here, though I don't know their policy on accepting them: https://github.com/PerseusDL
“One might get one’s Greek from the very lips of Homer and Plato." "In which case they would certainly plough you for the Little-go. The German scholars have improved Greek so much.”
Joel Eidsath -- jeidsath@gmail.com
Joel Eidsath -- jeidsath@gmail.com
- ἑκηβόλος
- Textkit Zealot
- Posts: 969
- Joined: Wed Aug 07, 2013 10:19 am
- Contact:
Re: The texts from Perseus Library
jeidsath wrote:Email perseus_webmaster@tufts.edu to get things fixed. They've been working on a new viewer for a while now.
Thanks for that.
I got a prompt and polite reply from the managing editor. It said that the problem had been logged before and has been updated in the github repository, (but not in the current P4 browser). The most up to date version of this work is available at:
https://github.com/PerseusDL/canonical- ... s-grc1.xml
Viewing the text in the most recent xml file on github in unicode is a lot more intuitive, I think.
The policy for accepting pull requests is here:jeidsath wrote:You could also submit a PR here, though I don't know their policy on accepting them
https://github.com/PerseusDL/canonical- ... l-requests
τί δὲ ἀγαθὸν τῇ πομφόλυγι συνεστώσῃ ἢ κακὸν διαλυθείσῃ;