Has anyone here worked with the data files available for download over at Perseus? I downloaded several of them and took a quick look. The notes say they are XML files but when opening the file in an XML editor (Firefox) it won't parse. When I open in a text editor it looks like the structure is messes up an lots of data is malformed.
http://www.perseus.tufts.edu/hopper/opensource/download
I'm trying to discover all data sets out there that cover Greek and Latin word frequency and lexeme
Jeff

