Rigorously curated citable Latin texts
#!ctsdata followed by one line per citable passage. Each line has two columns delimited by the # character. The first column is the passage’s URN, the second its text contents.latin23: Latin alphabet with 23 alphabetic characters, and no distinction of i/j or u/vlatin24: Latin alphabet with 24 alphabetic characters. Vocalic u and consonantal v are distinct, but there is but no distinction of i/j.latin25: Latin alphabet with 25 alphabetic characters, including vocalic u and i, and consonantal v and j.EXPLICIT LIST TBA
Notes on four characters with special semantics:
\n separates records of citable nodes in the CEX files.# separate columns on a given line in the CEX files.| to represent line breaks/new line characters within the textual contents of a citable node.ratione the nominative singular of ratio + enclitic ne or the ablative singular?), we use the hyphen - to mark enclitic boundaries.Because these characters have a special meaning in the structure of our data, they may not be used within the Latin text content of a citable node.
You may of course treat these characters however like for display purposes (replace | with \n and remove -, for example).