Rigorously curated citable Latin texts
#!ctsdata
followed by one line per citable passage. Each line has two columns delimited by the #
character. The first column is the passage’s URN, the second its text contents.latin23
: Latin alphabet with 23 alphabetic characters, and no distinction of i/j or u/vlatin24
: Latin alphabet with 24 alphabetic characters. Vocalic u and consonantal v are distinct, but there is but no distinction of i/j.latin25
: Latin alphabet with 25 alphabetic characters, including vocalic u and i, and consonantal v and j.EXPLICIT LIST TBA
Notes on four characters with special semantics:
\n
separates records of citable nodes in the CEX files.#
separate columns on a given line in the CEX files.|
to represent line breaks/new line characters within the textual contents of a citable node.ratione
the nominative singular of ratio
+ enclitic ne
or the ablative singular?), we use the hyphen -
to mark enclitic boundaries.Because these characters have a special meaning in the structure of our data, they may not be used within the Latin text content of a citable node.
You may of course treat these characters however like for display purposes (replace |
with \n
and remove -
, for example).