Kura can read texts in the following formats:
Possible new formats will be simple tagged interlinear and plain single language text with some markup. The simple interlinear xml format is perhaps the most useful.
An example:
<?xml version="1.0" encoding='iso-8859-1'?>
<kura_text
title="The Charter of the Guild of Anshen" languagenr="13"
description="This is purported to have been written by Vauri,
Commander of the Order of the Sworn of Anshen in the time of King
Vegelin II, the Great."
>
<stream>
<line
text="shen vestyn anin"
gloss="light-nom-s source-nom-s Poss1p"
/>
<translation text="The Light is our source."/>
</stream>
</kura_text>
The import facility is started from the command line:
> imp filename.xml filename.xml ...
or
> imp.bat filename.xml filename.xml
(Note: someone using Windows please check this batch file - it might be wrong)
Simple xml import expects the following prerequisites:
The languagenr should exist; there should be a tag "GL" and a tag "TR". There should be an elementtypecode "FORM"
Kura can save texts in the following formats (all in utf-8 encoding):
Example:
> python kurafile/kuraplaintextwriter.py 1
where 1 is the number of the text.
(Note: this is not ideal. In kura 1.1 the export facility will be integrated in the desktop application.)
There is no import facility for lexical data as yet, other than a custom script to load the data into the database. However, I will be very glad to assist anyone who wants to do this. It isn't hard. The same goes for exporting lexical data. (Although you could use the following command: mysqldump [database] lng_lex
from the prompt).
Previous: Default configuration | Next: The desktop client | Table of contents