Importing and exporting

Texts

Kura can read texts in the following formats:

Possible new formats will be simple tagged interlinear and plain single language text with some markup. The simple interlinear xml format is perhaps the most useful.

An example:

<?xml version="1.0" encoding='iso-8859-1'?>
<kura_text
  title="The Charter of the Guild of Anshen" languagenr="13"
  description="This is purported to have been written by Vauri,
  Commander of the Order of the Sworn of Anshen in the time of King
  Vegelin II, the Great."
>
<stream>
  <line
    text="shen vestyn anin"
    gloss="light-nom-s source-nom-s Poss1p"
  />
  <translation text="The Light is our source."/>
</stream>
</kura_text>

The import facility is started from the command line:

> imp filename.xml filename.xml ...

or

> imp.bat filename.xml filename.xml

(Note: someone using Windows please check this batch file - it might be wrong)

Simple xml import expects the following prerequisites:

The languagenr should exist; there should be a tag "GL" and a tag "TR". There should be an elementtypecode "FORM"

Kura can save texts in the following formats (all in utf-8 encoding):

Example:

> python kurafile/kuraplaintextwriter.py 1

where 1 is the number of the text.

(Note: this is not ideal. In kura 1.1 the export facility will be integrated in the desktop application.)

Lexemes

There is no import facility for lexical data as yet, other than a custom script to load the data into the database. However, I will be very glad to assist anyone who wants to do this. It isn't hard. The same goes for exporting lexical data. (Although you could use the following command: mysqldump [database] lng_lex from the prompt).



Previous: Default configuration | Next: The desktop client | Table of contents