Exporting XML

Pro

Infix can export the text of a PDF as XML ready for translation by an external process. Once exported, the original PDF becomes a 'skeleton' into which translated XML can be imported in future.

The XML produced by Infix is very simple, containing stories, paragraphs and basic styling tags.

For ease of use with popular CAT tools, XML tag definition files can be found in the Translation Support folder in the Infix Pro installation folder.

The Entire Document

  1. Open your PDF.
  2. Choose Document > Translate > Export XML...
    This will export the contents of every text box in the PDF. Linked text boxes will be placed within the same <STORY> tag.
  3. Save the PDF. Every paragraph is given a unique ID in the exported XML, and the same ID is stored inside the PDF. Saving now ensures the paragraph ID information is preserved.

Exporting Individual Stories

  1. Using the Text tool, click inside the text box you wish to export. You can link multiple text boxes together using the T123 tool prior to export. Linked text boxes will be exported as a single <STORY> element.
  2. Choose Translate > Export XML...
  3. Save the PDF. Each paragraph in the story is given a unique ID in the exported XML and in the PDF. Saving now ensures the paragraph ID information is preserved.

After export, every paragraph in the document is marked with a unique ID. This ID must be stored in the PDF for future reference, so the PDF must be saved after the initial export.
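Because the paragraph IDs tie the XML to the saved PDF, it can be worth checking that a translated file still carries the same IDs before importing it. The following is a minimal sketch in Python, not part of Infix. It assumes that import matches paragraphs by the id attribute of each <P> element, which the text above implies but does not state. The function names and the sample strings are hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def paragraph_ids(xml_text):
    """Return the id attribute of every <P> element, in document order."""
    root = ET.fromstring(xml_text)
    # "<missing>" is a placeholder for paragraphs that lost their id attribute.
    return [p.get("id", "<missing>") for p in root.iter("P")]


def check_ids(exported_xml, translated_xml):
    """Compare paragraph IDs; an empty list means the two files agree."""
    exported = paragraph_ids(exported_xml)
    translated = paragraph_ids(translated_xml)
    problems = []

    duplicates = sorted(pid for pid, n in Counter(translated).items() if n > 1)
    if duplicates:
        problems.append(f"duplicate ids: {duplicates}")

    missing = sorted(set(exported) - set(translated))
    if missing:
        problems.append(f"missing ids: {missing}")

    unexpected = sorted(set(translated) - set(exported))
    if unexpected:
        problems.append(f"unexpected ids: {unexpected}")

    return problems


# Hypothetical data: a tiny export and two translated versions of it.
EXPORTED = "<DOC id='a.pdf'><STORY><P id='2'>Hello</P><P id='3'>World</P></STORY></DOC>"
TRANSLATED_OK = "<DOC id='a.pdf'><STORY><P id='2'>Bonjour</P><P id='3'>Monde</P></STORY></DOC>"
TRANSLATED_BAD = "<DOC id='a.pdf'><STORY><P id='2'>Bonjour</P></STORY></DOC>"

print(check_ids(EXPORTED, TRANSLATED_OK))   # []
print(check_ids(EXPORTED, TRANSLATED_BAD))  # ["missing ids: ['3']"]
```

A check like this only compares IDs. It does not verify that the styling tags survived translation.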

<DOC id='translation test.pdf'>
<STORY>
<P id='2'>
<span style='font-family:GaramondLight'><span style='color:#000000'>
Adstract automatically fields, reformats and exports text and graphics
from display ads including recruitment, real-estate and motors ads.
</span></span>
</P>
<P id='3'>
<span style='font-family:GaramondLight'><span style='color:#000000'>
The Adstract workflow eliminates costly re-keying and scanning, generating
fully-fielded data - ideal for presentation to the users of your web
site.</span></span>
</P>
</STORY>
</DOC>

Example of the XML produced from the export process.
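To illustrate how an external process might read this format, here is a minimal sketch using Python's standard xml.etree.ElementTree module. It is not part of Infix. It assumes the export is well-formed XML with closing </STORY> and </DOC> tags, and the function name paragraphs_by_id is hypothetical.

```python
import xml.etree.ElementTree as ET

# The sample export from this page, with the closing tags assumed.
SAMPLE = """<DOC id='translation test.pdf'>
<STORY>
<P id='2'>
<span style='font-family:GaramondLight'><span style='color:#000000'>
Adstract automatically fields, reformats and exports text and graphics
from display ads including recruitment, real-estate and motors ads.
</span></span>
</P>
<P id='3'>
<span style='font-family:GaramondLight'><span style='color:#000000'>
The Adstract workflow eliminates costly re-keying and scanning, generating
fully-fielded data - ideal for presentation to the users of your web
site.</span></span>
</P>
</STORY>
</DOC>"""


def paragraphs_by_id(xml_text):
    """Return {paragraph id: plain text} for every <P> in the export."""
    root = ET.fromstring(xml_text)
    result = {}
    for p in root.iter("P"):
        # itertext() walks through the nested <span> tags;
        # split/join collapses line breaks and repeated spaces.
        result[p.get("id")] = " ".join("".join(p.itertext()).split())
    return result


paras = paragraphs_by_id(SAMPLE)
for pid, text in paras.items():
    print(pid, text)
```

This sketch discards the <span> styling to show the text alone. A real translation workflow would need to keep those tags in place so the styling can be restored on import.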

Notes

  • To see the boundaries of paragraphs once exported, choose View > Show Text Fields.
  • Placing the mouse over an exported text box shows a tool-tip containing the original text before any translations were made.