Complex Type pc:GlyphType

Namespace http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15
Diagram
Attribute pc:GlyphType / @idAttribute pc:GlyphType / @ligatureAttribute pc:GlyphType / @symbolAttribute pc:GlyphType / @scriptAttribute pc:GlyphType / @productionAttribute pc:GlyphType / @customAttribute pc:GlyphType / @commentsElement pc:GlyphType / pc:AlternativeImageElement pc:GlyphType / pc:CoordsElement pc:GlyphType / pc:GraphemesElement pc:GlyphType / pc:TextEquivElement pc:GlyphType / pc:TextStyleElement pc:GlyphType / pc:UserDefinedElement pc:GlyphType / pc:Labels
Used by
Model Element pc:GlyphType / pc:AlternativeImage , Element pc:GlyphType / pc:Coords , Element pc:GlyphType / pc:Graphemes , Element pc:GlyphType / pc:TextEquiv , Element pc:GlyphType / pc:TextStyle , Element pc:GlyphType / pc:UserDefined , Element pc:GlyphType / pc:Labels
Children Element pc:GlyphType / pc:AlternativeImage, Element pc:GlyphType / pc:Coords, Element pc:GlyphType / pc:Graphemes, Element pc:GlyphType / pc:Labels, Element pc:GlyphType / pc:TextEquiv, Element pc:GlyphType / pc:TextStyle, Element pc:GlyphType / pc:UserDefined
Attributes
QName Type Use
Attribute pc:GlyphType / @comments string optional
Attribute pc:GlyphType / @custom string optional
For generic use
Attribute pc:GlyphType / @id ID required
Attribute pc:GlyphType / @ligature boolean optional
Attribute pc:GlyphType / @production Simple Type pc:ProductionSimpleType optional
Overrides the production attribute of the parent
word / text line / text region.
Attribute pc:GlyphType / @script Simple Type pc:ScriptSimpleType optional
The script used for the glyph
Attribute pc:GlyphType / @symbol boolean optional
Source
<complexType name="GlyphType">
  <sequence>
    <element name="AlternativeImage" type="pc:AlternativeImageType" minOccurs="0" maxOccurs="unbounded">
      <annotation>
        <documentation>Alternative glyph images (e.g. black-and-white)</documentation>
      </annotation>
    </element>
    <element name="Coords" type="pc:CoordsType"/>
    <element name="Graphemes" type="pc:GraphemesType" minOccurs="0" maxOccurs="1">
      <annotation>
        <documentation>Container for graphemes, grapheme groups and non-printing characters</documentation>
      </annotation>
    </element>
    <element name="TextEquiv" type="pc:TextEquivType" minOccurs="0" maxOccurs="unbounded">
    </element>
    <element name="TextStyle" type="pc:TextStyleType" minOccurs="0">
    </element>
    <element name="UserDefined" type="pc:UserDefinedType" minOccurs="0" maxOccurs="1">
    </element>
    <element name="Labels" type="pc:LabelsType" minOccurs="0" maxOccurs="unbounded">
      <annotation>
        <documentation>Semantic labels / tags</documentation>
      </annotation>
    </element>
  </sequence>
  <attribute name="id" type="ID" use="required"/>
  <attribute name="ligature" use="optional" type="boolean"/>
  <attribute name="symbol" use="optional" type="boolean"/>
  <attribute name="script" type="pc:ScriptSimpleType">
    <annotation>
      <documentation>The script used for the glyph</documentation>
    </annotation>
  </attribute>
  <attribute name="production" type="pc:ProductionSimpleType">
    <annotation>
      <documentation>Overrides the production attribute of the parent word / text line / text region.</documentation>
    </annotation>
  </attribute>
  <attribute name="custom" type="string">
    <annotation>
      <documentation>For generic use</documentation>
    </annotation>
  </attribute>
  <attribute name="comments" type="string"/>
</complexType>

Attribute pc:GlyphType / @id

Namespace No namespace
Type ID
Properties
use: required
Used by
Source
<attribute name="id" type="ID" use="required"/>

Attribute pc:GlyphType / @ligature

Namespace No namespace
Type boolean
Properties
use: optional
Used by
Source
<attribute name="ligature" use="optional" type="boolean"/>

Attribute pc:GlyphType / @symbol

Namespace No namespace
Type boolean
Properties
use: optional
Used by
Source
<attribute name="symbol" use="optional" type="boolean"/>

Attribute pc:GlyphType / @script

Namespace No namespace
Annotations
The script used for the glyph
Type Simple Type pc:ScriptSimpleType
Properties
content: simple
Facets
enumeration Adlm - Adlam
enumeration Afak - Afaka
enumeration Aghb - Caucasian Albanian
enumeration Ahom - Ahom, Tai Ahom
enumeration Arab - Arabic
enumeration Aran - Arabic (Nastaliq variant)
enumeration Armi - Imperial Aramaic
enumeration Armn - Armenian
enumeration Avst - Avestan
enumeration Bali - Balinese
enumeration Bamu - Bamum
enumeration Bass - Bassa Vah
enumeration Batk - Batak
enumeration Beng - Bengali
enumeration Bhks - Bhaiksuki
enumeration Blis - Blissymbols
enumeration Bopo - Bopomofo
enumeration Brah - Brahmi
enumeration Brai - Braille
enumeration Bugi - Buginese
enumeration Buhd - Buhid
enumeration Cakm - Chakma
enumeration Cans - Unified Canadian Aboriginal Syllabics
enumeration Cari - Carian
enumeration Cham - Cham
enumeration Cher - Cherokee
enumeration Cirt - Cirth
enumeration Copt - Coptic
enumeration Cprt - Cypriot
enumeration Cyrl - Cyrillic
enumeration Cyrs - Cyrillic (Old Church Slavonic variant)
enumeration Deva - Devanagari (Nagari)
enumeration Dsrt - Deseret (Mormon)
enumeration Dupl - Duployan shorthand, Duployan stenography
enumeration Egyd - Egyptian demotic
enumeration Egyh - Egyptian hieratic
enumeration Egyp - Egyptian hieroglyphs
enumeration Elba - Elbasan
enumeration Ethi - Ethiopic
enumeration Geok - Khutsuri (Asomtavruli and Nuskhuri)
enumeration Geor - Georgian (Mkhedruli)
enumeration Glag - Glagolitic
enumeration Goth - Gothic
enumeration Gran - Grantha
enumeration Grek - Greek
enumeration Gujr - Gujarati
enumeration Guru - Gurmukhi
enumeration Hanb - Han with Bopomofo
enumeration Hang - Hangul
enumeration Hani - Han (Hanzi, Kanji, Hanja)
enumeration Hano - Hanunoo (Hanunóo)
enumeration Hans - Han (Simplified variant)
enumeration Hant - Han (Traditional variant)
enumeration Hatr - Hatran
enumeration Hebr - Hebrew
enumeration Hira - Hiragana
enumeration Hluw - Anatolian Hieroglyphs
enumeration Hmng - Pahawh Hmong
enumeration Hrkt - Japanese syllabaries
enumeration Hung - Old Hungarian (Hungarian Runic)
enumeration Inds - Indus (Harappan)
enumeration Ital - Old Italic (Etruscan, Oscan etc.)
enumeration Jamo - Jamo
enumeration Java - Javanese
enumeration Jpan - Japanese
enumeration Jurc - Jurchen
enumeration Kali - Kayah Li
enumeration Kana - Katakana
enumeration Khar - Kharoshthi
enumeration Khmr - Khmer
enumeration Khoj - Khojki
enumeration Kitl - Khitan large script
enumeration Kits - Khitan small script
enumeration Knda - Kannada
enumeration Kore - Korean (alias for Hangul + Han)
enumeration Kpel - Kpelle
enumeration Kthi - Kaithi
enumeration Lana - Tai Tham (Lanna)
enumeration Laoo - Lao
enumeration Latf - Latin (Fraktur variant)
enumeration Latg - Latin (Gaelic variant)
enumeration Latn - Latin
enumeration Leke - Leke
enumeration Lepc - Lepcha (Róng)
enumeration Limb - Limbu
enumeration Lina - Linear A
enumeration Linb - Linear B
enumeration Lisu - Lisu (Fraser)
enumeration Loma - Loma
enumeration Lyci - Lycian
enumeration Lydi - Lydian
enumeration Mahj - Mahajani
enumeration Mand - Mandaic, Mandaean
enumeration Mani - Manichaean
enumeration Marc - Marchen
enumeration Maya - Mayan hieroglyphs
enumeration Mend - Mende Kikakui
enumeration Merc - Meroitic Cursive
enumeration Mero - Meroitic Hieroglyphs
enumeration Mlym - Malayalam
enumeration Modi - Modi, Moḍī
enumeration Mong - Mongolian
enumeration Moon - Moon (Moon code, Moon script, Moon type)
enumeration Mroo - Mro, Mru
enumeration Mtei - Meitei Mayek (Meithei, Meetei)
enumeration Mult - Multani
enumeration Mymr - Myanmar (Burmese)
enumeration Narb - Old North Arabian (Ancient North Arabian)
enumeration Nbat - Nabataean
enumeration Newa - Newa, Newar, Newari
enumeration Nkgb - Nakhi Geba
enumeration Nkoo - N’Ko
enumeration Nshu - Nüshu
enumeration Ogam - Ogham
enumeration Olck - Ol Chiki (Ol Cemet’, Ol, Santali)
enumeration Orkh - Old Turkic, Orkhon Runic
enumeration Orya - Oriya
enumeration Osge - Osage
enumeration Osma - Osmanya
enumeration Palm - Palmyrene
enumeration Pauc - Pau Cin Hau
enumeration Perm - Old Permic
enumeration Phag - Phags-pa
enumeration Phli - Inscriptional Pahlavi
enumeration Phlp - Psalter Pahlavi
enumeration Phlv - Book Pahlavi
enumeration Phnx - Phoenician
enumeration Piqd - Klingon (KLI pIqaD)
enumeration Plrd - Miao (Pollard)
enumeration Prti - Inscriptional Parthian
enumeration Rjng - Rejang (Redjang, Kaganga)
enumeration Roro - Rongorongo
enumeration Runr - Runic
enumeration Samr - Samaritan
enumeration Sara - Sarati
enumeration Sarb - Old South Arabian
enumeration Saur - Saurashtra
enumeration Sgnw - SignWriting
enumeration Shaw - Shavian (Shaw)
enumeration Shrd - Sharada, Śāradā
enumeration Sidd - Siddham
enumeration Sind - Khudawadi, Sindhi
enumeration Sinh - Sinhala
enumeration Sora - Sora Sompeng
enumeration Sund - Sundanese
enumeration Sylo - Syloti Nagri
enumeration Syrc - Syriac
enumeration Syre - Syriac (Estrangelo variant)
enumeration Syrj - Syriac (Western variant)
enumeration Syrn - Syriac (Eastern variant)
enumeration Tagb - Tagbanwa
enumeration Takr - Takri
enumeration Tale - Tai Le
enumeration Talu - New Tai Lue
enumeration Taml - Tamil
enumeration Tang - Tangut
enumeration Tavt - Tai Viet
enumeration Telu - Telugu
enumeration Teng - Tengwar
enumeration Tfng - Tifinagh (Berber)
enumeration Tglg - Tagalog (Baybayin, Alibata)
enumeration Thaa - Thaana
enumeration Thai - Thai
enumeration Tibt - Tibetan
enumeration Tirh - Tirhuta
enumeration Ugar - Ugaritic
enumeration Vaii - Vai
enumeration Visp - Visible Speech
enumeration Wara - Warang Citi (Varang Kshiti)
enumeration Wole - Woleai
enumeration Xpeo - Old Persian
enumeration Xsux - Cuneiform, Sumero-Akkadian
enumeration Yiii - Yi
enumeration Zinh - Code for inherited script
enumeration Zmth - Mathematical notation
enumeration Zsye - Symbols (Emoji variant)
enumeration Zsym - Symbols
enumeration Zxxx - Code for unwritten documents
enumeration Zyyy - Code for undetermined script
enumeration Zzzz - Code for uncoded script
enumeration other
Used by
Source
<attribute name="script" type="pc:ScriptSimpleType">
  <annotation>
    <documentation>The script used for the glyph</documentation>
  </annotation>
</attribute>

Attribute pc:GlyphType / @production

Namespace No namespace
Annotations
Overrides the production attribute of the parent
word / text line / text region.
Type Simple Type pc:ProductionSimpleType
Properties
content: simple
Facets
enumeration printed
enumeration typewritten
enumeration handwritten-cursive
enumeration handwritten-printscript
enumeration medieval-manuscript
enumeration other
Used by
Source
<attribute name="production" type="pc:ProductionSimpleType">
  <annotation>
    <documentation>Overrides the production attribute of the parent word / text line / text region.</documentation>
  </annotation>
</attribute>

Attribute pc:GlyphType / @custom

Namespace No namespace
Annotations
For generic use
Type string
Properties
content: simple
Used by
Source
<attribute name="custom" type="string">
  <annotation>
    <documentation>For generic use</documentation>
  </annotation>
</attribute>

Attribute pc:GlyphType / @comments

Namespace No namespace
Type string
Properties
content: simple
Used by
Source
<attribute name="comments" type="string"/>

Element pc:GlyphType / pc:AlternativeImage

Namespace http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15
Annotations
Alternative glyph images (e.g.
black-and-white)
Diagram
Attribute pc:AlternativeImageType / @filenameAttribute pc:AlternativeImageType / @commentsAttribute pc:AlternativeImageType / @confComplex Type pc:AlternativeImageType
Type Complex Type pc:AlternativeImageType
Properties
content: complex
minOccurs: 0
maxOccurs: unbounded
Attributes
Source
<element name="AlternativeImage" type="pc:AlternativeImageType" minOccurs="0" maxOccurs="unbounded">
  <annotation>
    <documentation>Alternative glyph images (e.g. black-and-white)</documentation>
  </annotation>
</element>

Element pc:GlyphType / pc:Coords

Namespace http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15
Diagram
Attribute pc:CoordsType / @pointsAttribute pc:CoordsType / @confComplex Type pc:CoordsType
Type Complex Type pc:CoordsType
Properties
content: complex
Attributes
QName Type Use
Attribute pc:CoordsType / @conf Simple Type pc:ConfSimpleType optional
Confidence value (between 0 and 1)
Attribute pc:CoordsType / @points Simple Type pc:PointsType required
Polygon outline of the element as a path of points.
No points may lie outside the outline of its parent,
which in the case of Border is the bounding rectangle
of the root image. Paths are closed by convention,
i.e. the last point logically connects with the first
(and at least 3 points are required to span an area).
Paths must be planar (i.e. must not self-intersect).
Source
<element name="Coords" type="pc:CoordsType"/>

Element pc:GlyphType / pc:Graphemes

Namespace http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15
Annotations
Container for graphemes, grapheme groups and
non-printing characters
Diagram
Element pc:GraphemesType / pc:GraphemeElement pc:GraphemesType / pc:NonPrintingCharElement pc:GraphemesType / pc:GraphemeGroupComplex Type pc:GraphemesType
Type Complex Type pc:GraphemesType
Properties
content: complex
minOccurs: 0
maxOccurs: 1
Model Element pc:GraphemesType / pc:Grapheme | Element pc:GraphemesType / pc:NonPrintingChar | Element pc:GraphemesType / pc:GraphemeGroup
Children Element pc:GraphemesType / pc:Grapheme, Element pc:GraphemesType / pc:GraphemeGroup, Element pc:GraphemesType / pc:NonPrintingChar
Instance
<pc:Graphemes xmlns:pc="http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15">
  <pc:Grapheme charType="" comments="" custom="" id="" index="" ligature="">{1,1}</pc:Grapheme>
  <pc:NonPrintingChar charType="" comments="" custom="" id="" index="" ligature="">{1,1}</pc:NonPrintingChar>
  <pc:GraphemeGroup charType="" comments="" custom="" id="" index="" ligature="">{1,1}</pc:GraphemeGroup>
</pc:Graphemes>
Source
<element name="Graphemes" type="pc:GraphemesType" minOccurs="0" maxOccurs="1">
  <annotation>
    <documentation>Container for graphemes, grapheme groups and non-printing characters</documentation>
  </annotation>
</element>

Element pc:GlyphType / pc:TextEquiv

Namespace http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15
Diagram
Attribute pc:TextEquivType / @indexAttribute pc:TextEquivType / @confAttribute pc:TextEquivType / @dataTypeAttribute pc:TextEquivType / @dataTypeDetailsAttribute pc:TextEquivType / @commentsElement pc:TextEquivType / pc:PlainTextElement pc:TextEquivType / pc:UnicodeComplex Type pc:TextEquivType
Type Complex Type pc:TextEquivType
Properties
content: complex
minOccurs: 0
maxOccurs: unbounded
Model Element pc:TextEquivType / pc:PlainText , Element pc:TextEquivType / pc:Unicode
Children Element pc:TextEquivType / pc:PlainText, Element pc:TextEquivType / pc:Unicode
Instance
<pc:TextEquiv comments="" conf="" dataType="" dataTypeDetails="" index="" xmlns:pc="http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15">
  <pc:PlainText>{0,1}</pc:PlainText>
  <pc:Unicode>{1,1}</pc:Unicode>
</pc:TextEquiv>
Attributes
QName Type Use
Attribute pc:TextEquivType / @comments string optional
Attribute pc:TextEquivType / @conf Simple Type pc:ConfSimpleType optional
OCR confidence value (between 0 and 1)
Attribute pc:TextEquivType / @dataType Simple Type pc:TextDataTypeSimpleType optional
Type of text content (is it free text or a number, for instance).
This is only a descriptive attribute, the text type
is not checked during XML validation.
Attribute pc:TextEquivType / @dataTypeDetails string optional
Refinement for dataType attribute. Can be a regular expression, for instance.
Attribute pc:TextEquivType / @index restriction of integer optional
Used for sort order in case multiple TextEquivs are defined.
The text content with the lowest index should be interpreted
as the main text content.
Source
<element name="TextEquiv" type="pc:TextEquivType" minOccurs="0" maxOccurs="unbounded">
</element>

Element pc:GlyphType / pc:TextStyle

Namespace http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15
Diagram
Attribute pc:TextStyleType / @fontFamilyAttribute pc:TextStyleType / @serifAttribute pc:TextStyleType / @monospaceAttribute pc:TextStyleType / @fontSizeAttribute pc:TextStyleType / @xHeightAttribute pc:TextStyleType / @kerningAttribute pc:TextStyleType / @textColourAttribute pc:TextStyleType / @textColourRgbAttribute pc:TextStyleType / @bgColourAttribute pc:TextStyleType / @bgColourRgbAttribute pc:TextStyleType / @reverseVideoAttribute pc:TextStyleType / @boldAttribute pc:TextStyleType / @italicAttribute pc:TextStyleType / @underlinedAttribute pc:TextStyleType / @underlineStyleAttribute pc:TextStyleType / @subscriptAttribute pc:TextStyleType / @superscriptAttribute pc:TextStyleType / @strikethroughAttribute pc:TextStyleType / @smallCapsAttribute pc:TextStyleType / @letterSpacedComplex Type pc:TextStyleType
Type Complex Type pc:TextStyleType
Properties
content: complex
minOccurs: 0
Attributes
QName Type Use
Attribute pc:TextStyleType / @bgColour Simple Type pc:ColourSimpleType optional
Background colour
Attribute pc:TextStyleType / @bgColourRgb integer optional
Background colour in RGB encoded format
(red value) + (256 x green value) + (65536 x blue value).
Attribute pc:TextStyleType / @bold boolean optional
Attribute pc:TextStyleType / @fontFamily string optional
For instance: Arial, Times New Roman.
Add more information if necessary
(e.g. blackletter, antiqua).
Attribute pc:TextStyleType / @fontSize float optional
The size of the characters in points.
Attribute pc:TextStyleType / @italic boolean optional
Attribute pc:TextStyleType / @kerning int optional
The degree of space (in points) between
the characters in a string of text.
Attribute pc:TextStyleType / @letterSpaced boolean optional
Attribute pc:TextStyleType / @monospace boolean optional
Attribute pc:TextStyleType / @reverseVideo boolean optional
Specifies whether the colour of the text appears
reversed against a background colour.
Attribute pc:TextStyleType / @serif boolean optional
Serif or sans-serif typeface.
Attribute pc:TextStyleType / @smallCaps boolean optional
Attribute pc:TextStyleType / @strikethrough boolean optional
Attribute pc:TextStyleType / @subscript boolean optional
Attribute pc:TextStyleType / @superscript boolean optional
Attribute pc:TextStyleType / @textColour Simple Type pc:ColourSimpleType optional
Attribute pc:TextStyleType / @textColourRgb integer optional
Text colour in RGB encoded format
(red value) + (256 x green value) + (65536 x blue value).
Attribute pc:TextStyleType / @underlineStyle Simple Type pc:UnderlineStyleSimpleType optional
Line style details if "underlined" is TRUE
Attribute pc:TextStyleType / @underlined boolean optional
Attribute pc:TextStyleType / @xHeight integer optional
The x-height or corpus size refers to the distance
between the baseline and the mean line of
lower-case letters in a typeface.
The unit is assumed to be pixels.
Source
<element name="TextStyle" type="pc:TextStyleType" minOccurs="0">
</element>

Element pc:GlyphType / pc:UserDefined

Namespace http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15
Diagram
Element pc:UserDefinedType / pc:UserAttributeComplex Type pc:UserDefinedType
Type Complex Type pc:UserDefinedType
Properties
content: complex
minOccurs: 0
maxOccurs: 1
Model Element pc:UserDefinedType / pc:UserAttribute
Children Element pc:UserDefinedType / pc:UserAttribute
Instance
<pc:UserDefined xmlns:pc="http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15">
  <pc:UserAttribute description="" name="" type="" value="">{1,unbounded}</pc:UserAttribute>
</pc:UserDefined>
Source
<element name="UserDefined" type="pc:UserDefinedType" minOccurs="0" maxOccurs="1">
</element>

Element pc:GlyphType / pc:Labels

Namespace http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15
Annotations
Semantic labels / tags
Diagram
Attribute pc:LabelsType / @externalModelAttribute pc:LabelsType / @externalIdAttribute pc:LabelsType / @prefixAttribute pc:LabelsType / @commentsElement pc:LabelsType / pc:LabelComplex Type pc:LabelsType
Type Complex Type pc:LabelsType
Properties
content: complex
minOccurs: 0
maxOccurs: unbounded
Model Element pc:LabelsType / pc:Label
Children Element pc:LabelsType / pc:Label
Instance
<pc:Labels comments="" externalId="" externalModel="" prefix="" xmlns:pc="http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15">
  <pc:Label comments="" type="" value="">{0,unbounded}</pc:Label>
</pc:Labels>
Attributes
QName Type Use
Attribute pc:LabelsType / @comments string optional
Attribute pc:LabelsType / @externalId string optional
E.g. an RDF resource identifier
(to be used as subject or object of an RDF triple)
Attribute pc:LabelsType / @externalModel string optional
Reference to external model / ontology / schema
Attribute pc:LabelsType / @prefix string optional
Prefix for all labels (e.g. first part of an URI)
Source
<element name="Labels" type="pc:LabelsType" minOccurs="0" maxOccurs="unbounded">
  <annotation>
    <documentation>Semantic labels / tags</documentation>
  </annotation>
</element>