Schema documentation for CEF-AT-SHARE-SimpleTypes.xsd

A URL used as homepage of an entity (e.g. of a person, organization, resource etc.); it provides general information (for instance in the case of a resource, it may present a description of the resource, its creators and possibly include links to the URL where it can be accessed from)

maxLength	300
pattern	*(http://.)\|(https://.)\|(ftp://.)\|(www.)*

<xs:element name="url">
  <xs:annotation>
    <xs:documentation>A URL used as homepage of an entity (e.g. of a person, organization, resource etc.); it provides general information (for instance in the case of a resource, it may present a description of the resource, its creators and possibly include links to the URL where it can be accessed from)</xs:documentation>
    <xs:appinfo>
      <label>URL (Landing page)</label>
    </xs:appinfo>
  </xs:annotation>
  <xs:simpleType>
    <xs:restriction base="httpURI">
      <xs:maxLength value="300"/>
    </xs:restriction>
  </xs:simpleType>
</xs:element>

Specifies the media type of the resource and basically corresponds to the physical medium of the content representation. Each media type is described through a 				distinctive set of features. A resource may consist of parts attributed to different types of media.

maxLength	30
enumeration	text
enumeration	audio
enumeration	video
enumeration	image
enumeration	textNumerical

Complex Types

inputInfoType, outputInfoType

<xs:element name="mediaType">
  <xs:annotation>
    <xs:documentation>Specifies the media type of the resource and basically corresponds to the physical medium of the content representation. Each media type is described through a distinctive set of features. A resource may consist of parts attributed to different types of media.</xs:documentation>
    <xs:appinfo>
      <relation>one-to-many</relation>
      <label>Media type</label>
    </xs:appinfo>
  </xs:annotation>
  <xs:simpleType>
    <xs:restriction base="xs:string">
      <xs:maxLength value="30"/>
      <xs:enumeration value="text"/>
      <xs:enumeration value="audio"/>
      <xs:enumeration value="video"/>
      <xs:enumeration value="image"/>
      <xs:enumeration value="textNumerical"/>
    </xs:restriction>
  </xs:simpleType>
</xs:element>

The data format (usually corresponding to the mime-type) of the resource which is a formalized specifier for the format included or a data format (mime-type) that the tool/service accepts, preferrably in conformance with the values of the IANA (Internet Assigned Numbers Authority); you can select one of the pre-defined values or add a value, PREFERABLY FROM THE IANA MEDIA MIMETYPE RECOMMENDED VALUES (http://www.iana.org/assignments/media-types/media-types.xhtml)

enumeration

http://w3id.org/meta-share/omtd-share/DataFormat

The format of a computer file storing data

enumeration

http://w3id.org/meta-share/omtd-share/GateFormat

Formats used for the GATE framework

enumeration

http://w3id.org/meta-share/omtd-share/Gate_json

A Twitter-style JSON format used for GATE documents

enumeration

http://w3id.org/meta-share/omtd-share/GateXml

XML-based format for GATE components

enumeration

http://w3id.org/meta-share/omtd-share/FastInfoset

A compressed binary encoding of GATE XML

enumeration

http://w3id.org/meta-share/omtd-share/Datasift_json

Common format for social media data from http://datasift.com

enumeration

http://w3id.org/meta-share/omtd-share/BinaryFormat

Any format of a computer file in which information is stored in the form of ones and zeros, or in some other binary (two-state) sequence; used mainly for executable files or files that need to be interpreted by a computer program

enumeration

http://w3id.org/meta-share/omtd-share/Pdf

Data format for PDF files (Portable Document Format)

enumeration

http://w3id.org/meta-share/omtd-share/Solr

Solr format

enumeration

http://w3id.org/meta-share/omtd-share/WikiFormats

Superclass for wiki formats

enumeration

http://w3id.org/meta-share/omtd-share/MediaWikiMarkup

Wiki markup for formatting

enumeration

http://w3id.org/meta-share/omtd-share/RdfFormats

Formats for RDF (Resource Description Framework) resources

enumeration

http://w3id.org/meta-share/omtd-share/Nif

The NLP Interchange Format (NIF) is an RDF/OWL-based format that aims to achieve interoperability between Natural Language Processing (NLP) tools, language resources and annotations; it consists of specifications, ontologies and software (overview), which are combined under the version identifier "NIF 2.0", but are versioned individually

enumeration

http://w3id.org/meta-share/omtd-share/Rdf_xml

Data format for RDF (Resource Description Framework) XML format; RDF/XML is a serialisation for RDF

enumeration

http://w3id.org/meta-share/omtd-share/Obo

Serialization format for ontologies according to the Open Biomedical Ontologies model.

enumeration

http://w3id.org/meta-share/omtd-share/Owl

Superclass for formats used for OWL

enumeration

http://w3id.org/meta-share/omtd-share/Owl_xml

XML format for OWL ontologies

enumeration

http://w3id.org/meta-share/omtd-share/Turtle

Textual syntax for RDF that allows an RDF graph to be completely written in a compact and natural text form, with abbreviations for common usage patterns and datatypes.

enumeration

http://w3id.org/meta-share/omtd-share/DatabaseFormat

Formats used for databases

enumeration

http://w3id.org/meta-share/omtd-share/Jdbc

For JDBC databases

enumeration

http://w3id.org/meta-share/omtd-share/MsAccessDatabase

Data format for Microsoft Access database

enumeration

http://w3id.org/meta-share/omtd-share/Tbx

International standard for representing and exchanging information about terms, words and other lexical data

enumeration

http://w3id.org/meta-share/omtd-share/CorpusFormat

A format used by a specific type of corpus (collection of texts)

enumeration

http://w3id.org/meta-share/omtd-share/KeaCorpus

KEA-style (Keyphrase Extraction Algorithm) corpus

enumeration

http://w3id.org/meta-share/omtd-share/TigerXml

The TIGER XML format was created for encoding syntactic constituency structures in the German TIGER corpus. It has since been used for many other corpora as well. TIGERSearch is a linguistic search engine specifically targetting this format. The format has later been extended to also support semantic frame annotations.

enumeration

http://w3id.org/meta-share/omtd-share/Web1t

File format used by the Web1T n-gram corpus, a huge collection of n-grams collected from the internet.

enumeration

http://w3id.org/meta-share/omtd-share/Reuters21578Txt

Reuters-21578 corpus transformed into text format using ExtractReuters in the lucene-benchmarks project

enumeration

http://w3id.org/meta-share/omtd-share/AimedCorpusFormat

Format of the Aimed corpus (225 abstracts from MEDLINE) with the gold standard sentence, protein, protein-protein interaction annotations.

enumeration

http://w3id.org/meta-share/omtd-share/AclAnthologyCorpusFormat

Data format specific to the ACL Anthology Reference Corpus (http://acl-arc.comp.nus.edu.sg/), most probably version 20080325

enumeration

http://w3id.org/meta-share/omtd-share/Tuepp

Format of the Tubingen Partially Parsed Corpus of Written German (TuPP-D/Z) XML files; TPP D/Z (http://www.sfs.uni-tuebingen.de/de/ascl/ressourcen/corpora/tuepp-dz.html) is a collection of articles from the German newspaper taz (die tageszeitung) annotated and encoded in a XML format.

enumeration

http://w3id.org/meta-share/omtd-share/Imscwb

A tab-separated format with limited markup (e.g. for sentences, documents, but not recursive structures like parse-trees) used by the IMS Open Corpus Workbench.

enumeration

http://w3id.org/meta-share/omtd-share/Reuters21578Sgml

Reuters-21578 corpus in SGML format

enumeration

http://w3id.org/meta-share/omtd-share/BncFormat

Data format for the XML version of the British National Corpus (http://www.natcorp.ox.ac.uk/)

enumeration

http://w3id.org/meta-share/omtd-share/Tcf

An XML data exchange format developed within the WebLicht architecture to facilitate efficient interoperability between the tools; it allows the various linguistic annotations produced by the tools within WebLicht to be stored in one document; it supports incremental enrichment of linguistic annotations at various levels of analysis in a stand-off XMLbased format

enumeration

http://w3id.org/meta-share/omtd-share/Xml

Superclass for grouping together XML formats

enumeration

http://w3id.org/meta-share/omtd-share/Emma

Data format according to the EMMA (Extensible MultiModal Annotation markup language) specifications, cf. https://www.w3.org/TR/2007/CR-emma-20071211/

enumeration

http://w3id.org/meta-share/omtd-share/Pml

Format according to the Prague Markup Language (http://ufal.mff.cuni.cz/jazz/PML/index_en.html); PML is a generic data format based on XML intended for storing linguistically annotated data, such as the Prague Dependency Treebank, also annotation lexicons, etc.

enumeration

http://w3id.org/meta-share/omtd-share/Tei

Data format for TEI-encoded (Text Encoding Initiative) texts

enumeration

http://w3id.org/meta-share/omtd-share/Folia

FoLiA is an XML-based annotation format, suitable for the representation of linguistically annotated language resources

enumeration

http://w3id.org/meta-share/omtd-share/Tmx

The purpose of the TMX format is to provide a standard method to describe translation memory data that is being exchanged among tools and/or translation vendors, while introducing little or no loss of critical data during the process.

enumeration

http://w3id.org/meta-share/omtd-share/AlvisEnrichedDocumentFormat

Format for linguistic annotations of documents used for the ALVIS framework

enumeration

http://w3id.org/meta-share/omtd-share/Pls

Data format according to the Pronunciation Lexicon Specification (PLS)

enumeration

http://w3id.org/meta-share/omtd-share/MsWordDocx

Format for MS-Word documents open xml formats

enumeration

http://w3id.org/meta-share/omtd-share/SdlTm

Translation Memory format of the SDL alignment tool

enumeration

http://w3id.org/meta-share/omtd-share/Xpath

XPath is a language for addressing parts of an XML document, designed to be used by both XSLT and XPointer.

enumeration

http://w3id.org/meta-share/omtd-share/Xmi

Data format for the XML Metadata Interchange (XMI), which is an Object Management Group (OMG) standard for exchanging metadata information via Extensible Markup Language (XML)

enumeration

http://w3id.org/meta-share/omtd-share/XmlBioc

BioC is a simple format to share text data and annotations.

enumeration

http://w3id.org/meta-share/omtd-share/MsExcelXlsx

Spreadsheet format for open office ms-excel

enumeration

http://w3id.org/meta-share/omtd-share/InlineXml

Inline XML file format

enumeration

http://w3id.org/meta-share/omtd-share/Xhtml

Data format for XHTML (Extensible HyperText Markup Language)

enumeration

http://w3id.org/meta-share/omtd-share/Xces

Data format for documents and corpora using the XCES standard (Corpus Encoding Standard for XML), cf. http://www.xces.org/

enumeration

http://w3id.org/meta-share/omtd-share/XcesIlspVariant

A variant of XCES implemented for documents

enumeration

http://w3id.org/meta-share/omtd-share/DocumentFormat

Any format used for documents (textual resources)

enumeration

http://w3id.org/meta-share/omtd-share/Pubmed

Textual format used for PubMed articles

enumeration

http://w3id.org/meta-share/omtd-share/Html

HTML format

enumeration

http://w3id.org/meta-share/omtd-share/Html5Microdata

Format according to the specifications of HTML5 Microdata

enumeration

http://w3id.org/meta-share/omtd-share/Json_ld

Data format encoding Linked Data using JSON

enumeration

http://w3id.org/meta-share/omtd-share/Latex

Data format for documents using LaTeX (a high-quality typesetting system very popular for scientific documents)

enumeration

http://w3id.org/meta-share/omtd-share/BionlpFormats

Formats used  for BioNLP shared tasks

enumeration

http://w3id.org/meta-share/omtd-share/Json_genia

JSON format of the Genia dataset

enumeration

http://w3id.org/meta-share/omtd-share/Bionlp

File format used for the BioNLP Shared Task format

enumeration

http://w3id.org/meta-share/omtd-share/BionlpSt2013A1_a2

Format used in BioNLP Shared Task 2013

enumeration

http://w3id.org/meta-share/omtd-share/Cochrane

Format used in Cochrane texts

enumeration

http://w3id.org/meta-share/omtd-share/MsExcel

Hyperclass for MS-Excel documents

enumeration

http://w3id.org/meta-share/omtd-share/MsExcelXls

Data format for Microsoft Excel documents

enumeration

http://w3id.org/meta-share/omtd-share/Postscript

Data format for PostScript files

enumeration

http://w3id.org/meta-share/omtd-share/Sgml

SGML format

enumeration

http://w3id.org/meta-share/omtd-share/Rtf

Rich Text Format; proprietary data format of Microsoft

enumeration

http://w3id.org/meta-share/omtd-share/Text

Default value for the format of textual files; a textual file should be human-readable and must not contain binary data

enumeration

http://w3id.org/meta-share/omtd-share/Tex

Data format for documents using Tex (a typesetting system)

enumeration

http://w3id.org/meta-share/omtd-share/MsWord

Hyperclass for MS-Word documents

enumeration

http://w3id.org/meta-share/omtd-share/MsWordDoc

Data format for Microsoft Word documents

enumeration

http://w3id.org/meta-share/omtd-share/TabularFormat

Any format based on columns

enumeration

http://w3id.org/meta-share/omtd-share/Csv

Data format with comma-separated values

enumeration

http://w3id.org/meta-share/omtd-share/ConllFormat

Formats used in the CoNLL Shared Tasks

enumeration

http://w3id.org/meta-share/omtd-share/ConllU

Format used for CoNLL.

enumeration

http://w3id.org/meta-share/omtd-share/Conll2009

The CoNLL 2009 format targets semantic role labeling. Columns are tab-separated. Sentences are separated by a blank new line.

enumeration

http://w3id.org/meta-share/omtd-share/Conll2002

The CoNLL 2002 format encodes named entity spans. Fields are separated by a single space. Sentences are separated by a blank new line.

enumeration

http://w3id.org/meta-share/omtd-share/Conll2008

The CoNLL 2008 format targets syntactic and semantic dependencies. Columns are tab-separated. Sentences are separated by a blank new line.

enumeration

http://w3id.org/meta-share/omtd-share/Conll2000

The CoNLL 2000 format represents POS and Chunk tags. Fields in a line are separated by spaces. Sentences are separated by a blank new line.

enumeration

http://w3id.org/meta-share/omtd-share/Conll2003

The CoNLL 2004 format encodes named entity spans and chunk spans. Fields are separated by a single space. Sentences are separated by a blank new line. Named entities and chunks are encoded in the IOB1 format. I.e. a B prefix is only used if the category of the following span differs from the category of the current span.

enumeration

http://w3id.org/meta-share/omtd-share/Conll2006

The CoNLL 2006 (aka CoNLL-X) format targets dependency parsing. Columns are tab-separated. Sentences are separated by a blank new line.

enumeration

http://w3id.org/meta-share/omtd-share/Conll2012

The CoNLL 2012 format targets semantic role labeling and coreference. Columns are tab-separated. Sentences are separated by a blank new line.

enumeration

http://w3id.org/meta-share/omtd-share/Tsv

Format for files with tab-separated values

enumeration

http://w3id.org/meta-share/omtd-share/LinkedDataFormat

Formats used for linked data

enumeration

http://w3id.org/meta-share/omtd-share/Json

Superclass of JSON formats

enumeration

http://w3id.org/meta-share/omtd-share/Cadixe_json

AlvisAE protocol format

enumeration

http://w3id.org/meta-share/omtd-share/Kaf

KAF (also known as Knowledge Annotation Format) is a language neutral annotation format representing both morpho-syntactic and semantic annotation of documents through a stand-off multilayered structure

enumeration

http://w3id.org/meta-share/omtd-share/WebAnnotationFormat

A structured model and format to enable annotations to be shared and reused across different hardware and software platforms.

enumeration

http://w3id.org/meta-share/omtd-share/Uima_json

UIMA serialisation in JSON

enumeration

http://w3id.org/meta-share/omtd-share/WikipediaFormat

Formats used for wikipedia

enumeration

http://w3id.org/meta-share/omtd-share/WikipediaRevisionPair

Pairs of adjacent revisions of all articles

enumeration

http://w3id.org/meta-share/omtd-share/WikipediaDiscussion

Format for wikipedia discussion pages

enumeration

http://w3id.org/meta-share/omtd-share/WikipediaRevision

Format for wikipedia revision pages

enumeration

http://w3id.org/meta-share/omtd-share/WikipediaArticleInfo

Format of general article infos

enumeration

http://w3id.org/meta-share/omtd-share/Blikiwikipedia

The Java Wikipedia API (Bliki engine) is a parser library for converting Wikipedia wikitext notation to HTML.

enumeration

http://w3id.org/meta-share/omtd-share/WikipediaQuery

Reads all article pages that match a query created by the numerous parameters of this class.

enumeration

http://w3id.org/meta-share/omtd-share/WikipediaLink

Format for wikipedia links

enumeration

http://w3id.org/meta-share/omtd-share/WikipediaTemplateFilteredArticle

Format for wikipedia pages that contain or do not contain the templates specified in the template whitelist and template blacklist

enumeration

http://w3id.org/meta-share/omtd-share/WikipediaPage

Format of wikipedia pages in the database (articles, discussions, etc)

enumeration

http://w3id.org/meta-share/omtd-share/WikipediaArticle

Format for wikipedia articles

enumeration

http://w3id.org/meta-share/omtd-share/LexicalDataFormat

enumeration

http://w3id.org/meta-share/omtd-share/AnnotationFormat

Any format used for annotated textual documents

enumeration

http://w3id.org/meta-share/omtd-share/DkproTokenized

DkPro format for tokenized files containing one sentence per line and tokens split by whitespaces.

enumeration

http://w3id.org/meta-share/omtd-share/MalletLdaTopicProportions

Topic proportions in the shape [\t]\t\t...

enumeration

http://w3id.org/meta-share/omtd-share/Ptb

Penn Tree Bank formats

enumeration

http://w3id.org/meta-share/omtd-share/PtbChunked

Penn Treebank chunked format

enumeration

http://w3id.org/meta-share/omtd-share/PtbCombined

Penn Treebank combined format

enumeration

http://w3id.org/meta-share/omtd-share/I2b2

Format of the I2B2 challenge

enumeration

http://w3id.org/meta-share/omtd-share/MalletLdaTopicProportionsSorted

Topic proportions in the shape [\t]\t\t... sorted

enumeration

http://w3id.org/meta-share/omtd-share/Brat

BRAT stand-off format for annotations (BRAT is a online environment for collaborative text annotation, cf. http://brat.nlplab.org/)

enumeration

http://w3id.org/meta-share/omtd-share/Lll

Format of the LLL challenge

enumeration

http://w3id.org/meta-share/omtd-share/Chat

CHAT (Codes for the Human Analysis of Transcripts) transcription format; used by CHILDES corpora

enumeration

http://w3id.org/meta-share/omtd-share/FactoredTagLemFormat

Factored tag lemma format

enumeration

http://w3id.org/meta-share/omtd-share/NegraExport

Export format for annotated corpora in the NeGra project

enumeration

http://w3id.org/meta-share/omtd-share/Graf

GrAF (Graph Annotation Format) is an extension of the Linguistic Annotation Framework (LAF)

enumeration

http://w3id.org/meta-share/omtd-share/Naf

The NAF format is linguistic annotation format designed for complex NLP pipelines. NAF combines strengths of the Linguistic Annotation Framework (LAF) as described in Ide et al. (2003) and the NLP Interchange Format (Hellman et al. 2013, NIF).

enumeration

http://w3id.org/meta-share/omtd-share/Diaml

Format following Dialogue Act Markup Language (DiAML) which is defined within the ISO standard 24617-2

enumeration

http://w3id.org/meta-share/omtd-share/Tgrep2

Format for TGrep2 (search engine for searching syntactic parse trees represented as bracketed structures)

enumeration

http://w3id.org/meta-share/omtd-share/UimaCasFormat

Formats used for the UIMA CAS (Common Analysis System) objects

enumeration

http://w3id.org/meta-share/omtd-share/SerializedCas

The CAS is the native data model used by UIMA; there are various ways of saving CAS data, using XMI, XCAS, or binary formats; this is for the serialized format

enumeration

http://w3id.org/meta-share/omtd-share/BinaryCas

Binary format used for CAS data

Complex Types

inputInfoType, outputInfoType

<xs:element name="dataFormat" type="dataFormatType">
  <xs:annotation>
    <xs:documentation>The data format (usually corresponding to the mime-type) of the resource which is a formalized specifier for the format included or a data format (mime-type) that the tool/service accepts, preferrably in conformance with the values of the IANA (Internet Assigned Numbers Authority); you can select one of the pre-defined values or add a value, PREFERABLY FROM THE IANA MEDIA MIMETYPE RECOMMENDED VALUES (http://www.iana.org/assignments/media-types/media-types.xhtml)</xs:documentation>
    <xs:appinfo>
      <label>Data format</label>
    </xs:appinfo>
  </xs:annotation>
  <!--
		<xs:simpleType>
			<xs:restriction base="xs:string">
				<xs:maxLength value="100"/>
				<xs:enumeration value="text/plain">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>Plain Text</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="application/vnd.xmi+xml">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>XMI</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="application/xml">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>XML</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="application/json">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>JSON</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="application/x-tmx+xml">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>TMX</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="application/x-xces+xml">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>XCES</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="application/tei+xml">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>TEI</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="application/rdf+xml">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>RDF</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="application/xhtml+xml">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>XHTML</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="text/sgml">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>SGML</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="text/html">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>HTML</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="application/x-tex">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>TEX</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="application/rtf">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>RTF</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="application/x-latex">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>LATEX</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="text/csv">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>CSV</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="text/tab-separated-values">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>text with tab-separated-values</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="application/pdf">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>PDF</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="application/x-msaccess">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>MS-Access database</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="application/vnd.ms-excel">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>MS-Excel xls</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>MS-Excel xlsx</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="application/vnd.openxmlformats-officedocument.wordprocessingml.document">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>MS-Word docx</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="application/msword">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>MS-Word doc</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="application/x-SDL-TM">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>TM format of the SDL alignment tool</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="application/x-tbx">
					<xs:annotation>
						<xs:appinfo>
							<xs:label>Term Base eXchange</xs:label>
						</xs:appinfo>
					</xs:annotation>
				</xs:enumeration>
				<xs:enumeration value="other"/>
			</xs:restriction>
		</xs:simpleType>
-->
</xs:element>

The name of the character encoding used in the resource or accepted by the tool/service

maxLength	100
enumeration	US-ASCII
enumeration	windows-1250
enumeration	windows-1251
enumeration	windows-1252
enumeration	windows-1253
enumeration	windows-1254
enumeration	windows-1257
enumeration	ISO-8859-1
enumeration	ISO-8859-2
enumeration	ISO-8859-4
enumeration	ISO-8859-5
enumeration	ISO-8859-7
enumeration	ISO-8859-9
enumeration	ISO-8859-13
enumeration	ISO-8859-15
enumeration	KOI8-R
enumeration	UTF-8
enumeration	UTF-16
enumeration	UTF-16BE
enumeration	UTF-16LE
enumeration	windows-1255
enumeration	windows-1256
enumeration	windows-1258
enumeration	ISO-8859-3
enumeration	ISO-8859-6
enumeration	ISO-8859-8
enumeration	windows-31j
enumeration	EUC-JP
enumeration	x-EUC-JP-LINUX
enumeration	Shift_JIS
enumeration	ISO-2022-JP
enumeration	x-mswin-936
enumeration	GB18030
enumeration	x-EUC-CN
enumeration	GBK
enumeration	ISCII91
enumeration	x-windows-949
enumeration	EUC-KR
enumeration	ISO-2022-KR
enumeration	x-windows-950
enumeration	x-MS950-HKSCS
enumeration	x-EUC-TW
enumeration	Big5
enumeration	Big5-HKSCS
enumeration	TIS-620
enumeration	Big5_Solaris
enumeration	Cp037
enumeration	Cp273
enumeration	Cp277
enumeration	Cp278
enumeration	Cp280
enumeration	Cp284
enumeration	Cp285
enumeration	Cp297
enumeration	Cp420
enumeration	Cp424
enumeration	Cp437
enumeration	Cp500
enumeration	Cp737
enumeration	Cp775
enumeration	Cp838
enumeration	Cp850
enumeration	Cp852
enumeration	Cp855
enumeration	Cp856
enumeration	Cp857
enumeration	Cp858
enumeration	Cp860
enumeration	Cp861
enumeration	Cp862
enumeration	Cp863
enumeration	Cp864
enumeration	Cp865
enumeration	Cp866
enumeration	Cp868
enumeration	Cp869
enumeration	Cp870
enumeration	Cp871
enumeration	Cp874
enumeration	Cp875
enumeration	Cp918
enumeration	Cp921
enumeration	Cp922
enumeration	Cp930
enumeration	Cp933
enumeration	Cp935
enumeration	Cp937
enumeration	Cp939
enumeration	Cp942
enumeration	Cp942C
enumeration	Cp943
enumeration	Cp943C
enumeration	Cp948
enumeration	Cp949
enumeration	Cp949C
enumeration	Cp950
enumeration	Cp964
enumeration	Cp970
enumeration	Cp1006
enumeration	Cp1025
enumeration	Cp1026
enumeration	Cp1046
enumeration	Cp1047
enumeration	Cp1097
enumeration	Cp1098
enumeration	Cp1112
enumeration	Cp1122
enumeration	Cp1123
enumeration	Cp1124
enumeration	Cp1140
enumeration	Cp1141
enumeration	Cp1142
enumeration	Cp1143
enumeration	Cp1144
enumeration	Cp1145
enumeration	Cp1146
enumeration	Cp1147
enumeration	Cp1148
enumeration	Cp1149
enumeration	Cp1381
enumeration	Cp1383
enumeration	Cp33722
enumeration	ISO2022_CN_CNS
enumeration	ISO2022_CN_GB
enumeration	JISAutoDetect
enumeration	MS874
enumeration	MacArabic
enumeration	MacCentralEurope
enumeration	MacCroatian
enumeration	MacCyrillic
enumeration	MacDingbat
enumeration	MacGreek
enumeration	MacHebrew
enumeration	MacIceland
enumeration	MacRoman
enumeration	MacRomania
enumeration	MacSymbol
enumeration	MacThai
enumeration	MacTurkish
enumeration	MacUkraine

<xs:element name="characterEncoding">
  <xs:annotation>
    <xs:documentation>The name of the character encoding used in the resource or accepted by the tool/service</xs:documentation>
    <xs:appinfo>
      <label>Character encoding</label>
    </xs:appinfo>
  </xs:annotation>
  <xs:simpleType>
    <xs:restriction base="xs:string">
      <xs:maxLength value="100"/>
      <xs:enumeration value="US-ASCII"/>
      <xs:enumeration value="windows-1250"/>
      <xs:enumeration value="windows-1251"/>
      <xs:enumeration value="windows-1252"/>
      <xs:enumeration value="windows-1253"/>
      <xs:enumeration value="windows-1254"/>
      <xs:enumeration value="windows-1257"/>
      <xs:enumeration value="ISO-8859-1"/>
      <xs:enumeration value="ISO-8859-2"/>
      <xs:enumeration value="ISO-8859-4"/>
      <xs:enumeration value="ISO-8859-5"/>
      <xs:enumeration value="ISO-8859-7"/>
      <xs:enumeration value="ISO-8859-9"/>
      <xs:enumeration value="ISO-8859-13"/>
      <xs:enumeration value="ISO-8859-15"/>
      <xs:enumeration value="KOI8-R"/>
      <xs:enumeration value="UTF-8"/>
      <xs:enumeration value="UTF-16"/>
      <xs:enumeration value="UTF-16BE"/>
      <xs:enumeration value="UTF-16LE"/>
      <xs:enumeration value="windows-1255"/>
      <xs:enumeration value="windows-1256"/>
      <xs:enumeration value="windows-1258"/>
      <xs:enumeration value="ISO-8859-3"/>
      <xs:enumeration value="ISO-8859-6"/>
      <xs:enumeration value="ISO-8859-8"/>
      <xs:enumeration value="windows-31j"/>
      <xs:enumeration value="EUC-JP"/>
      <xs:enumeration value="x-EUC-JP-LINUX"/>
      <xs:enumeration value="Shift_JIS"/>
      <xs:enumeration value="ISO-2022-JP"/>
      <xs:enumeration value="x-mswin-936"/>
      <xs:enumeration value="GB18030"/>
      <xs:enumeration value="x-EUC-CN"/>
      <xs:enumeration value="GBK"/>
      <xs:enumeration value="ISCII91"/>
      <xs:enumeration value="x-windows-949"/>
      <xs:enumeration value="EUC-KR"/>
      <xs:enumeration value="ISO-2022-KR"/>
      <xs:enumeration value="x-windows-950"/>
      <xs:enumeration value="x-MS950-HKSCS"/>
      <xs:enumeration value="x-EUC-TW"/>
      <xs:enumeration value="Big5"/>
      <xs:enumeration value="Big5-HKSCS"/>
      <xs:enumeration value="TIS-620"/>
      <xs:enumeration value="Big5_Solaris"/>
      <xs:enumeration value="Cp037"/>
      <xs:enumeration value="Cp273"/>
      <xs:enumeration value="Cp277"/>
      <xs:enumeration value="Cp278"/>
      <xs:enumeration value="Cp280"/>
      <xs:enumeration value="Cp284"/>
      <xs:enumeration value="Cp285"/>
      <xs:enumeration value="Cp297"/>
      <xs:enumeration value="Cp420"/>
      <xs:enumeration value="Cp424"/>
      <xs:enumeration value="Cp437"/>
      <xs:enumeration value="Cp500"/>
      <xs:enumeration value="Cp737"/>
      <xs:enumeration value="Cp775"/>
      <xs:enumeration value="Cp838"/>
      <xs:enumeration value="Cp850"/>
      <xs:enumeration value="Cp852"/>
      <xs:enumeration value="Cp855"/>
      <xs:enumeration value="Cp856"/>
      <xs:enumeration value="Cp857"/>
      <xs:enumeration value="Cp858"/>
      <xs:enumeration value="Cp860"/>
      <xs:enumeration value="Cp861"/>
      <xs:enumeration value="Cp862"/>
      <xs:enumeration value="Cp863"/>
      <xs:enumeration value="Cp864"/>
      <xs:enumeration value="Cp865"/>
      <xs:enumeration value="Cp866"/>
      <xs:enumeration value="Cp868"/>
      <xs:enumeration value="Cp869"/>
      <xs:enumeration value="Cp870"/>
      <xs:enumeration value="Cp871"/>
      <xs:enumeration value="Cp874"/>
      <xs:enumeration value="Cp875"/>
      <xs:enumeration value="Cp918"/>
      <xs:enumeration value="Cp921"/>
      <xs:enumeration value="Cp922"/>
      <xs:enumeration value="Cp930"/>
      <xs:enumeration value="Cp933"/>
      <xs:enumeration value="Cp935"/>
      <xs:enumeration value="Cp937"/>
      <xs:enumeration value="Cp939"/>
      <xs:enumeration value="Cp942"/>
      <xs:enumeration value="Cp942C"/>
      <xs:enumeration value="Cp943"/>
      <xs:enumeration value="Cp943C"/>
      <xs:enumeration value="Cp948"/>
      <xs:enumeration value="Cp949"/>
      <xs:enumeration value="Cp949C"/>
      <xs:enumeration value="Cp950"/>
      <xs:enumeration value="Cp964"/>
      <xs:enumeration value="Cp970"/>
      <xs:enumeration value="Cp1006"/>
      <xs:enumeration value="Cp1025"/>
      <xs:enumeration value="Cp1026"/>
      <xs:enumeration value="Cp1046"/>
      <xs:enumeration value="Cp1047"/>
      <xs:enumeration value="Cp1097"/>
      <xs:enumeration value="Cp1098"/>
      <xs:enumeration value="Cp1112"/>
      <xs:enumeration value="Cp1122"/>
      <xs:enumeration value="Cp1123"/>
      <xs:enumeration value="Cp1124"/>
      <xs:enumeration value="Cp1140"/>
      <xs:enumeration value="Cp1141"/>
      <xs:enumeration value="Cp1142"/>
      <xs:enumeration value="Cp1143"/>
      <xs:enumeration value="Cp1144"/>
      <xs:enumeration value="Cp1145"/>
      <xs:enumeration value="Cp1146"/>
      <xs:enumeration value="Cp1147"/>
      <xs:enumeration value="Cp1148"/>
      <xs:enumeration value="Cp1149"/>
      <xs:enumeration value="Cp1381"/>
      <xs:enumeration value="Cp1383"/>
      <xs:enumeration value="Cp33722"/>
      <xs:enumeration value="ISO2022_CN_CNS"/>
      <xs:enumeration value="ISO2022_CN_GB"/>
      <xs:enumeration value="JISAutoDetect"/>
      <xs:enumeration value="MS874"/>
      <xs:enumeration value="MacArabic"/>
      <xs:enumeration value="MacCentralEurope"/>
      <xs:enumeration value="MacCroatian"/>
      <xs:enumeration value="MacCyrillic"/>
      <xs:enumeration value="MacDingbat"/>
      <xs:enumeration value="MacGreek"/>
      <xs:enumeration value="MacHebrew"/>
      <xs:enumeration value="MacIceland"/>
      <xs:enumeration value="MacRoman"/>
      <xs:enumeration value="MacRomania"/>
      <xs:enumeration value="MacSymbol"/>
      <xs:enumeration value="MacThai"/>
      <xs:enumeration value="MacTurkish"/>
      <xs:enumeration value="MacUkraine"/>
    </xs:restriction>
  </xs:simpleType>
</xs:element>

Specifies the external classification schemes

Complex Type

domainInfoType

<xs:element name="conformanceToClassificationScheme" type="xs:string">
  <xs:annotation>
    <xs:documentation>Specifies the external classification schemes</xs:documentation>
    <xs:appinfo>
      <label>Conformance to classification scheme</label>
      <action>If the user fills in domainID, add by default the value "EUROVOC"; if the user fills in domainOther, prompt them to add a value (free text)</action>
    </xs:appinfo>
  </xs:annotation>
</xs:element>

Specifies the annotation level of the resource or the annotation type a tool/ service requires or produces as an output

enumeration

http://w3id.org/meta-share/omtd-share/AnnotationType

Category/class of the annotations (metadata) that are added to the data/text that is processed

enumeration

http://w3id.org/meta-share/omtd-share/Domain-specificAnnotation

Any kind of annotation that is used for specific domains (e.g. genes and proteins from the biomedical domain, plants from agriculture etc.)

enumeration

http://w3id.org/meta-share/omtd-share/AgriculturalEntity

Any kind of annotation pertaining to entities of the agricultural domain; the use of the AGROVOC thesaurus is recommended

enumeration

http://w3id.org/meta-share/omtd-share/Rna

Any of various nucleic acids that contain ribose and uracil as structural components and are associated with the control of cellular chemical activities

enumeration

http://w3id.org/meta-share/omtd-share/ProteinFamily

A protein family is a group of proteins that share a common evolutionary origin, reflected by their related functions and similarities in sequence or structure [https://www.ebi.ac.uk/training/online/course/introduction-protein-classification-ebi/protein-classification/what-are-protein-families]

enumeration

http://w3id.org/meta-share/omtd-share/Organism

An individual animal, plant, or single-celled life form [https://en.oxforddictionaries.com/definition/organism]

enumeration

http://w3id.org/meta-share/omtd-share/Phenotype

The physical appearance or biochemical characteristic of an organism as a result of the interaction of its genotype and the environment [http://www.biology-online.org/dictionary/Phenotype]

enumeration

http://w3id.org/meta-share/omtd-share/PhysicoChemicalProperty

Physical and chemical property of substances

enumeration

http://w3id.org/meta-share/omtd-share/WheatRelatedSpecies

Wheat-related species

enumeration

http://w3id.org/meta-share/omtd-share/Marker

Marker

enumeration

http://w3id.org/meta-share/omtd-share/Gene

Specific sequence of nucleotides along a molecule of DNA (or, in the case of some viruses, RNA) which represents functional units of heredity [http://artemide.art.uniroma2.it:8081/agrovoc/agrovoc/en/page/c_3214]

enumeration

http://w3id.org/meta-share/omtd-share/GeneFamily

A gene family is a set of several similar genes, formed by duplication of a single original gene, and generally with similar biochemical functions [https://en.wikipedia.org/wiki/Gene_family]

enumeration

http://w3id.org/meta-share/omtd-share/GrapeVariety

A type of grape

enumeration

http://w3id.org/meta-share/omtd-share/Habitat

The place or environment where an organism, plant or animal naturally or normally lives and grows

enumeration

http://w3id.org/meta-share/omtd-share/LinguisticEntity

Any kind of annotation pertaining to entities of linguistics; the use of OLIA is recommended

enumeration

http://w3id.org/meta-share/omtd-share/ScientificUnit

Scientific unit

enumeration

http://w3id.org/meta-share/omtd-share/NeuroscienceEntity

Any kind of annotation pertaining to entities of neuroscience

enumeration

http://w3id.org/meta-share/omtd-share/ScientificValue

Scientific value

enumeration

http://w3id.org/meta-share/omtd-share/ChemicalEntity

Any kind of annotation pertaining to entities from chemistry

enumeration

http://w3id.org/meta-share/omtd-share/SocialSciencesEntity

Any kind of annotation that pertains to entities of social sciences; the use of TheSoz is recommended

enumeration

http://w3id.org/meta-share/omtd-share/TheoreticalFrame

Theoretical frame

enumeration

http://w3id.org/meta-share/omtd-share/MethodOfResearch

Method of research

enumeration

http://w3id.org/meta-share/omtd-share/AllbusVariable

ALLBUS variable

enumeration

http://w3id.org/meta-share/omtd-share/OfficialText

Official text

enumeration

http://w3id.org/meta-share/omtd-share/HistoricalEvent

Historical event

enumeration

http://w3id.org/meta-share/omtd-share/Media

The main means of mass communication (broadcasting, publishing, and the Internet) regarded collectively [https://en.oxforddictionaries.com/definition/media]

enumeration

http://w3id.org/meta-share/omtd-share/BiologicalEnity

Any kind of annotation pertaining to entities of biology

enumeration

http://w3id.org/meta-share/omtd-share/Neuron

A nerve cell that carries information between the brain and other parts of the body

enumeration

http://w3id.org/meta-share/omtd-share/Metabolite

Any substance involved in metabolism (= the chemical processes in the body needed for life) [https://dictionary.cambridge.org/dictionary/english/metabolite]

enumeration

http://w3id.org/meta-share/omtd-share/Species

A set of animals or plants in which the members have similar characteristics to each other and can breed with each other

enumeration

http://w3id.org/meta-share/omtd-share/Chemical

Any substance (as an acid) that is formed when two or more other substances act upon one another or that is used to produce a change in another substance [https://www.merriam-webster.com/dictionary/chemical]

enumeration

http://w3id.org/meta-share/omtd-share/Protein

Any of various naturally occurring extremely complex substances that consist of amino-acid residues joined by peptide bonds, contain the elements carbon, hydrogen, nitrogen, oxygen, usually sulfur, and occasionally other elements (such as phosphorus or iron), and include many essential biological compounds (such as enzymes, hormones, or antibodies) [https://www.merriam-webster.com/dictionary/protein]

enumeration

http://w3id.org/meta-share/omtd-share/IonicChannel

A single protein or protein complex that traverses the lipid bilayer of cell membrane and form a channel to facilitate the movement of ions through the membrane according to their electrochemical gradient [http://www.biology-online.org/dictionary/Ion_channel]

enumeration

http://w3id.org/meta-share/omtd-share/IonicCurrent

The influx and/or efflux of ions through an ion channel

enumeration

http://w3id.org/meta-share/omtd-share/IonicConductance

Ionic conductance

enumeration

http://w3id.org/meta-share/omtd-share/Synapse

A specialized structure or junction that allows cell to cell communication [http://www.biology-online.org/dictionary/Synapse]

enumeration

http://w3id.org/meta-share/omtd-share/BiologicalActivity

Biological activity

enumeration

http://w3id.org/meta-share/omtd-share/BrainRegion

Part of the brain

enumeration

http://w3id.org/meta-share/omtd-share/ModelOrganism_species

Model organism/species

enumeration

http://w3id.org/meta-share/omtd-share/Relation

Any type of relation that holds between two or more entities of a specific domain

enumeration

http://w3id.org/meta-share/omtd-share/ScholarlyCommunicationAnnotation

Any type of annotation that is relevant to scholarly analtyics (e.g. citations, funding information etc.)

enumeration

http://w3id.org/meta-share/omtd-share/Topic

The subject of a text or conversation, what it is about

enumeration

http://w3id.org/meta-share/omtd-share/Citation

Reference to a book, paper, or author, especially in a scholarly work.

enumeration

http://w3id.org/meta-share/omtd-share/Keyword

A word or group of words used to describe or index the contents of a document

enumeration

http://w3id.org/meta-share/omtd-share/DocumentSection

Any subdivision of a document, e.g. a chapter, abstract, etc.

enumeration

http://w3id.org/meta-share/omtd-share/Funding

Annotation related to the funding of a resource (e.g. funder, funding project, etc.)

enumeration

http://w3id.org/meta-share/omtd-share/DiscourseAnnotationType

Any type of annotation relevant to discourse

enumeration

http://w3id.org/meta-share/omtd-share/Contradiction

A set of statements that contradict each other (i.e. one of them asserts the truth and the other the falsity of the proposition)

enumeration

http://w3id.org/meta-share/omtd-share/SpeechAct

A speech act is an act that a speaker performs when making an utterance, including the following: (a) A general act (illocutionary act) that a speaker performs, analyzable as including: the uttering of words (utterance acts), making reference and predicating (propositional acts), and a particular intention in making the utterance (illocutionary force). (b) An act involved in the illocutionary act, including utterance acts and propositional acts, (c) The production of a particular effect in the addressee (perlocutionary act) [http://www.glossary.sil.org/term/speech-act]

enumeration

http://w3id.org/meta-share/omtd-share/DialogueAct

A dialogue act has two main components:  a communicative function and a semantic content.   The semantic content specifies the objects, relations, actions, events, etc. that the dialogue act is about; the communicative function can be viewed as a specification of the way an addressee uses the semantic content to update his or her information state when  he  or  she  understands  the  corresponding  stretch  of dialogue. [http://www.lrec-conf.org/proceedings/lrec2010/pdf/560_Paper.pdf]

enumeration

http://w3id.org/meta-share/omtd-share/Coreference

Coreference is the reference in one expression to the same referent in another expression. [http://www.glossary.sil.org/term/coreference]

enumeration

http://w3id.org/meta-share/omtd-share/EntityMentionPair

The pair of an entity and all the mentions of this entity formulated in various ways; used in co-reference resolution

enumeration

http://w3id.org/meta-share/omtd-share/DiscourceRelation

The relation that holds between two segments of discourse; e.g. causal, temporal etc.

enumeration

http://w3id.org/meta-share/omtd-share/AudienceReaction

The response of the target recipients (audience) to a system, process or event

enumeration

http://w3id.org/meta-share/omtd-share/ModalityAnnotationType

enumeration

http://w3id.org/meta-share/omtd-share/GazeEyeMovements

enumeration

http://w3id.org/meta-share/omtd-share/LipMovements

enumeration

http://w3id.org/meta-share/omtd-share/HandArmGestures

enumeration

http://w3id.org/meta-share/omtd-share/FacialExpressions

enumeration

http://w3id.org/meta-share/omtd-share/BodyMovements

enumeration

http://w3id.org/meta-share/omtd-share/HeadMovements

enumeration

http://w3id.org/meta-share/omtd-share/HandManipulationOfObjects

enumeration

http://w3id.org/meta-share/omtd-share/Term

A term is a designation consisting of one or more words representing a general concept in a special language in a specific subject field [ISO 704:2009]

enumeration

http://w3id.org/meta-share/omtd-share/PartOfSpeech

A division of words based on common grammatical features

enumeration

http://w3id.org/meta-share/omtd-share/ParsingType

Any type of annotation that pertains to the syntactic level

enumeration

http://w3id.org/meta-share/omtd-share/SyntacticoSemanticLink

A link between the syntactic unit and the semantic unit (sense) of a word

enumeration

http://w3id.org/meta-share/omtd-share/DocumentAnnotationType

Any kind of annotation that is used to describe a document (e.g. identifier, size, location, language etc.)

enumeration

http://w3id.org/meta-share/omtd-share/StructuralAnnotationType

Any type of annotation that pertains to the structure of a document

enumeration

http://w3id.org/meta-share/omtd-share/Sentence

A group of words, usually containing a verb, that expresses a thought in the form of a statement, question, instruction, or exclamation and starts with a capital letter when written [https://dictionary.cambridge.org/dictionary/english/sentence]

enumeration

http://w3id.org/meta-share/omtd-share/Phrase

A phrase is a syntactic structure that consists of more than one word but lacks the subject-predicate organization of a clause. [http://www.glossary.sil.org/term/phrase]

enumeration

http://w3id.org/meta-share/omtd-share/Word

A word is a unit which is a constituent at the phrase level and above. It is sometimes identifiable according to such criteria as (a) being the minimal possible unit in a reply, (b) having features such as a regular stress pattern, and phonological changes conditioned by or blocked at word boundaries, (c) being the largest unit resistant to insertion of new constituents within its boundaries, or (d) being the smallest constituent that can be moved within a sentence without making the sentence ungrammatical. A word is sometimes placed, in a hierarchy of grammatical constituents, above the morpheme level and below the phrase level. [http://www.glossary.sil.org/term/word]
In annotation, words are often used as equivalent to tokens; thus, for instance, punctuation marks (traditionally not considered as words) will also be annotated as "word".

enumeration

http://w3id.org/meta-share/omtd-share/MultiWordUnit

A combination of words that are considered as forming one semantic unit

enumeration

http://w3id.org/meta-share/omtd-share/Token

A set of characters surrounded by spaces or punctuation marks, as well as punctuation marks themselves

enumeration

http://w3id.org/meta-share/omtd-share/Clause

A clause is a subdivision of a sentence containing a subject (argument) and predicate. It is possible to have a word that implies or refers to a predicate rather than one explicitly stated. [Pei & Gaynor 1980: 40, http://linguistics-ontology.org/gold/2010/Clause]

enumeration

http://w3id.org/meta-share/omtd-share/Paragraph

A division of a text, usually about a single theme, consisting of one or more sentences and marked by a new line, indentation or other conventions.

enumeration

http://w3id.org/meta-share/omtd-share/SemanticAnnotationType

Any type of annotation pertaining to the semantic level

enumeration

http://w3id.org/meta-share/omtd-share/Subjectivity

The linguistic expression of somebodys opinions, sentiments, emotions, evaluations, beliefs, speculations (private states, i.e. states that are not open to objective observation or verification). [http://www.mavir.net/docs/JWiebe-Subjectivity-nov2010.pdf]

enumeration

http://w3id.org/meta-share/omtd-share/SemanticFrame

A schematic representation of a situation involving various participants, props and other conceptual roles, each of which is a frame element

enumeration

http://w3id.org/meta-share/omtd-share/Sentiment

The affective state (judgement, feeling) of a person or group towards an entity or event

enumeration

http://w3id.org/meta-share/omtd-share/NamedEntity

A word or phrase referring to an entity, identified and annotated as such with a name (label); examples include organizations, persons, places etc.

enumeration

http://w3id.org/meta-share/omtd-share/Date

A text unit that denotes a date, a specific point in time

enumeration

http://w3id.org/meta-share/omtd-share/Organization

A word or group of words that denotes an organization, such as company, association, institution etc.

enumeration

http://w3id.org/meta-share/omtd-share/Person

A word or group of words that refers to a person

enumeration

http://w3id.org/meta-share/omtd-share/Location

A word or group of words that denotes a geographical entity

enumeration

http://w3id.org/meta-share/omtd-share/SpectralData

Spectral data is essentially data derived by the use of spectroscopic instruments

enumeration

http://w3id.org/meta-share/omtd-share/SemanticClass

A division of words into classes based on their common semantic features

enumeration

http://w3id.org/meta-share/omtd-share/Event

A thing that happens or takes place, especially one of importance [https://en.oxforddictionaries.com/definition/event]

enumeration

http://w3id.org/meta-share/omtd-share/LexicalSemanticRelation

A relation holding between two or more words based on their meanings

enumeration

http://w3id.org/meta-share/omtd-share/SemanticRole

A semantic role is the underlying relationship that a participant has with the main verb in a clause [http://www.glossary.sil.org/term/semantic-role]

enumeration

http://w3id.org/meta-share/omtd-share/Readability

The ease with which a reader can understand a written text. [https://en.wikipedia.org/wiki/Readability]

enumeration

http://w3id.org/meta-share/omtd-share/PersuasiveExpression

A word or phrase used for persuasion purposes

enumeration

http://w3id.org/meta-share/omtd-share/TemporalExpression

A linguistic expression (word, group of words, group of numbers etc.) that denotes time (a point in time, duration, frequency)

enumeration

http://w3id.org/meta-share/omtd-share/QuestionTopicalTarget

The segment of a question that describes the entity about which the question is made

enumeration

http://w3id.org/meta-share/omtd-share/Emotion

An affective state of consciousness in which joy, sorrow, fear, hate, or the like, is experienced, as distinguished from cognitive and volitional states of consciousness [http://www.dictionary.com/browse/emotion]

enumeration

http://w3id.org/meta-share/omtd-share/Polarity

A feature that distinguishes between positive, negative or neutral; in sentiment analysis, it refers to determining whether the expressed opinion in a document, a sentence or an entity feature/aspect is positive, negative, or neutral. [adapted from Wikipedia]

enumeration

http://w3id.org/meta-share/omtd-share/WordSense

Corresponds to the structural part of a lexical entry that contains the relevant semantic, grammatical, and anthropological information for a lexical unit. [adapted from http://www.glossary.sil.org/term/sense]

enumeration

http://w3id.org/meta-share/omtd-share/CertaintyLevel

Degree of certainty about the validity of what is being asserted in the text

enumeration

http://w3id.org/meta-share/omtd-share/MorphologicalAnnotationType

Any type of annotation pertaining to the morphological level

enumeration

http://w3id.org/meta-share/omtd-share/MorphologicalFeature

Property of a word that is expressed in its inflected form; examples include person, tense, gender, case etc.

enumeration

http://w3id.org/meta-share/omtd-share/Compound

A single word composed of two or more free morphemes

enumeration

http://w3id.org/meta-share/omtd-share/Stem

A stem is the root or roots of a word, together with any derivational affixes, to which inflectional affixes are added. [http://www.glossary.sil.org/term/stem]

enumeration

http://w3id.org/meta-share/omtd-share/DerivationalFeature

Any feature relevant to the derivation process of a word (e.g. marking affixes, their meaning etc.)

enumeration

http://w3id.org/meta-share/omtd-share/Affix

enumeration

http://w3id.org/meta-share/omtd-share/Syllable

enumeration

http://w3id.org/meta-share/omtd-share/Lemma

The canonical or citation form used for referring to a word and its inflected forms

Complex Types

inputInfoType, outputInfoType

<xs:element name="annotationType" type="annotationTypeType">
  <xs:annotation>
    <xs:documentation>Specifies the annotation level of the resource or the annotation type a tool/ service requires or produces as an output</xs:documentation>
    <xs:appinfo>
      <label>Annotation type</label>
    </xs:appinfo>
  </xs:annotation>
  <!--
		<xs:simpleType>
			<xs:restriction base="xs:string">
				<xs:maxLength value="150"/>
				<xs:enumeration value="alignment"/>
				<xs:enumeration value="segmentation"/>
				<xs:enumeration value="tokenization"/>
				<xs:enumeration value="segmentationSentence"/>
				<xs:enumeration value="segmentationParagraph"/>
				<xs:enumeration value="lemmatization"/>
				<xs:enumeration value="stemming"/>
				<xs:enumeration value="structuralAnnotation"/>
				<xs:enumeration value="morphosyntacticAnnotation-bPosTagging"/>
				<xs:enumeration value="morphosyntacticAnnotation-posTagging"/>
				<xs:enumeration value="syntacticAnnotation-constituencyTrees"/>
				<xs:enumeration value="syntacticAnnotation-dependencyTrees"/>
				<xs:enumeration value="syntacticAnnotation-subcategorizationFrames"/>
				<xs:enumeration value="syntacticosemanticAnnotation-links"/>
				<xs:enumeration value="semanticAnnotation"/>
				<xs:enumeration value="semanticAnnotation-certaintyLevel"/>
				<xs:enumeration value="semanticAnnotation-emotions"/>
				<xs:enumeration value="semanticAnnotation-entityMentions"/>
				<xs:enumeration value="semanticAnnotation-events"/>
				<xs:enumeration value="semanticAnnotation-namedEntities"/>
				<xs:enumeration value="semanticAnnotation-polarity"/>
				<xs:enumeration value="semanticAnnotation-semanticClasses"/>
				<xs:enumeration value="semanticAnnotation-semanticRelations"/>
				<xs:enumeration value="semanticAnnotation-semanticRoles"/>
				<xs:enumeration value="semanticAnnotation-wordSenses"/>
				<xs:enumeration value="translation"/>
				<xs:enumeration value="transliteration"/>
				<xs:enumeration value="discourseAnnotation"/>
				<xs:enumeration value="other"/>
			</xs:restriction>
		</xs:simpleType>
-->
</xs:element>

A name or a url reference to the typesystem used in the annotation of the resource or used by the tool/service

Complex Types

inputInfoType, outputInfoType

<xs:element name="typesystem">
  <xs:annotation>
    <xs:documentation>A name or a url reference to the typesystem used in the annotation of the resource or used by the tool/service</xs:documentation>
    <xs:appinfo>
      <label>Typesystem</label>
    </xs:appinfo>
  </xs:annotation>
  <xs:simpleType>
    <xs:restriction base="xs:string">
      <xs:maxLength value="500"/>
    </xs:restriction>
  </xs:simpleType>
</xs:element>

A name or a url reference to the annotation schema used in the annotation of the resource or used by the tool/service

Complex Types

inputInfoType, outputInfoType

<xs:element name="annotationSchema">
  <xs:annotation>
    <xs:documentation>A name or a url reference to the annotation schema used in the annotation of the resource or used by the tool/service</xs:documentation>
    <xs:appinfo>
      <label>Annotation schema</label>
    </xs:appinfo>
  </xs:annotation>
  <xs:simpleType>
    <xs:restriction base="xs:string">
      <xs:maxLength value="500"/>
    </xs:restriction>
  </xs:simpleType>
</xs:element>

A name or a url reference to the resource (e.g. tagset, ontology, term lexicon etc.) used in the annotation of the resource or used by the tool/service

Complex Types

inputInfoType, outputInfoType

<xs:element name="annotationResource">
  <xs:annotation>
    <xs:documentation>A name or a url reference to the resource (e.g. tagset, ontology, term lexicon etc.) used in the annotation of the resource or used by the tool/service</xs:documentation>
    <xs:appinfo>
      <label>Annotation resource</label>
    </xs:appinfo>
  </xs:annotation>
  <xs:simpleType>
    <xs:restriction base="xs:string">
      <xs:maxLength value="500"/>
    </xs:restriction>
  </xs:simpleType>
</xs:element>

Specifies the size of the resource with regard to the SizeUnit measurement in form of a number

<xs:element name="size">
  <xs:annotation>
    <xs:documentation>Specifies the size of the resource with regard to the SizeUnit measurement in form of a number</xs:documentation>
  </xs:annotation>
  <xs:simpleType>
    <xs:restriction base="xs:string">
      <xs:maxLength value="100"/>
    </xs:restriction>
  </xs:simpleType>
</xs:element>

Specifies the unit that is used when providing information on the size of the resource or of resource parts

maxLength	30
enumeration	terms
enumeration	entries
enumeration	files
enumeration	items
enumeration	texts
enumeration	sentences
enumeration	bytes
enumeration	tokens
enumeration	words
enumeration	keywords
enumeration	idiomaticExpressions
enumeration	neologisms
enumeration	multiWordUnits
enumeration	expressions
enumeration	concepts
enumeration	lexicalTypes
enumeration	kb
enumeration	mb
enumeration	gb
enumeration	rules
enumeration	translationUnits
enumeration	phrases
enumeration	segments
enumeration	other

<xs:element name="sizeUnit">
  <xs:annotation>
    <xs:documentation>Specifies the unit that is used when providing information on the size of the resource or of resource parts</xs:documentation>
  </xs:annotation>
  <xs:simpleType>
    <xs:restriction base="xs:string">
      <xs:maxLength value="30"/>
      <xs:enumeration value="terms"/>
      <xs:enumeration value="entries"/>
      <xs:enumeration value="files"/>
      <xs:enumeration value="items"/>
      <xs:enumeration value="texts"/>
      <xs:enumeration value="sentences"/>
      <xs:enumeration value="bytes"/>
      <xs:enumeration value="tokens"/>
      <xs:enumeration value="words"/>
      <xs:enumeration value="keywords"/>
      <xs:enumeration value="idiomaticExpressions"/>
      <xs:enumeration value="neologisms"/>
      <xs:enumeration value="multiWordUnits"/>
      <xs:enumeration value="expressions"/>
      <xs:enumeration value="concepts"/>
      <xs:enumeration value="lexicalTypes"/>
      <xs:enumeration value="kb"/>
      <xs:enumeration value="mb"/>
      <xs:enumeration value="gb"/>
      <xs:enumeration value="rules"/>
      <xs:enumeration value="translationUnits"/>
      <xs:enumeration value="phrases"/>
      <xs:enumeration value="segments"/>
      <xs:enumeration value="other"/>
    </xs:restriction>
  </xs:simpleType>
</xs:element>

Specifies the segmentation unit in terms of which the resource has been segmented or the level of segmentation a tool/service requires/outputs

maxLength	50
enumeration	paragraph
enumeration	sentence
enumeration	clause
enumeration	word
enumeration	wordGroup
enumeration	utterance
enumeration	phrase
enumeration	token
enumeration	other

<xs:element name="segmentationLevel">
  <xs:annotation>
    <xs:documentation>Specifies the segmentation unit in terms of which the resource has been segmented or the level of segmentation a tool/service requires/outputs</xs:documentation>
    <xs:appinfo>
      <label>Segmentation level</label>
    </xs:appinfo>
  </xs:annotation>
  <xs:simpleType>
    <xs:restriction base="xs:string">
      <xs:maxLength value="50"/>
      <xs:enumeration value="paragraph"/>
      <xs:enumeration value="sentence"/>
      <xs:enumeration value="clause"/>
      <xs:enumeration value="word"/>
      <xs:enumeration value="wordGroup"/>
      <xs:enumeration value="utterance"/>
      <xs:enumeration value="phrase"/>
      <xs:enumeration value="token"/>
      <xs:enumeration value="other"/>
    </xs:restriction>
  </xs:simpleType>
</xs:element>

Groups information on the size of the resource or of resource parts

size , sizeUnit

<sizeInfo xmlns="https://cef-at-service-catalogue.eu/CEF-AT-SHARE_Schema/Schema/">
  <size>{1,1}</size>
  <sizeUnit>{1,1}</sizeUnit>
</sizeInfo>

<xs:element name="sizeInfo" type="sizeInfoType">
  <xs:annotation>
    <xs:documentation>Groups information on the size of the resource or of resource parts</xs:documentation>
    <xs:appinfo>
      <label>Size</label>
    </xs:appinfo>
  </xs:annotation>
</xs:element>

pattern

(http://.*)|(https://.*)|(ftp://.*)|(www.*)

<xs:simpleType name="httpURI">
  <xs:restriction base="xs:anyURI">
    <xs:pattern value="http://.*"/>
    <xs:pattern value="https://.*"/>
    <xs:pattern value="ftp://.*"/>
    <xs:pattern value="www.*"/>
  </xs:restriction>
</xs:simpleType>

maxLength	100
pattern	[^@]+@[^\.]+\..+\|

<xs:simpleType name="emailAddress">
  <xs:restriction base="xs:string">
    <xs:maxLength value="100"/>
    <xs:pattern value="[^@]+@[^\.]+\..+|"/>
  </xs:restriction>
</xs:simpleType>

Groups information on the size of the resource or of resource parts

size , sizeUnit

<xs:complexType name="sizeInfoType">
  <xs:annotation>
    <xs:documentation>Groups information on the size of the resource or of resource parts</xs:documentation>
    <xs:appinfo>
      <render-short>{size} {sizeUnit}</render-short>
    </xs:appinfo>
  </xs:annotation>
  <xs:sequence>
    <xs:element name="size">
      <xs:annotation>
        <xs:documentation>Specifies the size of the resource with regard to the SizeUnit measurement in form of a number</xs:documentation>
      </xs:annotation>
      <xs:simpleType>
        <xs:restriction base="xs:string">
          <xs:maxLength value="100"/>
        </xs:restriction>
      </xs:simpleType>
    </xs:element>
    <xs:element name="sizeUnit">
      <xs:annotation>
        <xs:documentation>Specifies the unit that is used when providing information on the size of the resource or of resource parts</xs:documentation>
      </xs:annotation>
      <xs:simpleType>
        <xs:restriction base="xs:string">
          <xs:maxLength value="30"/>
          <xs:enumeration value="terms"/>
          <xs:enumeration value="entries"/>
          <xs:enumeration value="files"/>
          <xs:enumeration value="items"/>
          <xs:enumeration value="texts"/>
          <xs:enumeration value="sentences"/>
          <xs:enumeration value="bytes"/>
          <xs:enumeration value="tokens"/>
          <xs:enumeration value="words"/>
          <xs:enumeration value="keywords"/>
          <xs:enumeration value="idiomaticExpressions"/>
          <xs:enumeration value="neologisms"/>
          <xs:enumeration value="multiWordUnits"/>
          <xs:enumeration value="expressions"/>
          <xs:enumeration value="concepts"/>
          <xs:enumeration value="lexicalTypes"/>
          <xs:enumeration value="kb"/>
          <xs:enumeration value="mb"/>
          <xs:enumeration value="gb"/>
          <xs:enumeration value="rules"/>
          <xs:enumeration value="translationUnits"/>
          <xs:enumeration value="phrases"/>
          <xs:enumeration value="segments"/>
          <xs:enumeration value="other"/>
        </xs:restriction>
      </xs:simpleType>
    </xs:element>
  </xs:sequence>
</xs:complexType>

QName	Type	Use
lang	xs:language	optional

<xs:complexType name="myString">
  <xs:simpleContent>
    <xs:extension base="xs:string">
      <xs:attribute name="lang" type="xs:language"/>
    </xs:extension>
  </xs:simpleContent>
</xs:complexType>

Complex Type

myString

<xs:attribute name="lang" type="xs:language"/>

XML Schema documentation generated by <oXygen/>^® XML Editor.

Showing: