This paper is available on arxiv under CC 4.0 license. Authors: (1) Vogt, Lars, TIB Leibniz Information Centre for Science and Technology; (2) Konrad, Marcel, TIB Leibniz Information Centre for Science and Technology; (3) Prinz, Manuel, TIB Leibniz Information Centre for Science and Technology. Table of Links Abstract & Introduction Interoperability Semantic interoperability and what natural languages like English can teach us Requirements for successfully communicating terms and statements Parallels between the structure of natural language statements and data schemata with implications for semantic interoperability What makes a term a good term and a schema a good schema? The need for a machine-actionable Rosetta Stone for (meta)data that acts as an interlingua for specifying reference terms and reference schemata to support cognitive and semantic interoperability Rosetta Stone and machine-readability: UPRIs, XML Schema datatypes, and RDF for communicating terms, datatypes, and statements Rosetta Stone and machine-interpretability: Wikidata and a modeling paradigm for (meta)data statements based on English Rosetta Stone and semantic interoperability: Specifying term mappings and schema crosswalks Rosetta Stone and cognitive interoperability: Specifying display templates and using a query builder Discussion Related work Conclusion, Acknowledgements, & References The need for a machine-actionable Rosetta Stone for (meta)data that acts as an interlingua for specifying reference terms and reference schemata to support cognitive and semantic interoperability Above, we discussed the role of terms and statement structures (i.e., syntax trees and (meta)data schemata) in reliably communicating the meaning and thus the semantic content of (meta)data statements. Statement structures specify syntactic positions or slots with semantic roles or constraint specifications for a given statement type. To achieve semantic interoperability, we therefore need controlled vocabularies (i.e., ) and across ontologies for FAIR terms and their . And we need a schemata and and ontologies ontological and referential term mappings terminological interoperability (meta)dat ontological and referential schema crosswalks for FAIR (meta)data statements their schematic interoperability. We also discussed why we think that it is impossible to agree on a best term for every possible type of entity and a best schema for every possible type of statement, due to varying frames of reference and operational priorities. Therefore, we think that we need something like a to support the establishment of semantic interoperability across different terms and different schemata for a given type of (meta)data statement. This Rosetta Stone needs to function like an , with which term mappings and schema crosswalks can be easily specified and operationalized. The building blocks of the are and Each entity type must have specified a corresponding reference term, and each statement type must have a corresponding reference schema. Terms from controlled vocabularies can be mapped to their corresponding reference term, and schemata to their corresponding reference schema. Constraint specifications for slots of reference schemata must refer to reference terms in the case of resources, and to reference datatype specifications in the case of values. These three types of building blocks take over the role of , so that it would no longer be necessary to specify schema crosswalks for every possible pair of schemata of a given type of (meta)data statement and to specify term mappings for every possible pair of terms. This would minimize the number of schema crosswalks and term mappings that need to be specified in order to achieve schematic and terminological interoperability for a given type of statement (Fig. 4). machine-actionable Rosetta Stone interlingua interlingua reference terms, reference datatype specifications, reference schemata. mediating connectors Ideally, a reference schema is based on a generic Rosetta modeling paradigm that allows the reconstruction of the natural language statement underlying the datum. At the same time, it should document this statement using a formalized structure to ensure its human- and machine-actionability. With respect to human-actionability, the Rosetta modeling paradigm should reflect as closely as possible the structure of natural language statements, favoring lean over complex models, with the aim of reducing overall modeling complexity and modeling burden. Many schemata are very complex and include positions with resources that do not directly align with any input slot (e.g., and in Fig. 2E). Such schemata are not suitable for use as reference schemata. ‘scalar measurement datum’ ‘scalar value specification’ Schemata that conform to the Rosetta modeling paradigm should be easy to understand and to apply, allowing any producer of (meta)data to specify new reference schemata for types of statements that do not yet have a reference schema assigned to them, and allowing any application developer to readily use their (meta)data. It should not require experience in semantics and knowledge engineering on the part of the data producer and the application developer. With regard to the structure of reference schemata, we need to consider the machine-actionability of the resulting (meta)data statements. In other words, we need to consider which operations are important for such a reference schema. While reasoning is important for domain knowledge, especially when developing ontologies, other types of operations such as searching and exploring are more important in the context of empirical (meta)data and (meta)data management in general. However, regardless of the choice of operations and associated tools, the application of reference schemata must result in FAIR (meta)data that are machine-readable and machine-interpretable in order to be machine-actionable. As for reference terms, they should ideally be collected in a large, controlled cross-domain vocabulary and should be machine-readable and machine-interpretable to be machine-actionable.

The Need for a Machine-actionable Rosetta Stone for (meta)data That Acts as an Interlingua

About Author

Comments

TOPICS

THIS ARTICLE WAS FEATURED IN

Related Stories

A Simple Guide to Blockchain Queries

Towards a Rosetta Stone for (meta)data: Interoperability

Semantic Interoperability and What Natural Languages Like English Can Teach us

Towards a Rosetta Stone for (meta)data: What Makes a Term a Good Term and a Schema a Good Schema?

Rosetta Stone and Semantic Interoperability: Specifying Term Mappings and Schema Crosswalks

Parallels Between the Structure of Natural Language Statements and Data Schemata

A Simple Guide to Blockchain Queries

Towards a Rosetta Stone for (meta)data: Interoperability

Semantic Interoperability and What Natural Languages Like English Can Teach us

Towards a Rosetta Stone for (meta)data: What Makes a Term a Good Term and a Schema a Good Schema?

Rosetta Stone and Semantic Interoperability: Specifying Term Mappings and Schema Crosswalks

Parallels Between the Structure of Natural Language Statements and Data Schemata

Light-Mode

Classic

Newspaper

Minty

Dark-Mode

Neon Noir

Minty

HN StartUps