Skip to content

Latest commit

 

History

History
312 lines (210 loc) · 25.1 KB

reference.md

File metadata and controls

312 lines (210 loc) · 25.1 KB

Translator Reasoner API

Components

Query

The Query class is used to package a user request for information. A Query object consists of a required Message object with optional additional properties. Additional properties are intended to convey implementation-specific or query-independent parameters. For example, an additional property specifying a log level could allow a user to override the default log level in order to receive more fine-grained log information when debugging an issue.

Fixed Fields
Field Name Type Description
message Message REQUIRED. The query Message is a serialization of the user request. Content of the Message object depends on the intended TRAPI operation. For example, the fill operation requires a non-empty query_graph field as part of the Message, whereas other operations, e.g. overlay, require non-empty results and knowledge_graph fields.
log_level any The least critical level of logs to return
workflow any List of workflow steps to be executed.

AsyncQuery

The AsyncQuery class is effectively the same as the Query class but it requires a callback property.

Fixed Fields
Field Name Type Description
callback string REQUIRED. Upon completion, this server will send a POST request to the callback URL with Content-Type: application/json header and request body containing a JSON-encoded Response object. The server MAY POST Response objects before work is fully complete to provide interim results with a Response.status value of 'Running'. If a POST operation to the callback URL does not succeed, the server SHOULD retry the POST at least once.
message Message REQUIRED. The query Message is a serialization of the user request. Content of the Message object depends on the intended TRAPI operation. For example, the fill operation requires a non-empty query_graph field as part of the Message, whereas other operations, e.g. overlay, require non-empty results and knowledge_graph fields.
log_level any The least critical level of logs to return
workflow any List of workflow steps to be executed.

Response

The Response object contains the main payload when a TRAPI query endpoint interprets and responds to the submitted query successfully (i.e., HTTP Status Code 200). The message property contains the knowledge of the response (query graph, knowledge graph, and results). The status, description, and logs properties provide additional details about the response.

Fixed Fields
Field Name Type Description
message Message REQUIRED. Contains the knowledge of the response (query graph, knowledge graph, and results).
status string One of a standardized set of short codes, e.g. Success, QueryNotTraversable, KPsNotAvailable
description string A brief human-readable description of the outcome
logs [LogEntry] Log entries containing errors, warnings, debugging information, etc
workflow any List of workflow steps that were executed.

Message

The message object holds the main content of a Query or a Response in three properties: query_graph, results, and knowledge_graph. The query_graph property contains the query configuration, the results property contains any answers that are returned by the service, and knowledge_graph property contains lists of edges and nodes in the thought graph corresponding to this message. The content of these properties is context-dependent to the encompassing object and the TRAPI operation requested.

Fixed Fields
Field Name Type Description
results [Result] List of all returned Result objects for the query posed. The list SHOULD NOT be assumed to be ordered. The 'score' property,
if present, MAY be used to infer result rankings.
query_graph any QueryGraph object that contains a serialization of a query in the form of a graph
knowledge_graph any KnowledgeGraph object that contains lists of nodes and edges in the thought graph corresponding to the message

LogEntry

The LogEntry object contains information useful for tracing and debugging across Translator components. Although an individual component (for example, an ARA or KP) may have its own logging and debugging infrastructure, this internal information is not, in general, available to other components. In addition to a timestamp and logging level, LogEntry includes a string intended to be read by a human, along with one of a standardized set of codes describing the condition of the component sending the message.

Fixed Fields
Field Name Type Description
timestamp string Timestamp in ISO 8601 format
level any
code string One of a standardized set of short codes e.g. QueryNotTraversable, KPNotAvailable, KPResponseMalformed
message string A human-readable log message

LogLevel

Logging level

string

one of:

  • ERROR
  • WARNING
  • INFO
  • DEBUG

Result

A Result object specifies the nodes and edges in the knowledge graph that satisfy the structure or conditions of a user-submitted query graph. It must contain a NodeBindings object (list of query graph node to knowledge graph node mappings) and an EdgeBindings object (list of query graph edge to knowledge graph edge mappings).

Fixed Fields
Field Name Type Description
node_bindings Map[string, [NodeBinding]] REQUIRED. The dictionary of Input Query Graph to Result Knowledge Graph node bindings where the dictionary keys are the key identifiers of the Query Graph nodes and the associated values of those keys are instances of NodeBinding schema type (see below). This value is an array of NodeBindings since a given query node may have multiple knowledge graph Node bindings in the result.
edge_bindings Map[string, [EdgeBinding]] REQUIRED. The dictionary of Input Query Graph to Result Knowledge Graph edge bindings where the dictionary keys are the key identifiers of the Query Graph edges and the associated values of those keys are instances of EdgeBinding schema type (see below). This value is an array of EdgeBindings since a given query edge may resolve to multiple knowledge graph edges in the result.
score number A numerical score associated with this result indicating the relevance or confidence of this result relative to others in the returned set. Higher MUST be better.

NodeBinding

Fixed Fields
Field Name Type Description
id CURIE REQUIRED. An instance of NodeBinding is a single KnowledgeGraph Node mapping, identified by the corresponding 'id' object key identifier of the Node within the Knowledge Graph. Instances of NodeBinding may include extra annotation (such annotation is not yet fully standardized).

EdgeBinding

A instance of EdgeBinding is a single KnowledgeGraph Edge mapping, identified by the corresponding 'id' object key identifier of the Edge within the Knowledge Graph. Instances of EdgeBinding may include extra annotation (such annotation is not yet fully standardized).

Fixed Fields
Field Name Type Description
id string REQUIRED. The key identifier of a specific KnowledgeGraph Edge.
attributes [Attribute] A list of attributes providing further information about the edge binding. This is not intended for capturing edge attributes and should only be used for properties that vary from result to result.

KnowledgeGraph

The knowledge graph associated with a set of results. The instances of Node and Edge defining this graph represent instances of biolink:NamedThing (concept nodes) and biolink:Association (relationship edges) representing (Attribute) annotated knowledge returned from the knowledge sources and inference agents wrapped by the given TRAPI implementation.

Fixed Fields
Field Name Type Description
nodes Map[string, Node] REQUIRED. Dictionary of Node instances used in the KnowledgeGraph, referenced elsewhere in the TRAPI output by the dictionary key.
edges Map[string, Edge] REQUIRED. Dictionary of Edge instances used in the KnowledgeGraph, referenced elsewhere in the TRAPI output by the dictionary key.

QueryGraph

A graph representing a biomedical question. It serves as a template for each result (answer), where each bound knowledge graph node/edge is expected to obey the constraints of the associated query graph element.

Fixed Fields
Field Name Type Description
nodes Map[string, QNode] REQUIRED. The node specifications. The keys of this map are unique node identifiers and the corresponding values include the constraints on bound nodes.
edges Map[string, QEdge] REQUIRED. The edge specifications. The keys of this map are unique edge identifiers and the corresponding values include the constraints on bound edges, in addition to specifying the subject and object QNodes.

QNode

A node in the QueryGraph used to represent an entity in a query. If a CURIE is not specified, any nodes matching the category of the QNode will be returned in the Results.

Fixed Fields
Field Name Type Description
ids [CURIE] CURIE identifier for this node
categories [BiolinkEntity]
is_set boolean Boolean that if set to true, indicates that this QNode MAY have multiple KnowledgeGraph Nodes bound to it within each Result. The nodes in a set should be considered as a set of independent nodes, rather than a set of dependent nodes, i.e., the answer would still be valid if the nodes in the set were instead returned individually. Multiple QNodes may have is_set=True. If a QNode (n1) with is_set=True is connected to a QNode (n2) with is_set=False, each n1 must be connected to n2. If a QNode (n1) with is_set=True is connected to a QNode (n2) with is_set=True, each n1 must be connected to at least one n2.
constraints [QueryConstraint] A list of contraints applied to a query node. If there are multiple items, they must all be true (equivalent to AND)

QEdge

An edge in the QueryGraph used as an filter pattern specification in a query. If the optional predicate property is not specified, it is assumed to be a wildcard match to the target knowledge space. If specified, the ontological inheritance hierarchy associated with the term provided is assumed, such that edge bindings returned may be an exact match to the given QEdge predicate term, or to a term that is a descendant of the QEdge predicate term.

Fixed Fields
Field Name Type Description
predicates [BiolinkPredicate]
subject string REQUIRED. Corresponds to the map key identifier of the subject concept node anchoring the query filter pattern for the query relationship edge.
object string REQUIRED. Corresponds to the map key identifier of the object concept node anchoring the query filter pattern for the query relationship edge.
constraints [QueryConstraint] A list of contraints applied to a query edge. If there are multiple items, they must all be true (equivalent to AND)

Node

A node in the KnowledgeGraph which represents some biomedical concept. Nodes are identified by the keys in the KnowledgeGraph Node mapping.

Fixed Fields
Field Name Type Description
name string Formal name of the entity
categories [BiolinkEntity]
attributes [Attribute] A list of attributes describing the node

Attribute

Generic attribute for a node or an edge that expands the key-value pair concept by including fields for additional metadata. These fields can be used to describe the source of the statement made in a key-value pair of the attribute object, or describe the attribute's value itself including its semantic type, or a url providing additional information about it. An attribute may be further qualified with a SubAttribute (for example to provide confidence intervals on a value).

Fixed Fields
Field Name Type Description
attribute_type_id CURIE REQUIRED. The 'key' of the attribute object, holding a CURIE of an ontology property defining the attribute (preferably the CURIE of a Biolink association slot). This property captures the relationship asserted to hold between the value of the attribute, and the node or edge from which it hangs. For example, that a value of '0.000153' represents a p-value supporting an edge, or that a value of 'ChEMBL' represents the original source of the knowledge expressed in the edge.
original_attribute_name string The term used by the original source of an attribute to describe the meaning or significance of the value it captures. This may be a column name in a source tsv file, or a key in a source json document for the field in the data that held the attribute's value. Capturing this information where possible lets us preserve what the original source said. Note that the data type is string' but the contents of the field could also be a CURIE of a third party ontology term.
value any REQUIRED. Value of the attribute. May be any data type, including a list.
value_type_id any CURIE describing the semantic type of an attribute's value. Use a Biolink class if possible, otherwise a term from an external ontology. If a suitable CURIE/identifier does not exist, enter a descriptive phrase here and submit the new type for consideration by the appropriate authority.
attribute_source string The source of the core assertion made by the key-value pair of an attribute object. Use a CURIE or namespace designator for this resource where possible.
value_url string Human-consumable URL linking to a web document that provides additional information about an attribute's value (not the node or the edge fom which it hangs).
description string Human-readable description for the attribute and its value.
attributes [SubAttribute] A list of attributes providing further information about the parent attribute (for example to provide provenance information about the parent attribute).

SubAttribute

Second-level attribute that can be used to qualify information contained in a primary Attribute attached to a Node or Edge.

Fixed Fields
Field Name Type Description
attribute_type_id CURIE REQUIRED. The 'key' of the attribute object, holding a CURIE of an ontology property defining the attribute (preferably the CURIE of a Biolink association slot). This property captures the relationship asserted to hold between the value of the attribute, and the node or edge from which it hangs. For example, that a value of '0.000153' represents a p-value supporting an edge, or that a value of 'ChEMBL' represents the original source of the knowledge expressed in the edge.
original_attribute_name string The term used by the original source of an attribute to describe the meaning or significance of the value it captures. This may be a column name in a source tsv file, or a key in a source json document for the field in the data that held the attribute's value. Capturing this information where possible lets us preserve what the original source said. Note that the data type is string' but the contents of the field could also be a CURIE of a third party ontology term.
value any REQUIRED. Value of the attribute. May be any data type, including a list.
value_type_id any CURIE describing the semantic type of an attribute's value. Use a Biolink class if possible, otherwise a term from an external ontology. If a suitable CURIE/identifier does not exist, enter a descriptive phrase here and submit the new type for consideration by the appropriate authority.
attribute_source string The source of the core assertion made by the key-value pair of an attribute object. Use a CURIE or namespace designator for this resource where possible.
value_url string Human-consumable URL linking to a web document that provides additional information about an attribute's value (not the node or the edge fom which it hangs).
description string Human-readable description for the attribute and its value.

Edge

A specification of the semantic relationship linking two concepts that are expressed as nodes in the knowledge "thought" graph resulting from a query upon the underlying knowledge source.

Fixed Fields
Field Name Type Description
predicate any
subject CURIE REQUIRED. Corresponds to the map key CURIE of the subject concept node of this relationship edge.
object CURIE REQUIRED. Corresponds to the map key CURIE of the object concept node of this relationship edge.
attributes [Attribute] A list of additional attributes for this edge

BiolinkEntity

Compact URI (CURIE) for a Biolink class, biolink:NamedThing or a child thereof. The CURIE must use the prefix 'biolink:' followed by the PascalCase class name.

string (pattern: ^biolink:[A-Z][a-zA-Z]*$)

Example
"biolink:PhenotypicFeature"

BiolinkPredicate

CURIE for a Biolink 'predicate' slot, taken from the Biolink slot ('is_a') hierarchy rooted in biolink:related_to (snake_case). This predicate defines the Biolink relationship between the subject and object nodes of a biolink:Association defining a knowledge graph edge.

string (pattern: ^biolink:[a-z][a-z_]*$)

Example
"biolink:interacts_with"

CURIE

A Compact URI, consisting of a prefix and a reference separated by a colon, such as UniProtKB:P00738. Via an external context definition, the CURIE prefix and colon may be replaced by a URI prefix, such as http://identifiers.org/uniprot/, to form a full URI.

string

MetaKnowledgeGraph

Knowledge-map representation of this TRAPI web service. The meta knowledge graph is composed of the union of most specific categories and predicates for each node and edge.

Fixed Fields
Field Name Type Description
nodes Map[string, MetaNode] REQUIRED. Collection of the most specific node categories provided by this TRAPI web service, indexed by Biolink class CURIEs. A node category is only exposed here if there is node for which that is the most specific category available.
edges [MetaEdge] REQUIRED. List of the most specific edges/predicates provided by this TRAPI web service. A predicate is only exposed here if there is an edge for which the predicate is the most specific available.

MetaNode

Description of a node category provided by this TRAPI web service.

Fixed Fields
Field Name Type Description
id_prefixes [string] REQUIRED. List of CURIE prefixes for the node category that this TRAPI web service understands and accepts on the input.
attributes [MetaAttribute] Node attributes provided by this TRAPI web service.

MetaEdge

Edge in a meta knowledge map describing relationship between a subject Biolink class and an object Biolink class.

Fixed Fields
Field Name Type Description
subject BiolinkEntity REQUIRED. Subject node category of this relationship edge.
predicate BiolinkPredicate REQUIRED. Biolink relationship between the subject and object categories.
object BiolinkEntity REQUIRED. Object node category of this relationship edge.
attributes [MetaAttribute] Edge attributes provided by this TRAPI web service.

MetaAttribute

Fixed Fields
Field Name Type Description
attribute_type_id CURIE REQUIRED. Type of an attribute provided by this TRAPI web service (preferably the CURIE of a Biolink association slot)
attribute_source string Source of an attribute provided by this TRAPI web service.
original_attribute_names [string] Names of an the attribute as provided by the source.
constraint_use boolean Indicates whether this attribute can be used as a query constraint.
constraint_name string Human-readable name or label for the constraint concept. Required whenever constraint_use is true.

QueryConstraint

Generic query constraint for a query node or query edge

Fixed Fields
Field Name Type Description
id any REQUIRED. CURIE of the concept being constrained. For properties defined by the Biolink model this SHOULD be a biolink CURIE. otherwise, if possible, from the EDAM ontology. If a suitable CURIE does not exist, enter a descriptive phrase here and submit the new type for consideration by the appropriate authority.
name string REQUIRED. Human-readable name or label for the constraint concept. If appropriate, it SHOULD be the term name of the CURIE used as the 'id'. This is redundant but required for human readability.
not boolean
operator string REQUIRED. Relationship between the database value and the constraint value for the specified id. The operators ==, >, and < mean is exactly equal to, is greater than, and is less than, respectively. The 'matches' operator indicates that the value is a regular expression to be evaluated. If value is a list type, then at least one evaluation must be true (equivalent to OR). This means that the == operator with a list acts like a SQL 'IN' clause. The 'not' property negates the operator such that not and == means 'not equal to' (or 'not in' for a list), and not > means <=, and not < means >=, and not matches means does not match. The '==' operator SHOULD NOT be used in a manner that describes an "is a" subclass relationship for the parent QNode.
value any REQUIRED. Value of the attribute. May be any data type, including a list. If the value is a list and there are multiple items, at least one comparison must be true (equivalent to OR). If 'value' is of data type 'object', the keys of the object MAY be treated as a list. A 'list' data type paired with the '>' or '<' operators will encode extraneous comparisons, but this is permitted as it is in SQL and other languages.
unit_id any CURIE of the units of the value or list of values in the 'value' property. The Units of Measurement Ontology (UO) should be used if possible. The unit_id MUST be provided for (lists of) numerical values that correspond to a quantity that has units.
unit_name any Term name that is associated with the CURIE of the units of the value or list of values in the 'value' property. The Units of Measurement Ontology (UO) SHOULD be used if possible. This property SHOULD be provided if a unit_id is provided. This is redundant but recommended for human readability.