Topics

The topics will be available from the TREC Active Participants Site as a newline-delimited set of JSON objects with the following entries:

Submissions

Each submission will take the form of a JSON document, whose root object has the following entries:

Output objects are to be JSON objects with the following entries:

Submissions consisting of malformed or invalid content will be discarded.

Answer Processing

Text in the answer entry will be sentence-tokenized using the SpaCy sentence tokenizer. Any text within square brackets (“[”, “]”) will be treated as a citation list; the structure of a citation list will be assumed to be a comma-delimited list of PMID citations. As described in the task description, only the first three entries in each citation list will be considered; any remaining entries will be discarded.

Each citation PMID is to correspond to one of the entries in the references list. Further, each citation will be assumed to be “assigned” to its enclosing sentence. Square-bracket runs that take place after sentence-final punctuation (i.e., outside of a sentence), will be discarded. Square-bracket runs that occur at the beginning of the document (i.e., before any sentences) will be discarded.

Citation numbers in the answer text are subject to the following constraints:

  1. Citation numbers must be PMIDs restricted to the set of allowed PMIDs in the release dataset.
  2. Citation PMIDs must have corresponding entries in the references list; e.g., if the text contains 12 PMIDs but there are only 11 entries in the references list, the PMID that does not appear in the references list would represent an erroneous citation.

PMIDs included in the references lists are subject to the following constraints:

  1. PMID must exist within the release dataset
  2. PMIDs must occur once and only once in the references list
  3. PMIDs included in the references list must be referred to at least once by a citation in the answer text

Please ask on the mailing list if there are any questions or issues with this specification.