Skip to content

Better way to release internally parallel treebanks (e.g. parallel learner treebanks) #1182

@harisont

Description

@harisont

We recently released a parallel learner treebank (learner productions // teacher corrections).
To do that, we followed the same approach as all other treebanks of this kind:

  • Parallel: no in the metadata (because Parallel: ITSELF is neither allowed nor particularly clear)
  • corrections in a separate file in the not-to-release folder.

I wonder whether we could come up with a solution that:

  1. makes it clear from the website that these treebanks are, indeed, parallel
  2. allows including both "halves" in the official release.

I don't have strong opinions about what the best way to achieve this would be, so I'm eager to read your suggestions.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions