Skip to content

Table Detection data mismatch in Word subset #42

@vm7608

Description

@vm7608

I have downloaded and checked the TableBank dataset from your dataset homepage

I have found some issues in the annotations, the README denotes the number of tables in the Table Detection task as follows:

Task Word Latex Word+Latex
Table detection 163,417 253,817 417,234

But I ran my script to check the data annotations, it showed that there were only 101889 tables in the Word subset.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions