GithubHelp home page GithubHelp logo

gt-labelling's People

Stargazers

 avatar

Watchers

 avatar  avatar  avatar

gt-labelling's Issues

missing metadata: e.g. for leveling, antiqua, special characters, letter spaced, umlaut, old greek

@bertsky:
I am missing some metadata for the following cases:

Can you add something? @bertsky

tables: improve documentation

To me, it's not obvious to what certain labels pertain in particular:

  • <xsd:enumeration value="content-type/metadata/structure">
    <xsd:annotation>
    <xsd:documentation xml:lang="en">Structure of an object of some sort
    Examples:
    Document structure,
    Table structure</xsd:documentation>
    </xsd:annotation>
    </xsd:enumeration>
    <xsd:enumeration value="content-type/metadata/structure/toc">
    <xsd:annotation>
    <xsd:documentation xml:lang="en">Table of contents of a book, newspaper etc.</xsd:documentation>
    </xsd:annotation>
    </xsd:enumeration>
    → what exactly is this for?
  • <xsd:enumeration value="activityDomain/computing/visual/analysisRecognition/tables">
    <xsd:annotation>
    <xsd:documentation xml:lang="en">The recognition of table/form structure and/or contents.
    Examples:
    Stock exchange data in a newspaper,
    Filled in questionaire form
    Related:
    OCR
    Object / shape recognition (e.g. table separator detection)</xsd:documentation>
    </xsd:annotation>
    → is this detection (whether, identity) or recognition (what, structure)?
  • <xsd:enumeration value="contentOfInterest/visual/composite/tables"/>
    → "material contains tables" ?
  • <xsd:enumeration value="granularity/logical/table/column">
    <xsd:annotation>
    <xsd:documentation xml:lang="en">Table column</xsd:documentation>
    → "annotation contains TableRegion/Roles/TableCellRole/@columnIndex" ?
  • <xsd:enumeration value="granularity/logical/table/row">
    <xsd:annotation>
    <xsd:documentation xml:lang="en">Table row</xsd:documentation>
    → "annotation contains TableRegion/Roles/TableCellRole/@rowIndex" ?
  • <xsd:enumeration value="granularity/logical/table/cell">
    <xsd:annotation>
    <xsd:documentation xml:lang="en">Table cell</xsd:documentation>
    → "annotation contains one TableRegion/TextRegion for each table cell" ?
  • <xsd:enumeration value="content-encoding/structured/tabular">
    <xsd:annotation>
    <xsd:documentation xml:lang="en">Content encoded in tabular form
    Examples:
    A tab-separated table with headings and values</xsd:documentation>
    </xsd:annotation>
    </xsd:enumeration>
    → does that mean: continuous text that happens to be typeset with tabs or ellipses, but does not formally constitute a table (and should not get annotated as such)?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.