GithubHelp home page GithubHelp logo

antipodite / concepticon-data Goto Github PK

View Code? Open in Web Editor NEW

This project forked from concepticon/concepticon-data

0.0 0.0 0.0 124.02 MB

The curation repository for the data behind Concepticon.

Home Page: https://concepticon.clld.org

Python 0.08% TeX 96.52% Rascal 0.65% HTML 0.90% JavaScript 1.86%

concepticon-data's Introduction

CLLD Concepticon

Build Status

The data underlying the Concepticon of the CLLD project is maintained in this repository. Here, you can find

Concepticon Data

  • For an overview on the status of all currently linked conceptlists, see here.
  • For basic information on metadata, see here.
  • For information on how you can contribute to the project or profit from the data sources we offer, see here.

Data Structure

  • conceptlists/ folder contains conceptlists with links to IDs in concepticon.tsv, the lists are named after the first person who proposed them, the year of the reference publication in which we extracted them, and the number of concepts. All these three parts of information are separated by a dash. Furthermore, in cases where two lists would have an identical name, we add alphabetical letters to the lists to distinguish them. Files need to have the columns "GLOSS" (some still have "ENGLISH" instead, but this needs to be changed), additionally, most (if not all files) have a "NUMBER" field indicating the number in the reference, which is also important for ordering the entries as given in the original source. Additional columns are more or less free to the user, but we tried to be consistent.
  • conceptlists.tsv contains metadata about the lists in conceptlists/.
  • references/references.bib the bibtex file showing links to all concept lists (bibtex-key identical to the name of the conceptlist file, without file-ending. File further contains links to the references in which the conceptlists were published (references stored in the "crossref" field).
  • sources/ contains pdf-files of each conceptlist (only the list-parts, not the full publications for copyright reasons), naming is the same as for the conceptlists, but with the ending ".pdf" instead of ".tsv".
  • concepticon.tsv the backbone concept list. All concepts from individual concept lists are linked to entries in this file.
  • concept_set_meta/ contains lists of metadata, relating concept sets to additional information, e.g. on Wikipedia. These lists are described by accompanying metadata files following the recommendations of the Model for Tabular Data and Metadata on the Web.
  • app/ contains data for running the JavaScript-based Concepticon lookup tool.

Update policy

We try to release concepticon-data (as well as the concepticon web app) regularly at least once a year. Generally, new releases should only become more comprehensive, i.e. all data ever released should also be part of the newest release. Occasionally, though, we may have to correct an erratum, which may result in some data being removed, or changes in identifiers of objects. So whenever a link to the web app breaks or a script using the concepticon-data API throws an error, you should consult the list of errata to see, whether an error correction may be the reason for this behaviour.

pyconcepticon

pyconcepticon provides a Python package to programmatically access Concepticon data.

concepticon-data's People

Contributors

anaphory avatar annikatjuka avatar blag avatar carolinhu avatar chrzyki avatar cysouw avatar evoling avatar ilchec avatar kristina-pianykh avatar laiyunfan avatar lannin avatar lingulist avatar macyl avatar natalia-morozova avatar phylostar avatar schweikhard avatar simongreenhill avatar stasreichert avatar tresoldi avatar wu-urbanek avatar xrotwang avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.