GithubHelp home page GithubHelp logo

loanpydatahub / gerstnerhungarian Goto Github PK

View Code? Open in Web Editor NEW
0.0 2.0 0.0 12.43 MB

CLDF dataset derived from a preprint of 'Új magyar etimológiai szótár' [New Hungarian Etymological Dictionary] by Károly Gerstner (ed.)

Home Page: https://uesz.nytud.hu/index.html

License: Creative Commons Attribution 4.0 International

TeX 2.93% Python 93.01% Shell 1.08% Makefile 2.98%
dataset etymology hungarian

gerstnerhungarian's Introduction

CLDF dataset derived from a preprint of 'Új magyar etimológiai szótár' [New Hungarian Etymological Dictionary] by Károly Gerstner (ed.)

How to cite

If you use these data please cite

  • the original source

    Gerstner, Károly (ed.) (2011-2023). Új magyar Etimológiai Szótár. Hungarian Academy of Sciences, Budapest. http://uesz.nytud.hu/.

  • the derived dataset using the DOI of the particular released version you were using

Description

This dataset is licensed under a CC-BY 4.0 license

Available online at http://uesz.nytud.hu/

Conceptlists in Concepticon:

Notes

License: CC BY 4.0 CircleCI Documentation Status

Statistics

CLDF validation Glottolog: 100% Concepticon: 100% Source: 100% BIPA: 100% CLTS SoundClass: 100%

  • Varieties: 1
  • Concepts: 110
  • Lexemes: 160
  • Sources: 1
  • Synonymy: 1.45
  • Invalid lexemes: 0
  • Tokens: 724
  • Segments: 39 (0 BIPA errors, 0 CLTS sound class errors, 39 CLTS modified)
  • Inventory size (avg): 39.00

Contributors

Name GitHub user Description Role
Károly Gerstner ÚESz [New Hungarian Etymological Dictionary], continuation of EWUng Editor
Loránd Benkő EWUng [Etymological Dictionary of Hungarian], continuation of TESz Editor
Loránd Benkő TESz [A Historical-Etymological Dictionary of Hungarian] Editor
Viktor Martinović @martino-vic CLDF conversion Other
Johann-Mattis List @LinguList CLDF conversion Other

CLDF Datasets

The following CLDF datasets are available in cldf:

gerstnerhungarian's People

Contributors

dependabot[bot] avatar martino-vic avatar

Watchers

 avatar  avatar

gerstnerhungarian's Issues

create command for terminal

@LinguList added this script to create backwards reconstructions based on previously extracted soundchanges, see column "rc" in forms.csv. Should turn this into a command-line command at one point.

Btw, the recostructions are customisable through the parameters, now I made it so that there are max 10 false positives per word. It is also possible to spell out the combinations and run some filters over it :)

add custom columns to EntryTable?

@LinguList I am trying to add custom columns "Year", "Etymology", and "Loan" to the EntryTable, but nothing is happening. I guess I have to define those new columns somehow in the beginning with attr.ib but I'm not sure how to do that, or if it's possible at all to do something like this.

Wait a bit more with versions ;)

I would suggest to tie versions to actual publications, or if you prepare one. If you make a version for every smaller change, Zenodo has to work a lot. Our procedure is: we do new versions always when we have an upcoming publication where we want everything to be clearly identifiable with this particular version.

concept mappings need to be taken with care

The automated procedure surely has its advantages, but it is dangerous to use it without checking the data, and in this case, this becomes very clear. Since the senses offered by the author are so numerous, finding a hit is trivial, but this hit may often be misleading. So I would suggest to raise the bars and think of a future workflow by which we can avoid that you will end up comparing apples with pears, due to problematic mappings, since Concepticon was designed to make concept comparisons more exact in linguistics.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.