
Daniel Stoekl's Projects

alignment

Simple Python library for doing (multiple) sequence alignment

ancient-greek-bert

Pre-trained BERT models for Ancient and Medieval Greek, and associated code for the LaTeCH 2021 paper "A Pilot Study for BERT Language Modelling and Morphological Analysis for Ancient and Medieval Greek"

bert

TensorFlow code and pre-trained models for BERT

bible-clusterer

Web application for clustering text data from the LXX and the SBL Greek New Testament

caccht

The repository contains scripts for parsing and analyzing Hebrew texts.

deepfau

Submission to the ICDAR2017 Competition on the Classification of Medieval Handwritings in Latin Script

gpt4all

gpt4all: a chatbot trained on a massive collection of clean assistant data including code, stories and dialogue

handwriting

Handwritten text recognition on the IAM handwriting dataset

ithaca

Restoring and attributing ancient texts using deep neural networks

jpeg-sandbox

Interactively edit individual DCT blocks in any JPEG image in the browser.

kannada-ocr-test-images-with-ground-truth

This Kannada OCR benchmarking dataset contains 250 images, carefully chosen to have various kinds of recognition challenges. Some of the pages have italics and bold characters. Some of them have Halegannada poems and text; others are letterpress-printed pages, where the vowel modifiers appear as separate symbols and do not touch the consonants they go with. Some pages have interspersed English words; still others have tables with a lot of numeric data. In addition, there are old pages containing either a lot of broken characters or many words with two or more characters merged into a single connected component.

kraken

OCR engine for all the languages

labelme

Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).

named-entity-recognition

Notebooks for teaching Named Entity Recognition at the Cultural Heritage Data School, run by Cambridge Digital Humanities, June-July 2020

nn-svg

Publication-ready NN-architecture schematics.
