Topic: corpus-tools
corpus-tools,ACoLi CoNLL libraries: Several tools for processing, manipulating and transforming TSV formats (CoNLL-RDF, CoNLL-Merge, CQP4RDF)
Organization: acoli-repo
corpus-tools,Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
User: adbar
Home Page: https://adrien.barbaresi.eu/blog/simple-multilingual-lemmatizer-python.html
corpus-tools,Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
User: adbar
Home Page: https://trafilatura.readthedocs.io
corpus-tools,Tools for creating speech corpora by extracting audio from YouTube videos
User: aitor-alvarez
corpus-tools,Article title, authors, date and body extraction dataset.
User: andythefactory
corpus-tools,Bitextor generates translation memories from multilingual websites
Organization: bitextor
Home Page: https://bitextor.readthedocs.io/en/latest/
corpus-tools,An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
User: blkserene
corpus-tools,CoNLL-U format parser
Organization: bureaucratic-labs
corpus-tools,📚 Icelandic Corpora Toolkit - A collection of scripts to use with various Icelandic text corpora
Organization: cadia-lvl
corpus-tools,Collector and speech cutter for librivox audiobooks
User: carlfm01
corpus-tools,General Missives in Text-Fabric
Organization: clariah
Home Page: http://resources.huygens.knaw.nl/vocgeneralemissiven
corpus-tools,Scripts for data conversion
Organization: cscfi
corpus-tools,An advanced, extensible web front-end for the Manatee-open corpus search engine
Organization: czcorpus
corpus-tools,An unofficial Python API that allows users to create a corpus of lyrical text from their favorite artists and billboard charts
User: edwardseley
corpus-tools,A Multi-Feature Tagger of English originally designed for multi-feature/multi-dimensional analysis (MDA) (Biber 1988; 1995) of situational variation in standard written and spoken English
User: elenlefoll
corpus-tools,UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
Organization: grammarly
Home Page: https://ua-gec-dataset.grammarly.ai/
corpus-tools,OpusFilter - Parallel corpus processing toolkit
Organization: helsinki-nlp
corpus-tools,Software for multi-level annotation of linguistic corpora
Organization: infraling
corpus-tools,A library of functions enabling complex corpus search in context (KWIC), search aggregation, bag-of-words building & keyphrase extraction.
User: jaytimm
corpus-tools,A parser for annotated MuseScore 3 files.
User: johentsch
Home Page: https://ms3.readthedocs.io
corpus-tools,Rezonator: Dynamics of human engagement
User: johnwdubois
corpus-tools,Scripts for building a geo-located web corpus using Common Crawl data
User: jonathandunn
corpus-tools,Measure the similarity of text corpora for 74 languages
User: jonathandunn
corpus-tools,An open source reimplementation of Benny Brodda's BETA in Python
User: koskenni
corpus-tools,A set of workflows for corpus building through OCR, post-correction and normalisation
Organization: languagemachines
corpus-tools,SpeCT - Speech Corpus Toolkit for Praat. Documentation: https://lennes.github.io/spect/
User: lennes
corpus-tools,Repository providing pre-processed Wikipedia and Simple Wikipedia datasets, along with Python scripts for pre-processing and dataset generation.
User: levimatheus
corpus-tools,Searching in-memory corpus with Corpus Query Language (CQL)
User: liao961120
Home Page: https://yongfu.name/concordancer/
corpus-tools,Script that sets up and configures an entire CQPweb server installation
User: linguista
corpus-tools,Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.
User: m4t1ss
corpus-tools,Library for Python to use Korp API
User: mikahama
corpus-tools,A concordancing program for English with a GUI that can read .docx, .srt, and plaintext files and export concordance lines to .txt, .docx, .tsv, .xlsx, and .html.
User: mikesuhan
corpus-tools,MFTE Python is a Python version of Le Foll's MFTE (Multi-Feature Tagger of English), which was written in Perl. It is extended with semantic tags from Biber (2006) and Biber et al. (1999), along with other specific tags.
User: mshakirdr
corpus-tools,Utilities for Processing the HCRC Map Task Corpus
User: nathanduran
corpus-tools,Utilities for Processing the Meeting Recorder Dialogue Act Corpus
User: nathanduran
corpus-tools,Utilities for Processing the Switchboard Dialogue Act Corpus
User: nathanduran
corpus-tools,Python library for extracting quantitative, reproducible metrics of multi-level alignment between two speakers in naturalistic language corpora.
User: nickduran
corpus-tools,An open-source web-based application for multi-task lexical normalisation
Organization: nlp-tlp
Home Page: https://lexiclean.nlp-tlp.org/
corpus-tools,🛠 Tools to create, edit and export texts and annotations
Organization: openpecha
Home Page: https://toolkit.openpecha.org
corpus-tools,This package provides utility classes and static methods for Python that wrap third-party software commonly used in text processing, such as Unitex-GramLab, TreeTagger, Apache-Tika and Google-Tesseract.
User: petar-popovic-bg
corpus-tools,Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora
User: praaline
corpus-tools,Web-based database for sign language lexicons and corpora. Fork of NGT-signbank (https://github.com/Signbank/Global-signbank).
Organization: signbank
Home Page: https://signbank.csc.fi
corpus-tools,Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework
User: silenterus
corpus-tools,Online parallel text alignment tool.
User: tienzhao
Home Page: http://suoyan.tienzhao.com
corpus-tools,Yet another search platform for linguistic corpora.
User: timarkh
corpus-tools,Reading the data from OPIEC - an Open Information Extraction corpus
Organization: uma-pi1
Home Page: https://www.uni-mannheim.de/dws/research/resources/opiec/
corpus-tools,An Interactive Tool for Annotating Discourse Structure and Text Improvement
User: wiragotama
corpus-tools,Python library for handling audio datasets.
User: ynop
Home Page: https://audiomate.readthedocs.io/
corpus-tools,Text data analysis (Text-Analysis)
User: yongzhuo
Home Page: https://blog.csdn.net/rensihui