GithubHelp home page GithubHelp logo

starlangsoftware / corpus-swift Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 1.0 2.16 MB

Corpus processing library

License: GNU General Public License v3.0

Swift 100.00%
sentence-tokenizer corpus-processing sentence-segmentation turkish-sentence-segmentation turkish-sentence-tokenizer

corpus-swift's Introduction

Corpus

Video Lectures

For Developers

You can also see Java, C, Python, Cython, C#, Js, or C++ repository.

Requirements

  • Xcode Editor
  • Git

Git

Install the latest version of Git.

Download Code

In order to work on code, create a fork from GitHub page. Use Git for cloning the code to your local or below line for Ubuntu:

git clone <your-fork-git-link>

A directory called Corpus-Swift will be created. Or you can use below link for exploring the code:

git clone https://github.com/starlangsoftware/Corpus-Swift.git

Open project with XCode

To import projects from Git with version control:

  • XCode IDE, select Clone an Existing Project.

  • In the Import window, paste github URL.

  • Click Clone.

Result: The imported project is listed in the Project Explorer view and files are loaded.

Compile

From IDE

After being done with the downloading and opening project, select Build option from Product menu. After compilation process, user can run Corpus-Swift.

Detailed Description

Corpus

To store a corpus in memory

a = Corpus(fileName: "derlem.txt")

If this corpus is split with dots but not in sentences

init(fileName: String, sentenceSplitter: SentenceSplitter)

To eliminate the non-Turkish sentences from the corpus

init(fileName: String, languageChecker: LanguageChecker)

The number of sentences in the corpus

func sentenceCount() -> Int

To get ith sentence in the corpus

func getSentence(index: Int) -> Sentence

TurkishSplitter

TurkishSplitter class is used to split the text into sentences in accordance with the . rules of Turkish.

func split(line: String) -> [Sentence]

corpus-swift's People

Contributors

olcaytaner avatar

Watchers

 avatar

Forkers

cbedre

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.