GithubHelp home page GithubHelp logo

wassname / pysle Goto Github PK

View Code? Open in Web Editor NEW

This project forked from timmahrt/pysle

0.0 2.0 0.0 262 KB

Python interface to ISLEX, an English IPA pronunciation dictionary with syllable and stress marking.

License: MIT License

Python 100.00%

pysle's Introduction

pysle

https://img.shields.io/badge/license-MIT-blue.svg?

Pronounced like 'p' + 'isle'.

An interface for the ILSEX (international speech lexicon) dictionary, along with some tools for working with comparing and aligning pronunciations (e.g. a list of phones someone said versus a standard or canonical dictionary pronunciation).

1   Common Use Cases

What can you do with this library?

  • look up the list of phones and syllables for canonical pronunciations of a word:

    pysle.isletool.LexicalTool.lookup('cat')
    
  • map an actual pronunciation to a dictionary pronunciation (can be used to automatically find speech errors):

    pysle.pronunciationtools.findClosestPronunciation(isleDict, 'cat', ['k', 'æ',])
    
  • automatically syllabify a praat textgrid containing words and phones (e.g. force-aligned text) -- requires my praatIO library:

    pysle.syllabifyTextgrid(isleDict, praatioTextgrid, "words", "phones")
    
  • search for words based on pronunciation:

    e.g. Words that start with a sound, or have a sound word medially, or
    in stressed vowel position, etc.
    
    see /tests/dictionary_search.py
    

2   Major revisions

Ver 1.4 (July 9, 2016)

  • added search functionality
  • ported code to use the new unicode IPA-based isledict (the old one was ascii)

Ver 1.3 (March 15, 2016)

  • added indicies for stressed vowels

Ver 1.2 (June 20, 2015)

  • Python 3.x support

Ver 1.1 (January 30, 2015)

  • word lookup ~65 times faster

Ver 1.0 (October 23, 2014)

  • first public release.

3   Requirements

  • Before you use this library (before or after installing it) you will need to download the ILSEX dictionary. It can be downloaded here under the section 'English' linked under the text 'English Pronlex' (with a file name of ISLEdict.txt):

    ISLEX project page

    Direct link to the ISLEX file used in this project (ISLEdict.txt)

  • Python 2.7.* or above

  • Python 3.3.* or above

  • The praatIO library is required IF you want to use the textgrid functionality. It is not required for normal use.

4   Installation

If you on Windows, you can use the installer found here (check that it is up to date though) Windows installer

Otherwise, to manually install, after downloading the source from github, from a command-line shell, navigate to the directory containing setup.py and type:

python setup.py install

If python is not in your path, you'll need to enter the full path e.g.:

C:\Python27\python.exe setup.py install

5   Example usage

Here is a typical common usage:

from pysle import isle
isleDict = isle.LexicalTool('C:\islev2.dict')
print isleDict.lookup('catatonic')[0] # Get the first pronunciation
>> [['k', 'ˌæ'], ['t˺', 'ə'], ['t', 'ˈɑ'], ['n', 'ɪ', 'k']] [2, 0]

and another:

from pysle import isle
from psyle import pronunciationTools

searchWord = 'another'
anotherPhoneList = ['n', '@', 'th', 'r'] # Actually produced (ASCII or IPA ok here)

returnList = pronunciationTools.findBestSyllabification(isleDict,
                                                        searchWord,
                                                        anotherPhoneList)
print syllableList
>> [["''"], ['n', '@'], ['th', 'r']]

Please see \examples for example usage

6   Citing pysle

Pysle is general purpose coding and doesn't need to be cited (you should cite the ISLEX project instead) but if you would like to, it can be cited like so:

Tim Mahrt. Pysle. https://github.com/timmahrt/pysle, 2016.

7   Acknowledgements

Development of Pysle was possible thanks to NSF grant IIS 07-03624 to Jennifer Cole and Mark Hasegawa-Johnson, NSF grant BCS 12-51343 to Jennifer Cole, José Hualde, and Caroline Smith, and to the A*MIDEX project (n° ANR-11-IDEX-0001-02) to James Sneed German funded by the Investissements d'Avenir French Government program, managed by the French National Research Agency (ANR).

pysle's People

Contributors

timmahrt avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.