GithubHelp home page GithubHelp logo

keeferrourke / pyapa Goto Github PK

View Code? Open in Web Editor NEW
2.0 3.0 1.0 32 KB

A Python module for checking APA style. Not a papaya.

Home Page: https://pypi.python.org/pypi/pyapa/

License: ISC License

Python 96.37% Makefile 2.01% Shell 1.61%
python python3 apa writing style-checker regex pyapa apacheck

pyapa's Introduction

pyapa

A Python module for checking APA style in writing.

This module provides an abstraction class, mostly composed of regular expressions. This project does not aim to cover the entire APA style guide, but it does attempt to cover several common errors with style.

To make errors easier to visually locate, 5 characters of context are sometimes given, depending on the detected error.

Installation

Clone this repository and run:

make install

or install from Pypi:

pip3 install pyapa

Information

Pypi: https://pypi.python.org/pypi/pyapa/

Github: https://github.com/keeferrourke/pyapa

Contact: https://krourke.org/contact

License: ISC License

pyapa.ApaCheck object class

This class documents regular expressions to match style errors.

The pyapa.ApaCheck.match() method generates an array of ApaMatch objects and stores them in the ApaCheck.Matches list for easy access.

pyapa.ApaMatch object class

This class contains the following members:

  • ApaMatch.feedback a unicode string intended to hold a brief explanation of an associated error
  • ApaMatch.see a URL string intended to hold an external reference to an APA Style guide
  • ApaMatch.end the position of the character in a text which marks the end of the target character span
  • ApaMatch.start the position of the character in a text which marks the beginning of the target character span
  • ApaMatch.target the matching string that contains an error
  • ApaMatch.suggestions a list of suggested replacement strings; at this time, the list is only ever one element long

There are also two helper functions to make printing these objects easy:

  • ApaMatch.print() prints the object to stdout
  • ApaMatch.sprint() builds a string containing information about the object

Example usage

From the interpreter:

>>> from pyapa import pyapa
>>> a = pyapa.ApaCheck()
>>> text = u'Papaya are delicious fruit, it was concluded (Author, et al. 2017).'
>>> a.match(text)
[<pyapa.pyapa.ApaMatch object at 0x000000000000>]
>>> len(a.Matches)
1
>>> a.Matches[0].print()
Match from 46 to 65 for:
Target: Author, et al. 2017
Feedback: Do not put a comma before 'et al.'
Suggestion: Author et al. 2017

Calling the module itself:

python3 -m pyapa -i input.txt
python3 -m pyapa -h
Usage:
pyapa [-h] [--version] [-o OUTPUT_FILE] -i INPUT_FILE

pyapa's People

Contributors

harpunius avatar keeferrourke avatar sophiekovalevsky avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar  avatar

Forkers

harpunius

pyapa's Issues

Add more regexes

Current regexes are inspired from the common errors cited by @JonathanAquino.

ApaStyle.org has a rather extensive FAQ, from which other patterns can likely be derived. The class structure in pyapa.py is very simple, so it should be trivial for new contributors add on to this.

Reference-dot match captures too much

When run on input

Hans & Yorke (2006) contradict existing literature (Hans and Yorke, 2006).

the program outputs

Match from 9 to 22 for:
Target: Yorke (2006)
Feedback: References go at the end of sentences; end your sentence after the reference.
Suggestion: Yorke (2006).

which is not correct (from my understanding, and as referenced here.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.