GithubHelp home page GithubHelp logo

ortografe's Introduction

This project contains tools to help with proposed spelling rationalizations for French:

  • a browser extension that applies the spelling change to every visited page, without the user having to take any action like clicking a "translate" button. This is fast enough that you won't see the transcription happening, and it doesn't slow down the browing experience. The extension refrains from running on pages not in French, to avoid messing up English text.

    This extension is deployed:

    • to firefox (computer and phone)
    • to chrome, and presumably all chromium-based browsers. Chrome doesn't support extensions on Android, but some of the other chromium-based browsers do, so the extension should work there.
    • to safari (MacOS and iOS)
  • a website that provides an entry point to all the tools, and in particular transcribe text and documents interactively without installing anything. Source.

  • a CLI tool that:

    • transcribes text and documents in various formats (pure text, .html, .xhtml, .htmlz, .doc, .docx, .odt, .odp, .ppt, .pptx, .epub)
    • creates dictionaries for other kinds of spelling changes (which can be used to convert document, or with the browser extension)
  • dictionaries for virtual keyboards on phones, so it's convenient to write in these adjusted spellings

  • data for this work (it is surprisingly difficult to find structured data)

We may add more tools in the future.

Different spelling changes

Almost none of the code cares about the specific spelling change. The conversion code should work for other spelling changes or other languages, with the following constraints:

  • the language has space-separated words
  • the spelling change can be expressed as a word->word dictionary mapping old spelling to new spelling, without analysis of the context.

For instance, by loading a dictionary containing (dict format doc):

{ "lang": "en" }
theater,theatre
finalize,finalise
learned,learnt

you would British-ify American spellings.

Building locally

The commands below should:

  • build the whole repository (extension, website, CLI converter and dictionary creation)
  • run the tests
  • rebuild everything as you make changes
$ make first-install
$ make all-w

Please open an issue if you want to build this repository, but can't!

How data flows

ortografe's People

Contributors

v-gb avatar

Stargazers

QuantumSoul avatar  avatar  avatar Valentin D. Richard avatar

Watchers

 avatar

ortografe's Issues

Mes proposicions d'ajout seront ici

En même temps que le faire sur Jlailu (Reddit an anglais), je vais poser ici directemant mes proposicions pour que l'extancion soit ancore plus conplète et permète d'anglober dans sa version finale l'ansenble des réformes du français, et pas que d'ordre ortografique ! ^^

  • Tanter une traduccion automatique des anglicismes. J'ai déjà perso fait une liste de traduccion des anglicismes que j'ai trouvé. Mais je vois déjà le problème où l'extancion traduirait des mots anglais directemant sur des pages anglaises, un problème que j'imagine conpliqué à résoudre vu qu'il faudrait que le module détecte si une page est an anglais...

  • Réforme de la conjugaison et des verbes. Je bosse dessus encore, vu le nombre de verbe iréguliers ça va me prendre encore un peu de tenps mais voici un gdoc en ébauche. Si jamais tu as la flème de tout encoder, juste montre moi comant faire et je le ferais :)
    (https://docs.google.com/document/d/1ubpRqfx-zzZdVLl5brH3ISUFdx3nBQpVDG8aV-XkP4M/edit?usp=drivesdk)

  • Substitucion de certains mots en français par d'autres. Je panse à hui à la place du long et inutile pléonasme aujourd'hui, sempre à la place de toujours, moult à la place de beaucoup etc.
    (là c'est juste moi qui aime les mots plus concis et plus proches des autres langues romanes)

  • Racionalisacion ortografique de certains termes et leurs dérivés qui je crois ne sont pas pris en charges par l'extancion en tout cas avec moi
    vingt > vint
    caoutchouc > caoutchou
    seconde > segonde
    Qu'est-ce que c'est > Qu'est-ce
    De la > da (équivalant féminin à du)
    doigt > Doit / doi ?
    Pyrénées > Pirénées
    Méditerranée > Méditéranée
    Wiktionnaire > Wikcionaire

(je rajouterais des mots au fur à mesure que j'an trouve)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.