GithubHelp home page GithubHelp logo

crate-ci / typos Goto Github PK

View Code? Open in Web Editor NEW
2.1K 2.1K 78.0 16.88 MB

Source code spell checker

License: Apache License 2.0

Rust 99.92% Shell 0.08% Dockerfile 0.01% Python 0.01% Julia 0.01% PHP 0.01%
cli code-quality rust spell-checker

typos's People

Contributors

alatiera avatar augustelalande avatar bnjbvr avatar clo4 avatar damianbarabonkovqc avatar delgan avatar dependabot-preview[bot] avatar dependabot[bot] avatar dosisod avatar dupgit avatar epage avatar flakebi avatar foriequal0 avatar halkeye avatar hamdor avatar jiralite avatar jplatte avatar kachick avatar leodog896 avatar ncfavier avatar not-my-profile avatar peter-kehl avatar phip1611 avatar renovate[bot] avatar scop avatar shirayu avatar steffahn avatar szepeviktor avatar tekumara avatar tklauser avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

typos's Issues

Layered config

For large projects, it can be helpful to support layered configs.

Calculate line number / line offset only when typo is found?

Right now we proactively parse out lines and then parse within a line. What if instead we found out our line number by counting the new lines afterwards? This puts the cost on the typo case, which should be rare, rather than on every case when parsing

Custom dictionaries

Source

  • passed in on cli
  • found on disc

Include

  • file type definitions
  • per file type corrections

Perf: remove allocation when case correcting by switching to KString

KStringCow has the following states:

  • Box<str>
  • 'static str
  • 's str
  • inlined string

If we add a From to it, we can possibly detect being able to use the inline string and write straight to it, avoiding the allocation when case correcting.

In addition, we'd be dropping from 4 machine words to 3 machine words iirc.

Per-file type identifier rules

We'll to define file types and what traits those file types should have (specialized dictionaries, _ / - as identifier characters, and whether escape sequences are supported (#3).

This can then be extended into a config file that works with custom dictionaries (#9) to allow the user to override existing file type definitions or add their own.

Config file support

We're developing a lot of flags. It'd be good if we added a config file so people can easily get a consistent experience

Custom ignores?

Some times files should just be ignore for spelling but work for all others

Audit API

The API has gone through some churn. We should audit it before 1.0 to make sure its something we want.

Fill in misspell-go's comparison

Add benchmarks

Possibly steal ripgreps cases

Compare to scspell, the go one that we took the list from, and some kind of baseline search, like ripgrep

Calculate line number / line offset on-demand?

Right now we proactively parse out lines and then parse within a line. What if instead we found out our line number by counting the new lines afterwards? This puts the cost on the typo case, which should be rare, rather than on every case when parsing

Support file types embedded in file types?

With #14, we're going to have special handing of different file types but one file isn't always a single type

  • markdown files that have code fences
    • treat markdown as non-code (no identifier support), `` as generic code, and code-fences as the specified language
  • rust comments have markdown which have code fences
  • mako files are a mixture of python and whatever the generated type will be.

Support an any-dialict mode

Currently, all corrections force into a single english dialect. This will cause a lot more failures in a CI/. We should support any dialect.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.