GithubHelp home page GithubHelp logo

hansbambel / fscrawler Goto Github PK

View Code? Open in Web Editor NEW

This project forked from dadoonet/fscrawler

0.0 0.0 0.0 14.49 MB

Elasticsearch File System Crawler (FS Crawler)

Home Page: https://fscrawler.readthedocs.io/

License: Apache License 2.0

Shell 0.89% Java 90.35% HTML 4.24% Batchfile 0.17% Dockerfile 0.04% Rich Text Format 4.31%

fscrawler's Introduction

File System Crawler for Elasticsearch

Welcome to the FS Crawler for Elasticsearch

This crawler helps to index binary documents such as PDF, Open Office, MS Office.

Main features:

  • Local file system (or a mounted drive) crawling and index new files, update existing ones and removes old ones.
  • Remote file system over SSH/FTP crawling.
  • REST interface to let you "upload" your binary documents to elasticsearch.

Latest versions

Current "most stable" versions are:

Elasticsearch FS Crawler Released Docs
6.x, 7.x, 8.x 2.10-SNAPSHOT 2.10-SNAPSHOT

Maven Central GitHub Release Date Maven metadata URL GitHub last commit

Docker Pulls Docker Image Size (tag) Docker Image Version (latest semver)

Build and Quality Status

Build Documentation Status

Lines of Code Duplicated Lines (%) Maintainability Rating Technical Debt Reliability Rating

Vulnerabilities Bugs Quality Gate Status Code Smells Security Rating

GitHub stats

GitHub commits since latest release (by SemVer including pre-releases) GitHub commit activity (branch) GitHub contributors

GitHub issues GitHub pull requests

Documentation

The guide has been moved to ReadTheDocs.

X (formerly Twitter) Follow

Contribute

Works on my machine - and yours ! Spin up pre-configured, standardized dev environments of this repository, by clicking on the button below.

Open in Gitpod

License

GitHub

Read more about the Apache2 License.

Thanks

Thanks to JetBrains for the IntelliJ IDEA License!

Thanks to SonarCloud for the free analysis!

SonarCloud

fscrawler's People

Contributors

babadofar avatar barts2108 avatar cadm-frank avatar chrissound avatar circuitguy avatar coder-sa avatar dadoonet avatar dependabot-preview[bot] avatar dependabot[bot] avatar eternallybaffled avatar fgaujous avatar helsonxiao avatar iadcode avatar ian-cameron avatar it20one avatar janhoy avatar kikkauz avatar kneubi avatar koopmac avatar logicer16 avatar mario-89 avatar mergify[bot] avatar muraken720 avatar quix0r avatar rhaist avatar shadiakiki1986 avatar shahariaazam avatar tommylike avatar xcorail avatar ywjung avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.