codeaudit / xtext
This project is forked from opensextant/xtext
Textual content extraction from multimedia, with modes for crawling folders, websites, and SharePoint. It is Tika-based, but aims to simplify Tika usage for pipelining in extraction applications.
License: Apache License 2.0