DSV-Analyzer

This is a multi-threaded tool for analyzing files in DSV (delimiter-separated values) format.
For each file found, the tool guesses its format and structure:

  • which delimiter is used (comma, tab, or semicolon),
  • which decimal separator (dot or comma) and thousands separator (dot, comma, or space) are used in numbers,
  • which date format is used (DD/MM/YYYY, MM/DD/YYYY, YYYY/MM/DD, DD.MM.YYYY, MM.DD.YYYY, or YYYY.MM.DD),
  • which data type each column holds (number, date, or string).
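
As an illustration of the first guess, here is a minimal sketch of one possible delimiter heuristic. It is not taken from the project's source: the class name DelimiterGuesser and the rule "pick the candidate that occurs the same non-zero number of times on every line" are assumptions made for this example, and quoted fields are ignored.

```java
import java.util.List;

// Hypothetical helper, not part of dsv-analyzer: a naive delimiter heuristic.
final class DelimiterGuesser {
    // Candidate delimiters named in the README: comma, tab, semicolon.
    private static final char[] CANDIDATES = {',', '\t', ';'};

    /**
     * Returns the candidate that occurs the same non-zero number of times
     * on every line (preferring the highest count), or '\0' if none does.
     */
    static char guess(List<String> lines) {
        char best = '\0';
        int bestCount = 0;
        for (char candidate : CANDIDATES) {
            int count = consistentCount(lines, candidate);
            if (count > bestCount) {
                best = candidate;
                bestCount = count;
            }
        }
        return best;
    }

    // Per-line occurrence count if it is identical on all lines, otherwise 0.
    private static int consistentCount(List<String> lines, char c) {
        int expected = -1;
        for (String line : lines) {
            int n = 0;
            for (int i = 0; i < line.length(); i++) {
                if (line.charAt(i) == c) {
                    n++;
                }
            }
            if (expected == -1) {
                expected = n;
            } else if (n != expected) {
                return 0;
            }
        }
        return Math.max(expected, 0);
    }

    public static void main(String[] args) {
        List<String> sample = List.of(
                "id;amount;date",
                "1;1.234,50;25/12/2020",
                "2;10,00;13/01/2021");
        System.out.println(guess(sample) == ';' ? "semicolon" : "other");
    }
}
```

In the sample, the comma appears inside numbers but not in the header line, so its per-line count is inconsistent and the semicolon is chosen.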

Assumptions

  1. If a number contains separators, it must contain both a decimal separator and a thousands separator.
  2. Every column follows a single format (e.g., if a column holds dates, all of its entries use the same date format, such as DD/MM/YYYY).
  3. If a column holds dates, its format can be detected from the column's own data (i.e., the column must contain at least one entry with a day greater than 12, so that DD/MM/YYYY can be distinguished from MM/DD/YYYY).
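
Assumption 3 can be illustrated with a short sketch. The class DateOrderGuesser and its return labels ("AMBIGUOUS", "UNKNOWN") are hypothetical names chosen for this example, not the project's actual implementation; the code only shows why a value greater than 12 is needed to tell the day field from the month field.

```java
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical helper, not part of dsv-analyzer: guesses the field order of a date column.
final class DateOrderGuesser {
    /**
     * Returns a format label such as "DD/MM/YYYY", "AMBIGUOUS" when no
     * field exceeds 12, or "UNKNOWN" when the values cannot be interpreted.
     */
    static String guess(List<String> values) {
        if (values.isEmpty()) {
            return "UNKNOWN";
        }
        // Assumes the whole column uses one separator (assumption 2).
        String sep = values.get(0).contains("/") ? "/" : ".";
        boolean firstOver12 = false;
        boolean secondOver12 = false;
        for (String value : values) {
            String[] parts = value.trim().split(Pattern.quote(sep));
            if (parts.length != 3) {
                return "UNKNOWN";
            }
            // A four-digit first field is taken to be the year.
            if (parts[0].length() == 4) {
                return "YYYY" + sep + "MM" + sep + "DD";
            }
            try {
                if (Integer.parseInt(parts[0]) > 12) {
                    firstOver12 = true;
                }
                if (Integer.parseInt(parts[1]) > 12) {
                    secondOver12 = true;
                }
            } catch (NumberFormatException e) {
                return "UNKNOWN";
            }
        }
        if (firstOver12 && secondOver12) {
            return "UNKNOWN"; // entries contradict each other
        }
        if (firstOver12) {
            return "DD" + sep + "MM" + sep + "YYYY";
        }
        if (secondOver12) {
            return "MM" + sep + "DD" + sep + "YYYY";
        }
        return "AMBIGUOUS"; // no field above 12, so day and month cannot be told apart
    }

    public static void main(String[] args) {
        System.out.println(guess(List.of("01/02/2020", "25/12/2020")));
        System.out.println(guess(List.of("01/02/2020", "03/04/2020")));
    }
}
```

A column such as 01/02/2020, 03/04/2020 stays ambiguous under this sketch, which is the case assumption 3 rules out.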

How to Run?

  1. Open Properties.config and configure the settings.
  2. Open Controller/Controller.java and run the main method.
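
The README does not list the settings in Properties.config. Assuming the file uses the standard Java key=value properties syntax (an assumption, not confirmed by the source), its contents could be inspected with a sketch like the following. The class ConfigPeek and the key inputDirectory are hypothetical and used only for illustration.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.Properties;

// Hypothetical helper, not part of dsv-analyzer: reads key=value settings.
final class ConfigPeek {
    // Parses settings from any Reader, e.g. a FileReader on Properties.config.
    static Properties load(Reader reader) throws IOException {
        Properties props = new Properties();
        props.load(reader);
        return props;
    }

    public static void main(String[] args) throws IOException {
        // "inputDirectory" is a made-up key; the real keys are defined by the project.
        Properties props = load(new StringReader("inputDirectory=./data\n"));
        props.forEach((key, value) -> System.out.println(key + " = " + value));
    }
}
```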
