GithubHelp home page GithubHelp logo

btargac / excel-parser-processor Goto Github PK

View Code? Open in Web Editor NEW
218.0 10.0 30.0 20.69 MB

Automate downloads from Excel files in seconds. Simply does the tedious, repetitive operations for rows of Excel files and reports the results. It downloads files from URL(s) in column A, if a new filename is provided at column B it will rename before saving. It will even create sub folders if column C is filled with a valid folder name.

License: MIT License

JavaScript 70.65% HTML 6.07% SCSS 23.28%
excel electron electronjs nodejs downloader batch-processing batch download batch-download excel-import excel-parse exceldownload excel-download excel-based rename subfolder autodownloader filedownloader imagedownloader

excel-parser-processor's Introduction

Excel Parser Processor Excel Parser Processor

A Desktop app for processing all rows of Excel files

Simply generates an array of items from the rows of an Excel file and does the repetitive tedious operations step by step till every item of the array is processed. For example downloads all the assets from the URLs from column A in an Excel file.

Dependency Status Build Status Github Tag codecov Backers on Open Collective Sponsors on Open Collective Open Source Helpers CodeFactor CodeQL

How to use

You can download the latest release for your operating system or build it yourself (see Development).

Just select or drag & drop an Excel file, then select the output folder for the downloaded images or files. All the items in the Excel file will be downloaded into the selected folder, and you will be notified about the state of ongoing progress.

Sample Excel file structure

A (resource to download) B (new filename if used) C (subfolder name)
1 https://www.buraktargac.com/sample_image.gif optional-sample-file-name optional-sub-folder-name
2 https://www.buraktargac.com/sample_image.png optional-sample-file-name optional-sub-folder-name
3 https://www.buraktargac.com/sample_image.jpg
. ...
. ...
n Asset URL (any type of file image, text, pdf etc.)

Currently, there is no limit for n, I tested with 4000 items and unless your IP is banned from the publisher there is no problem to download as much as you can.

Demo

Excel Parser Processor Demo

Development

You need to have Node.js installed on your computer in order to develop & build this app.

$ git clone https://github.com/btargac/excel-parser-processor.git
$ cd excel-parser-processor
$ npm install
$ npm run build
$ npm start

If you are changing the view or renderer related things, you can use Webpack's watch feature with

$ npm run start-renderer-dev

After running this command, you'll see a webpack process watching your files after a new renderer.bundle.js is generated you can refresh the Excel parser processor app window with cmd + R or ctrl + R depending on your system.

To generate binaries on your computer after your development is completed, you can run;

$ npm run dist

This will add binaries under /release folder on your project folder.

/release folder is ignored at the repository. Github Actions will be building the binaries after your branch is merged with master.

Contributors

This project exists thanks to all the people who contribute. [Code of Conduct].

Backers

Thank you to all our backers! 🙏 [Become a backer]

Sponsors

Support this project by becoming a sponsor. Your logo will show up here with a link to your website. [Become a sponsor]

License

MIT © Burak Targaç

excel-parser-processor's People

Contributors

btargac avatar dependabot[bot] avatar kant avatar monkeywithacupcake avatar suleymangunduzz avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

excel-parser-processor's Issues

Workaround for sites banning ip?

Very nice tool, tested on some sites and it works wonderfully

However, one of my target site banned me immediately after 80 requests

Any workaround besides VPN, proxies etc?

Can we set the scrapping speed? It's ok for me to scrap slower

add Issue templates

issue templates will force people to open more readable and standart issues

In-app Navigation and Electron.js Version

Summary:

While the Excel Parser Desktop Application uses secure web preferences, it does not use event listeners that prevent in-app navigation. Moreover, the application can benefit from an update to the underlying Electron.js version.

Platform(s) Affected:

MacOS, Windows, Linux

Steps To Reproduce:

  1. Open the Excel Parser Desktop Application.
  2. From the “View” menu, choose “Toggle Developer Tools”.
  3. [In-app Navigation] Within the console, enter window.location=”https://attacker.com/”. The application window navigates to the third-party site.
  4. [Alt. in-app navigation] Alternatively, within the console, enter window.open(“https://attacker.com/”). The application opens a new window with the third-party domain.
  5. [Web Preferences] While the app disables nodeIntegration and enables contextIsolation, it does not enable sandbox. These features can be taken care of by the defaults of the latest Electron.js version.
  6. [Electron.js Version] Finally, the current version of Excel Parser depends on Electron v17 which is vulnerable to numerous CVEs. [Example] The app can benefit from an update to the framework version that fixes numerous security issues. [Link]

--
Mir Masood Ali, PhD student, University of Illinois at Chicago
Mohammad Ghasemisharif, PhD Candidate, University of Illinois at Chicago
Chris Kanich, Associate Professor, University of Illinois at Chicago
Jason Polakis, Associate Professor, University of Illinois at Chicago

Allow Image URLs without .JPG/.PNG Suffixes

I have been searching for well over a week for an application that does precisely what this wonderful application can do. However, I ran into one insurmountable obstacle: the website from which I am currently attempting to batch download images (of historical newspapers, journals, magazines, etc.) obscures the absolute path to the images in question. Thus, my image URLs conform to the following format:

http://digitale.bnc.roma.sbn.it/tecadigitale/img/giornale/TO00185283/1880/unico/00000001/original

It would be super if the developers of this application could allow for image URLs such as this one that do not feature a .jpg/.png file extension. As it currently stands, my URLs simply error out. I have tried to use other batch downloaders, and they successfully download these images, but none of them allow for filenames or folder names, which is what makes this application highly attractive.

Everything else about this application is incredible, and I commend the developers for their very good work.

Thank you for your consideration.

show the ongoing progress on the client side

Now the percentage or currently downloaded image name is not shown on the renderer process, send IPC messages to renderer process and show on the DOM elements via events and message datas

Rename from next coloumn function is needed

Rename from next coloumn function is needed, as the program goest through excel rows and grabs the file from the link, it would be a great addition to rename the file with a new name described on the next coloumn or any place. If left blank program can still stick with the original file name,

Thanks all for the great program!

Nothing happens after file select

I downloaded the windows version 1.3.0. But when I have selected the xls nothing more happens. Tested both drag&drop and using browse.

Expected to get to choose a folder for download. Not showing. No images downloads.

What can be the problem? my column A is Url, column B is filename.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.