azizsonawalla / duplicate-detector-old Goto Github PK
View Code? Open in Web Editor NEWFind space-hogging duplicate images in your collections and bulk remove them
Find space-hogging duplicate images in your collections and bulk remove them
Add the implementation for the RandomPixelStrategy class.
Add logging statements throughout the project to help debug
Implement the full hash strategy class that hashes the entire image file to look for duplicates.
Logger should be able to save logs from session into a file on disk. Feature should have a toggle in global config class to enable/disable.
Implement the unified start()
method in ScanController
that would trigger the execution of all stages automatically.
When the search finds no duplicates, the View Results
button should be replaced with an Exit
button since there is no point going to the results scene
Add the implementation for the SimilarityStrategy class.
Collect errors and pass it in the Progress object when requested.
Sometimes (after clicking the 'load more' button a few times) the row spacing between images on the results page becomes too compressed together. The cause is yet to be identified - requires further investigation.
Currently the pre-search UI does not provide a percentage and ETA. This can be implemented once the directory crawler starts providing ETA and % stats.
Depends on: https://github.com/azizsonawalla/Duplicate-Detector/issues/3
Currently the ScanController stage is only refreshed when refreshCurrentStage()
is called - this method is invoked every time progress is requested. Instead of polling the results every time, an async callback should be passed to the strategy object to update the stage when the task is complete.
Pool size shouldn't be hardcoded as optimal value depends on the user's CPU and core count.
Supported file types depend on the search strategy implementation and should therefore live there.
ScanException should extend a checked exception to enforce error handling in caller. The change is quick (just change the super class) but it will require adding error handling code in numerous callers.
Directory crawler can estimate ETA based on total directory size and size of files scanned so far - add this to the implementation.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.