darealfreak / watcher-go
download and keep track of your favorite artists on multiple platforms
License: MIT License
If you are e.g. IP banned or blocked in a similar way, it would be beneficial to be able to skip modules.
Running only a single module without the parallel mode can be quite time consuming, so there is definitely a need for that.
the twitter module often tries to download already downloaded posts; most likely the retrieval order is reversed
Dependabot can't resolve your Go dependency files.
As a result, Dependabot couldn't update your dependencies.
The error Dependabot encountered was:
github.com/DaRealFreak/watcher-go/cmd/watcher: cannot find module providing package github.com/DaRealFreak/watcher-go/cmd/watcher
If you think the above is an error on Dependabot's side please don't hesitate to get in touch - we'll do whatever we can to fix it.
Due to using the full file path for all added images, the command exceeds the maximum command line length earlier than expected.
Change into the directory for the conversion so the full file paths are not needed.
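A minimal Go sketch of that change, assuming the conversion shells out to ImageMagick's convert: setting exec.Cmd.Dir lets the child process start inside the frame directory, so only the short relative names end up in the argument list (convertInDir and the example frame paths are hypothetical, not the project's actual API):

```go
package main

import (
	"fmt"
	"os/exec"
	"path/filepath"
)

// convertInDir builds the ImageMagick conversion command so it runs from
// inside the directory containing the frames; the argument list then only
// carries relative file names instead of full paths.
func convertInDir(dir string, frames []string, output string) *exec.Cmd {
	args := make([]string, 0, len(frames)+1)
	for _, frame := range frames {
		// strip the directory prefix, keeping only the file name
		args = append(args, filepath.Base(frame))
	}
	args = append(args, output)
	cmd := exec.Command("convert", args...)
	// exec.Cmd.Dir makes the child process start in dir,
	// so the relative names resolve correctly.
	cmd.Dir = dir
	return cmd
}

func main() {
	cmd := convertInDir(
		"/tmp/frames",
		[]string{"/tmp/frames/0001.png", "/tmp/frames/0002.png"},
		"animation.gif",
	)
	fmt.Println(cmd.Dir, cmd.Args)
}
```

Since only the file names are passed, the per-image cost in the argument list shrinks from the full path length to just the base name.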
sankaku has books; it would be neat to track the books of specific tags too and download them into separate folders
pixiv booth is not supported yet; it would be neat to support it too, though it is unclear whether we can retrieve it with the API
currently there is no option to enable or disable tracked items except by manually editing the database; this should get added
since proxy loops are now included in the base module, we should add a function to the module base to retrieve the next proxy
since the API now accepts start_date and end_date arguments, we can circumvent the 5000-result limit of the pixiv API
Dependabot couldn't parse the go.mod found at /go.mod.
The error Dependabot encountered was:
go: github.com/spf13/[email protected] requires
github.com/grpc-ecosystem/[email protected] requires
gopkg.in/[email protected]: invalid version: git fetch --unshallow -f origin in /opt/go/gopath/pkg/mod/cache/vcs/748bced43cf7672b862fbc52430e98581510f4f2c34fb30c0064b7102a68ae2c: exit status 128:
fatal: The remote end hung up unexpectedly
currently a random delay between 1.5 and 2.5 seconds is chosen.
the data harvesting check is most likely a leaky bucket too, since requests can take a different amount of time.
Instead of random sleep times, a leaky bucket should be implemented for the eh module, as is already done for pixiv.
A refill rate of 1.5s should be a good start for estimating the bucket size.
modules should be able to have a custom configuration (e.g. the animation conversion of pixiv to webp/gif/flif)
mainly this API functionality:
https://developer.twitter.com/en/docs/tweets/timelines/api-reference/get-statuses-user_timeline
we still have to wait for the twitter developer account response, though
currently the OAuth2 clients are not getting backed up; this should get added
I forgot to remove the response structs when I removed the unused API functions; they should get removed too
currently modules get fully initialized twice:
1st time at startup for the cobra commands
2nd time for actually parsing the jobs
for debugging purposes it would help to differentiate between those two.
when a manga type work on pixiv isn't fully downloaded (e.g. 2 out of 5 pages), the item gets marked as complete nonetheless.
we may have to split illustration and manga downloads here
respecting the search type for pixiv: it is defined by the type argument
type:
mode:
since it also saves deleted galleries and lists previously updated galleries, it would be pretty nice to have
currently the add account and update account commands use "user" in one case and "username" in the other as the argument name; unifying them to "user" would be neat
since we only need the client request, there is no need for a webserver.
we can ignore the unresolved error and just retrieve the URL
currently it's always in the path where the app is executed; it would make sense to make this configurable too, since the configuration file location is configurable as well
Just iterating through the jump selection after the last selected item would halve the executed requests but be more error prone
more pixiv work types should become supported for download
since the UUID can't be compared with a greater/smaller comparison, deleting the newest work can cause all gallery items to be downloaded once again; using the published_time as unique ID could prevent that, since we can use a > comparison to detect new items
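A small sketch of the proposed comparison, with hypothetical galleryItem fields and made-up IDs/timestamps: the lexicographic order of random UUIDs carries no publish order, while an int64 published_time supports the > check directly.

```go
package main

import "fmt"

// galleryItem sketches the proposed change: use the publish timestamp
// (unix seconds) as the comparable identifier instead of a random UUID.
type galleryItem struct {
	UUID          string
	PublishedTime int64
}

// isNewer reports whether candidate was published after the stored
// reference item. This keeps working even if the newest stored item
// gets deleted upstream, because older items still compare as not-new.
func isNewer(candidate, reference galleryItem) bool {
	return candidate.PublishedTime > reference.PublishedTime
}

func main() {
	stored := galleryItem{UUID: "f0a1", PublishedTime: 1560000000}
	older := galleryItem{UUID: "0b2c", PublishedTime: 1550000000}
	newer := galleryItem{UUID: "9d3e", PublishedTime: 1570000000}
	// note: string comparison of the UUIDs would order these randomly
	fmt.Println(isNewer(older, stored), isNewer(newer, stored))
}
```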
the DA module currently fails with the return value "invalid_request (Request field validation failed.)" when parsing collections of favourites
for the eh module the generated image link is only valid for 3600 seconds.
with the random delay of up to 2.5 seconds per image, this time can get exceeded with as few as 1440 images (3600 / 2.5 = 1440).
the module should generate the image link in the download step to fix this
the identifier shouldn't be the client ID only but the client ID and access token, so it works with static tokens too
instead of implementing another wrapper around the default session, we could use the default session and add an http round tripper to manage the pixiv API headers, which would be much cleaner
since logrus is in maintenance mode, migrating the logging to zap would be nice to have
to directly sort by new updates, the pixiv module should also update the changed timestamp of the user directory
patreon as site would be pretty neat, patreon even offers a pretty good API
https://docs.patreon.com/#apiv2-oauth
stop reporting custom thrown errors; they are already handled and exist for a reason
proxy usage would be neat, optimally even module-specific proxy connections
with the new structure we don't get the file extension in the download anymore; we should use ImageMagick to check for image similarity
%IM%convert input1.png -resize 200x200 input1_scaled.png
%IM%convert input2.png -resize 200x200 input2_scaled.png
%IM%compare -subimage-search -metric mse input1_scaled.png input2_scaled.png NULL:
or for better performance but without coordinates of expected sub image:
%IM%convert input1.png -resize 200x200 input1_scaled.png
%IM%convert input2.png -resize 200x200 input2_scaled.png
%IM%compare -metric mse input1_scaled.png input2_scaled.png NULL:
While at it we should rename the downloaded file to identify and match the image format:
%IM%identify -format "%m" input2
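The identify step could be wired up in Go roughly like this; extensionForMagickFormat and renameToDetectedFormat are hypothetical helpers, and the format mapping only covers a few common cases:

```go
package main

import (
	"fmt"
	"os"
	"os/exec"
	"strings"
)

// extensionForMagickFormat maps the output of `identify -format "%m"`
// (e.g. "PNG", "JPEG", "GIF") to a file extension.
func extensionForMagickFormat(format string) string {
	switch strings.ToUpper(strings.TrimSpace(format)) {
	case "JPEG":
		return ".jpg"
	case "PNG":
		return ".png"
	case "GIF":
		return ".gif"
	case "WEBP":
		return ".webp"
	default:
		return ""
	}
}

// renameToDetectedFormat asks ImageMagick for the real image format and
// renames the downloaded file accordingly (sketch; assumes identify is
// on the PATH and the file has a single frame).
func renameToDetectedFormat(path string) error {
	out, err := exec.Command("identify", "-format", "%m", path).Output()
	if err != nil {
		return err
	}
	ext := extensionForMagickFormat(string(out))
	if ext == "" {
		return fmt.Errorf("unknown image format: %q", out)
	}
	return os.Rename(path, path+ext)
}

func main() {
	fmt.Println(extensionForMagickFormat("PNG"))
}
```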
the endpoint throws a lot of internal server errors while the web interface works properly.
a fallback to the web interface in case the API endpoint throws an internal server error would be useful.
response on internal server error:
{
"error": "server_error",
"error_description": "Internal server error.",
"status": "error"
}
generally the application should skip the currently handled item on errors.
while it is great to see when something failed, it is quite tedious when you run the application and your internet connection is gone for a few minutes
currently missing from the CLI workflow and only configurable through environment variables so far:
e.g. 1671575/af7d69fce1 contains a new error message that a gallery is unavailable due to copyright, which is not caught yet
we should clear temp files after we're done using them. windows, for example, will clear them automatically after 10 days, but with very many downloads the size can escalate really quickly
translate the app url meta data to parse the endpoint:
document.querySelector('meta[property="da:appurl"]').content
https://www.deviantart.com/developers/app_links
relevant uris:
all other uris don't contain deviations or don't provide a proper sorting, so we can't update/track them properly
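For illustration, extracting that meta tag server-side could look like this in Go; the regexp approach and the sample app URL are assumptions (a real implementation might prefer an HTML parser):

```go
package main

import (
	"fmt"
	"regexp"
)

// appURLPattern extracts the content of the da:appurl meta tag, the same
// value the document.querySelector call above reads in the browser.
var appURLPattern = regexp.MustCompile(
	`<meta\s+property="da:appurl"\s+content="([^"]+)"`)

// extractAppURL returns the app URL from an HTML page, or "" when the
// meta tag is missing.
func extractAppURL(html string) string {
	if match := appURLPattern.FindStringSubmatch(html); match != nil {
		return match[1]
	}
	return ""
}

func main() {
	// sample tag shape based on the deviantart app links documentation
	html := `<meta property="da:appurl" content="DeviantArt://deviation/123ABC">`
	fmt.Println(extractAppURL(html))
}
```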
Currently, when the current item gets deleted/updated, the eh search update won't stop and it will try to add every gallery again.
This causes some delays from opening all pages (leaky bucket) again.
Check whether there is an incremental identifier visible to fix it, or use fallback items to minimize the chance of it happening
the search functionality currently does not work when the download limit has been reached, even though it shouldn't be affected by it
while having one proxy is already helpful, it would be even better if the proxies could be switched after hitting the limit on one.
that way it could loop through the proxies to bypass any imposed limitation.
add an option to use goroutines for parallel processing of modules.
since the session is not shared and the sqlite database connection waits for the other write process to finish, nothing speaks against it.
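The goroutine idea above could be sketched like this; module and run are placeholders for the real module interface:

```go
package main

import (
	"fmt"
	"sync"
)

// module is a placeholder for the real module interface.
type module struct{ name string }

func (m module) run() string { return "finished " + m.name }

// runModulesParallel runs each module's job processing in its own
// goroutine and waits for all of them. Each goroutine writes into its
// own result slot, so no extra locking is needed here; sessions are
// per-module and sqlite serializes writes, as noted above.
func runModulesParallel(modules []module) []string {
	var wg sync.WaitGroup
	results := make([]string, len(modules))
	for i, m := range modules {
		wg.Add(1)
		go func(i int, m module) {
			defer wg.Done()
			results[i] = m.run()
		}(i, m)
	}
	wg.Wait()
	return results
}

func main() {
	out := runModulesParallel([]module{{"pixiv"}, {"twitter"}, {"eh"}})
	fmt.Println(out)
}
```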
the API search is limited up to 5000 results
the web search does not impose any limits so a workaround would be needed.
currently when an error occurs, it is not directly visible which module it originated from
write a custom log formatter to set for the modules and use it on the error check
the twitter module currently ignores all videos
currently missing from the CLI workflow: