Comments (26)
@rarous With @MartinaGelnerova we changed the proxies to RESIDENTIAL, and we wanted to wait about a week to see how much the price per run would change.
Currently we are waiting for confirmation based on the new price, right? @MartinaGelnerova
from hlidac-shopu.
@JJetmar Oh, I didn't see the "Build" option, but that was because I was checking the task, not the actor itself. Good to know... Thank you.
But it's still not working (for different reasons on CZ and SK):
https://console.apify.com/admin/users/iMWJjifpQdTwbkKYn/actors/tasks/L48jIjbkN9oQlUQY9/runs/eLnYhfxC9nHVTs4IA#log
https://console.apify.com/admin/users/iMWJjifpQdTwbkKYn/actors/tasks/wASaIqALSVN65yR2o/runs/LzmXHpa4Vp92FykUv#log
from hlidac-shopu.
Checked this one, it seems that the proxy groups we used were blocked. I did some testing and changed the groups. No implementation needed. We will evaluate the results tomorrow.
from hlidac-shopu.
@JJetmar is this solved?
from hlidac-shopu.
@JJetmar @rarous we decided with Kuba B. to keep the current setup with RESIDENTIAL proxies, so I am closing the issue as solved.
from hlidac-shopu.
@JJetmar @rarous sorry, re-opening the issue again: today both the CZ and SK runs took over 5 hours and returned only a few results. I aborted both runs, but there should be some time limit so that a run cannot last (and consume residential proxies) forever...
Can you please have a look at it?
https://console.apify.com/admin/users/iMWJjifpQdTwbkKYn/actors/n8WwDXAF6HXzNBPYm/runs
from hlidac-shopu.
Hmm... When the Actor works, the run takes about 40 min, so I will set a (temporary) timeout of 60 min. That way we don't have to abort the Actor manually, and I will check if there are any other options.
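For illustration only, here is a minimal sketch of the time-limit idea in plain Python. It is not the Actor's code and does not use the Apify API (on the platform the timeout is simply a run setting). The names `run_with_timeout`, `fast_scrape` and `stuck_scrape` are hypothetical.

```python
import asyncio


async def run_with_timeout(coro_factory, timeout_secs):
    """Run a scraping coroutine and cancel it once the time budget is spent."""
    try:
        result = await asyncio.wait_for(coro_factory(), timeout=timeout_secs)
        return {"status": "SUCCEEDED", "result": result}
    except asyncio.TimeoutError:
        # wait_for has already cancelled the inner task at this point.
        return {"status": "TIMED-OUT", "result": None}


async def fast_scrape():
    """Hypothetical healthy run: finishes quickly and reports an item count."""
    await asyncio.sleep(0.01)
    return 7000


async def stuck_scrape():
    """Hypothetical blocked run: would keep going far past the budget."""
    await asyncio.sleep(60)
    return 0
```

With a 60-minute budget the call would be `asyncio.run(run_with_timeout(scrape, 60 * 60))`, where `scrape` stands for whatever coroutine does the crawling.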
from hlidac-shopu.
Looks like there is some new additional protection.
I made a few changes and currently have a test run going under my account.
For now I am just checking whether it makes it through. Once it finishes, I will try to optimize the resources/cost.
from hlidac-shopu.
PR #1490
from hlidac-shopu.
So far it looks like Residential proxies are no longer needed after the update, and we are obtaining more results (there is no blocking in the log from what I see). The cost is lower than with Residential proxies (currently waiting for the optimized run to finish).
I highly suggest increasing the memory for the actor to 4096 MB, since it is using Puppeteer now. Or ping me when it is merged and I can do that.
from hlidac-shopu.
@MartinaGelnerova I set the memory on the Tasks for this Actor, so everything should be good to go (I noticed that the Actor was unscheduled recently). Additionally, I set the run timeout to 2 hours, which should be more than enough to get the results and will also save cost in case something unpredictable happens.
from hlidac-shopu.
@JJetmar thank you, I have re-set the schedule for both the CZ and SK tasks.
from hlidac-shopu.
@JJetmar Am I missing something? I tried to manually run the iTesco_cz and iTesco_sk tasks, but both ended with 0 results. Logs:
https://console.apify.com/admin/users/iMWJjifpQdTwbkKYn/actors/tasks/L48jIjbkN9oQlUQY9/runs/Fa3A8XNPokTq8DpvV#log
https://console.apify.com/admin/users/iMWJjifpQdTwbkKYn/actors/tasks/wASaIqALSVN65yR2o/runs
from hlidac-shopu.
Yeah, the actor wasn't rebuilt (it's a manual process). It should work now.
from hlidac-shopu.
Ah, sorry about that. I remember pressing the build button on the platform, but based on the build list it was not actually rebuilt 🤔
from hlidac-shopu.
@MartinaGelnerova Weird, I just started a run under my account and everything works fine. I am investigating what is happening 🤔
from hlidac-shopu.
@JJetmar fyi:
CZ: finished successfully today, but returned a low number of results (used to be about 7k items, today only 1832 items)
The cost is low ($0.482)
log: https://console.apify.com/admin/users/iMWJjifpQdTwbkKYn/actors/tasks/L48jIjbkN9oQlUQY9/runs/oEHTYdRtfmmdG1MUj#log
SK: failed, ERROR Selected proxy groups have no usable proxies from country 'SK'.
log: https://console.apify.com/admin/users/iMWJjifpQdTwbkKYn/actors/tasks/wASaIqALSVN65yR2o/runs/C1asnkHq52fe9Wbd6#log
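A drop like the one reported above (about 7k items down to 1832, or 0 items) can be flagged automatically. The following is only a sketch under assumptions: the function name `classify_run`, the baseline of 7000 and the 50% threshold are illustrative choices, not something the project is known to use.

```python
def classify_run(item_count, baseline, min_ratio=0.5):
    """Label a finished run by comparing its item count with a usual baseline.

    baseline  - typical number of items for this shop (assumed, e.g. 7000)
    min_ratio - fraction of the baseline below which the run counts as low
    """
    if item_count == 0:
        return "EMPTY"
    if item_count < baseline * min_ratio:
        return "LOW"
    return "OK"
```

For the CZ run above, `classify_run(1832, 7000)` gives `"LOW"`, so the run would be reported even though it finished successfully.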
from hlidac-shopu.
I did another two runs today under my account, both with more than 7k results.
I noticed that the only difference is that I am running it in development mode, which somehow affects the proxyConfiguration, so I changed that in the code.
I also fixed the issue with the SK version.
from hlidac-shopu.
@MartinaGelnerova I see both of the runs are fine for today.
from hlidac-shopu.
@JJetmar You are right - the number of results is correct and the cost is reasonable ($1). Thank you for fixing it. Closing the issue.
from hlidac-shopu.
@JJetmar sorry to bother you again:
SK: 0 items during the last 2 days, logs:
https://console.apify.com/admin/users/iMWJjifpQdTwbkKYn/actors/tasks/wASaIqALSVN65yR2o/runs/ojbC9KdExBoJfVaAT#log
https://console.apify.com/admin/users/iMWJjifpQdTwbkKYn/actors/tasks/wASaIqALSVN65yR2o/runs/nPJcLn6ir8Oh2s1eB#log
CZ: 0 items on Sat 26.8., but succeeded with items on Sun 27.8. Log with 0 items:
https://console.apify.com/admin/users/iMWJjifpQdTwbkKYn/actors/tasks/L48jIjbkN9oQlUQY9/runs/fXNvPkAIn6sLG8NcM#log
from hlidac-shopu.
It gave all the results today without any changes, so the problem only happens occasionally.
Hopefully I will be able to reproduce this behavior and fix it by better handling of sessions when a request is blocked.
I will check this evening.
from hlidac-shopu.
It works pretty well - the only issue is that it gets blocked sometimes. I made a minor improvement to the session management so that it tries more than just 3 times before it dies. For some reason I cannot reproduce this behavior under my account. Let's merge the changes and see.
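A rough sketch of what "more retries with a fresh session after each block" can look like, written in plain Python rather than the Actor's real code. `BlockedError`, `fetch_with_session_rotation`, `make_flaky_fetch` and the default of 10 attempts are all hypothetical names and values chosen for the example.

```python
class BlockedError(Exception):
    """Raised by the (hypothetical) fetch function when the site blocks a request."""


def fetch_with_session_rotation(fetch, url, max_retries=10):
    """Call fetch(url, session_id), switching to a new session after every block."""
    last_error = None
    for attempt in range(1, max_retries + 1):
        session_id = f"session-{attempt}"  # a fresh session for each attempt
        try:
            return fetch(url, session_id)
        except BlockedError as err:
            last_error = err  # retire the blocked session and try the next one
    raise RuntimeError(f"{url} still blocked after {max_retries} sessions") from last_error


def make_flaky_fetch(blocked_attempts):
    """Build a fake fetch that is blocked for the first `blocked_attempts` calls."""
    calls = []

    def fetch(url, session_id):
        calls.append(session_id)
        if len(calls) <= blocked_attempts:
            raise BlockedError(session_id)
        return f"ok:{url}:{session_id}"

    return fetch, calls
```

With a limit of 3, a request that is blocked four times in a row fails. With a higher limit the same request succeeds on the fifth session, which is the effect the change above is after.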
from hlidac-shopu.
@MartinaGelnerova I see it was merged and built yesterday, and today we are still obtaining the results regularly.
from hlidac-shopu.
@MartinaGelnerova I see the drops are still coming back. I am currently testing new implementations under my account.
from hlidac-shopu.
Mostly successful. Good enough.
from hlidac-shopu.