Comments (9)
Can you post the ID of the malware sample you were attempting to fetch the families for? It'll be easier for us reproduce the error.
from threatexchange.
The script first collects the IDs by calling malware object as
"for result in Malware.objects(since=1433167932,until=1433171532, dict_generator=True):"
Values in result['id'] are:
814109225352881
1150492118301411
854234691335667
913965018662359
910996378970454
689470754492122
873962145973716
1081450491870663
838455449543136
829052140506010
908174619229394
798787550237174
1059883600696011
719527378156947
from threatexchange.
I took a look and those samples don't appear to have any families. Perhaps pytx doesn't handle an empty results set?
from threatexchange.
It's possible, but why would it be detecting a 500 response coming from the server?
from threatexchange.
Gah, sorry, missed that part. I'm not sure what's happening there. I'm unable to reproduce the problem. From https://developers.facebook.com/tools/explorer/, and attempting to visit, for example, /838455449543136/families, I get an empty "data" block, but no error.
from threatexchange.
jessek: Here is the how the code looks like https://gist.github.com/cybercuffs/bc8f4b0d776f6a9e7baa. This might help you in reproducing the issue. (I've commented out families and variants pull in the code)
from threatexchange.
There is another problem in regards to the time it takes to fetch the data.
In the code you see I've commented out the lines that are pulling families and variants. But even when I pull only Malware, Connection_dropped and dropped_by just for an hour duration, it is taking a very long time, almost 4 hours. I did couple of tests and the execution start and stop time is as below.
Test1:
Execution Started: 2015/06/23 21:08:22
Execution Ended: 2015/06/24 00:54:46
The files' size I got at the end of this completion were: Connection_dropped=954K, Connection_dropped_by: 409K, Malware_Dump=2.6M.
Test2:
Execution Started: 2015/06/28 05:42:25
Execution Ended: 2015/06/28 09:24:18
The files' size I got at the end of this completion were: Connection_dropped=955K, Connection_dropped_by: 409K, Malware_Dump=2.6M.
from threatexchange.
Just pinging on this open issue. Please let us know if things were resolved or not. Thanks!
from threatexchange.
With no response after three weeks, I am closing out. @cybercuffs , please reopen if you are still having problems!
from threatexchange.
Related Issues (20)
- Typing of SignalExchangeAPIWithSimpleUpdates is too Generic | remove use of t.Any
- [py-tx] CLI error opaque for PDQ match with low hash quality HOT 1
- [py-tx] Use the new NON_MALICIOUS reaction
- pdq_hasher error for B/W png HOT 1
- [py-tx] SignalType Reference implementation for Video TMK+PDQF Matching
- [py-tx] ThreatExchange checkpoint time implementation is incorrect, potentially skipping updates HOT 2
- [py-tx] Investigate dbm as a replacement for the default store
- /matches/for-hash/ returns 400, could not parse request HOT 9
- [hma] Clicking Sync button on the webui doesn't do anything
- [py-tx] New extension interface for storage
- [py-ty] Venv setup documentation and/or files
- [hma] Cleanup Settings > ThreatExchange Tab
- [hma] 500 error thrown on invalid PDQ hash HOT 1
- [HMA] graph API 9.0 hardcoded, now deprecated HOT 1
- [py-tx][HMA-in-a-bottle] Modularising py-tx -- Draft roadmap HOT 6
- [hma] Fetcher policy fails to access index HOT 1
- [hma] submitting content gets stuck between "hashed" and "matched" HOT 2
- /matches/for-hash/ gives AttributeError: 'IndexMatchUntyped' object has no attribute 'distance' HOT 1
- [pytx] No match results if creating a local_file with only 1 hash in it HOT 1
- [hma] Size of hashkey has exceeded the maximum size limit of 2048 bytes HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from threatexchange.