GithubHelp home page GithubHelp logo

microsoft / cameratraps Goto Github PK

View Code? Open in Web Editor NEW
701.0 45.0 230.0 426.51 MB

PyTorch Wildlife: a Collaborative Deep Learning Framework for Conservation.

Home Page: https://cameratraps.readthedocs.io/en/latest/

License: MIT License

Python 1.76% Jupyter Notebook 38.03% Shell 0.05% Python 20.73% Jupyter Notebook 38.03% Shell 0.39% Dockerfile 0.02% HTML 0.37% C# 0.52% Starlark 0.10% Batchfile 0.01%
camera-traps wildlife conservation machine-learning computer-vision megadetector pytorch pytorch-wildlife

cameratraps's People

Contributors

aa-hernandez avatar aa-hernandez-celis avatar aench2023 avatar agentmorris avatar amritagupta avatar arashno avatar beerys avatar bencevans avatar brianhogg avatar chrisyeh96 avatar chunting-linct avatar coreyjaskolski avatar dependabot[bot] avatar fedegonzal avatar jakob-98 avatar kant avatar louis030195 avatar luvargas2 avatar marcel-simon avatar marcelsimon avatar microsoft-github-policy-service[bot] avatar microsoftopensource avatar msftgits avatar oksachi avatar pflickin avatar ranjanbalappa avatar suhail14298 avatar vardhan-duvvuri avatar yangsiyu007 avatar zhmiao avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

cameratraps's Issues

`group` class?

Hi

Please could you clarify as to what the group class is supposed to represent? Group of animals, group of people, vehicles...?

Thanks

Ben

Are the MDv5 pt files "normal" YOLOv5 model files?

Are the MDv5 pt files "normal" YOLOv5/PyTorch model files? Aka can I use my existing scripts for running YOLOv5 models to run MDv5? or is there something custom about the pt files that I need to be aware of?

Tutorial is out of date

From the main README.md, in section classification (https://github.com/microsoft/CameraTraps#classification),
it states "here's a tutorial on training your own classifier using our detector and our training pipeline." and points to the archived TUTORIAL.md (https://github.com/microsoft/CameraTraps/blob/master/archive/classification_marcel/TUTORIAL.md).

However, the tutorial seems to be far out of sync with the readme in https://github.com/microsoft/CameraTraps/blob/master/classification/README.md.

I am still unsure how to run the training with my dataset with the current master branch.

google.protobuf.message.DecodeError: Error parsing message

hello, i got some errors when run your code when refering to the file of megadetector_colab.ipynb
##########################
!python run_tf_detector_batch.py megadetector_v4_1_0.pb "$images_dir" "$output_file_path" --recursive

Then the error occurs, the messages are showed as follow:
...
GPU:0 with 10672 MB memory) -> physical GPU (device: 0, name: Tesla K80, pci bus id: 0000:90:00.0, compute capability: 3.7)
2020-09-28 13:04:05.656622: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1115] Created TensorFlow device (/device:GPU:1 with 10672 MB memory) -> physical GPU (device: 1, name: Tesla K80, pci bus id: 0000:91:00.0, compute capability: 3.7)
2020-09-28 13:04:05.657306: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1115] Created TensorFlow device (/device:GPU:2 with 10672 MB memory) -> physical GPU (device: 2, name: Tesla K80, pci bus id: 0000:94:00.0, compute capability: 3.7)
2020-09-28 13:04:05.657562: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1115] Created TensorFlow device (/device:GPU:3 with 10672 MB memory) -> physical GPU (device: 3, name: Tesla K80, pci bus id: 0000:95:00.0, compute capability: 3.7)
tf.test.is_gpu_available: True
3597 image files found in the input directory
TFDetector: Loading graph...
Traceback (most recent call last):
File "run_tf_detector_batch.py", line 375, in
main()
File "run_tf_detector_batch.py", line 357, in main
n_cores=args.ncores)
File "run_tf_detector_batch.py", line 166, in load_and_run_detector_batch
tf_detector = TFDetector(model_file)
File "/data1/user_datas/tianxianhao/DeepLearning/CameraTrap/CameraTraps/detection/run_tf_detector.py", line 147, in init
detection_graph = TFDetector.__load_model(model_path)
File "/data1/user_datas/tianxianhao/DeepLearning/CameraTrap/CameraTraps/detection/run_tf_detector.py", line 212, in __load_model
od_graph_def.ParseFromString(serialized_graph)
google.protobuf.message.DecodeError: Error parsing message
###################
how to solve this problem? please.

Script to convert Mega output JSON to COCO?

Hi, we are trying to do some transfer learning with our Megadetector output JSON (retraining Mobilenet in TF2) and just realized the JSON output format from Megadetector is not in COCO format. Do you know if there is a handy script laying around somewhere that converts the images..detections into annotations and fixes the bounding box coordinates and any other differences i haven't learned about yet! Thank you

Method to look up request IDs from the API

The API rocks! It would be handy to have a method to list a table (or JSON) of all the request ids with date submitted, date completed, job name, number of images, and brief status (e.g. failed, completed, processing). Maybe be able to limit to a date range or job name matcher?

make_classification_dataset with custom train/test split

Thanks for making this great repo!

I love that make_classification_dataset.py allows you to crop images using the megadetector and create the tfrecords at the same time. One thing that has not been clear to me is how to specify a specific train/test split rather than having the script automatically split across locations.

Is there a way to do this?

Release v4.1

Hey! Im trying to use the current release (v4.1)

When running the data_management/databases/classification/make_classificiation_dataset.py it tries to locate a create_tfrecords_v3 which is seems by the comments to be a script to create tfrecords when using python 3 or higher.

However, this create_tfrecords_v3 does not seem to exist.

I am using the environment-detector.yml conda setup.

TFLite Converter Errors

I am attempting to port the frozen .pb file to a TFLite model for use with the Raspberry Pi and the Google Coral TPU USB Coprocessor and ran into issues running the conversion. I am new to TensorFlow and, while I'm a Software Engineer, I'm using this as a user more than a developer, so bear with me here.

The command I run and samples of the, very verbose, output are below. I know this may not be at the top of your priority list, but if it helps motivation, if this works out we could use MegaDetector on camera traps in the field to detect animals in real-time in the field, like for a predator detector I'm looking for for our farm. I'm able to crunch a detection in 45 seconds on my CPU but am eager to see what the Coral TPU can do. Thanks in advance for any help you're able to provide and I'm happy to roll up my sleeves and dig in if I can get a nudge in the right direction.

Here's the command I run:
tflite_convert --output_file=md_v4.1.0.pb.tflite --graph_def_file md_v4.1.0.pb --input_arrays=image_tensor --output_arrays=detection_boxes,detection_scores,detection_classes --input_shapes=1,640,480,3 --enable_v1_converter

Traceback (most recent call last): File "c:\anaconda3\envs\py36\lib\runpy.py", line 193, in _run_module_as_main "main", mod_spec) File "c:\anaconda3\envs\py36\lib\runpy.py", line 85, in run_code exec(code, run_globals) File "C:\anaconda3\envs\py36\Scripts\tflite_convert.exe_main.py", line 7, in File "c:\anaconda3\envs\py36\lib\site-packages\tensorflow\lite\python\tflite_convert.py", line 442, in main app.run(main=run_main, argv=sys.argv[:1]) File "c:\anaconda3\envs\py36\lib\site-packages\tensorflow\python\platform\app.py", line 125, in run _sys.exit(main(argv)) File "c:\anaconda3\envs\py36\lib\site-packages\tensorflow\lite\python\tflite_convert.py", line 438, in run_main _convert_model(tflite_flags) File "c:\anaconda3\envs\py36\lib\site-packages\tensorflow\lite\python\tflite_convert.py", line 191, in _convert_model output_data = converter.convert() File "c:\anaconda3\envs\py36\lib\site-packages\tensorflow\lite\python\lite.py", line 455, in convert **converter_kwargs) File "c:\anaconda3\envs\py36\lib\site-packages\tensorflow\lite\python\convert.py", line 442, in toco_convert_impl input_data.SerializeToString()) File "c:\anaconda3\envs\py36\lib\site-packages\tensorflow\lite\python\convert.py", line 205, in toco_convert_protos "TOCO failed. See console for info.\n%s\n%s\n" % (stdout, stderr)) tensorflow.lite.python.convert.ConverterError: TOCO failed. See console for info. 2020-12-10 21:12:31.516113: I tensorflow/lite/toco/import_tensorflow.cc:1324] Converting unsupported operation: TensorArrayV3 2020-12-10 21:12:31.516570: E tensorflow/core/framework/op_kernel.cc:1325] OpKernel ('op: "WrapDatasetVariant" device_type: "CPU"') for unknown op: WrapDatasetVariant 2020-12-10 21:12:31.516902: E tensorflow/core/framework/op_kernel.cc:1325] OpKernel ('op: "WrapDatasetVariant" device_type: "GPU" host_memory_arg: "input_handle" host_memory_arg: "output_handle"') for unknown op: WrapDatasetVariant 2020-12-10 21:12:31.517337: E tensorflow/core/framework/op_kernel.cc:1325] OpKernel ('op: "UnwrapDatasetVariant" device_type: "CPU"') for unknown op: UnwrapDatasetVariant 2020-12-10 21:12:31.517673: E tensorflow/core/framework/op_kernel.cc:1325] OpKernel ('op: "UnwrapDatasetVariant" device_type: "GPU" host_memory_arg: "input_handle" host_memory_arg: "output_handle"') for unknown op: UnwrapDatasetVariant 2020-12-10 21:12:31.518258: I tensorflow/lite/toco/import_tensorflow.cc:1373] Unable to determine output type for op: TensorArrayV3 2020-12-10 21:12:31.518503: I tensorflow/lite/toco/import_tensorflow.cc:1324] Converting unsupported operation: TensorArrayScatterV3 2020-12-10 21:12:31.518752: I tensorflow/lite/toco/import_tensorflow.cc:1373] Unable to determine output type for op: TensorArrayScatterV3 2020-12-10 21:12:31.519004: I tensorflow/lite/toco/import_tensorflow.cc:1324] Converting unsupported operation: TensorArrayV3 2020-12-10 21:12:31.519201: I tensorflow/lite/toco/import_tensorflow.cc:1373] Unable to determine output type for op: TensorArrayV3 2020-12-10 21:12:31.519404: I tensorflow/lite/toco/import_tensorflow.cc:1324] Converting unsupported operation: TensorArrayV3 2020-12-10 21:12:31.519606: I tensorflow/lite/toco/import_tensorflow.cc:1373] Unable to determine output type for op: TensorArrayV3 2020-12-10 21:12:31.519863: I tensorflow/lite/toco/import_tensorflow.cc:1324] Converting unsupported operation: Enter 2020-12-10 21:12:31.520066: I tensorflow/lite/toco/import_tensorflow.cc:1373] Unable to determine output type for op: Enter

..........

2020-12-10 21:12:31.887365: I tensorflow/lite/toco/import_tensorflow.cc:1373] Unable to determine output type for op: TensorArrayGatherV3 2020-12-10 21:12:34.795091: I tensorflow/lite/toco/graph_transformations/graph_transformations.cc:39] Before Removing unused ops: 8696 operators, 17295 arrays (0 quantized) 2020-12-10 21:12:37.623228: I tensorflow/lite/toco/graph_transformations/graph_transformations.cc:39] After Removing unused ops pass 1: 8662 operators, 17241 arrays (0 quantized) 2020-12-10 21:12:41.875052: I tensorflow/lite/toco/graph_transformations/graph_transformations.cc:39] Before general graph transformations: 8662 operators, 17241 arrays (0 quantized) 2020-12-10 21:12:42.990606: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_1/Conv2d_1x1/Conv2D". 2020-12-10 21:12:43.114359: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_2/Conv2d_1x1/Conv2D". 2020-12-10 21:12:43.249639: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_3/Conv2d_1x1/Conv2D". 2020-12-10 21:12:43.371587: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_4/Conv2d_1x1/Conv2D". 2020-12-10 21:12:43.490386: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_5/Conv2d_1x1/Conv2D". 2020-12-10 21:12:43.610138: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_6/Conv2d_1x1/Conv2D". 2020-12-10 21:12:43.729914: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_7/Conv2d_1x1/Conv2D". 2020-12-10 21:12:43.845808: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_8/Conv2d_1x1/Conv2D". 2020-12-10 21:12:43.961710: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_9/Conv2d_1x1/Conv2D". 2020-12-10 21:12:44.079812: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_10/Conv2d_1x1/Conv2D". 2020-12-10 21:12:44.198136: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_11/Conv2d_1x1/Conv2D". 2020-12-10 21:12:44.316202: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_12/Conv2d_1x1/Conv2D". 2020-12-10 21:12:44.430921: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_13/Conv2d_1x1/Conv2D". 2020-12-10 21:12:44.545152: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_14/Conv2d_1x1/Conv2D". 2020-12-10 21:12:44.657000: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_15/Conv2d_1x1/Conv2D". 2020-12-10 21:12:44.767624: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_16/Conv2d_1x1/Conv2D". 2020-12-10 21:12:44.878775: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_17/Conv2d_1x1/Conv2D". 2020-12-10 21:12:44.987341: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_18/Conv2d_1x1/Conv2D". 2020-12-10 21:12:45.098353: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_19/Conv2d_1x1/Conv2D". 2020-12-10 21:12:45.207493: I tensorflow/lite/toco/graph_transformations/identify_dilated_conv.cc:220] Replaced sub-network with Dilated Conv2D op outputting "FirstStageFeatureExtractor/InceptionResnetV2/InceptionResnetV2/Repeat_1/block17_20/Conv2d_1x1/Conv2D". 2020-12-10 21:12:45.215230: F tensorflow/lite/toco/graph_transformations/resolve_constant_slice.cc:59] Check failed: dim_size >= 1 (0 vs. 1)

Program seems to hit a dead-lock when using multiple cores

Hello,
As I wanted to increase the performance of the program run_tf_detector_batch.py, I used the option --ncores you provided. I tried it with 2 or 4 cores but each time, the program is stuck. Here are the last shell outputs I get:

Creating pool with 2 cores
TFDetector: Loading graph...
TFDetector: Loading graph...
TFDetector: Detection graph loaded.
TFDetector: Detection graph loaded.
Loaded model (batch level) in 6.78 seconds
Loaded model (batch level) in 6.78 seconds
Processing image /home/...
Processing image /home/...
2020-08-17 09:43:40.366492: W tensorflow/core/framework/cpu_allocator_impl.cc:81] Allocation of 20054016 exceeds 10% of system memory.
2020-08-17 09:43:40.366493: W tensorflow/core/framework/cpu_allocator_impl.cc:81] Allocation of 20054016 exceeds 10% of system memory.

When I don't use the ncores option, I process 30 images in ~5 minutes. I've waited 10 minutes with the option, but it didn't process even 1 image.

Do you know what could cause this behaviour ?

multi-GPU inference

Is there an easy way to use multiple GPUs for inference for the MegaDetector? That could speed things up quite a bit for large datasets.

Using the estimated models outside of MegaDetector

Hi,

Could it be possible to use the estimated models (I mean, the model weights) outside of MegaDetector, for those of us who have their own codes already running?
I would like to reuse your estimated models in my codes implemented in Keras. Do you think it could be possible?

I think this could be very interesting for many people...

is this error means my gpu memory_size is small

is this error means my gpu memory_size is small
[00:00<?, ?it/s]2022-04-25 01:31:03.333175: W tensorflow/core/grappler/costs/op_level_cost_estimator.cc:690] Error in PredictCost() for the op: op: "CropAndResize" attr { key: "T" value { type: DT_FLOAT } } attr { key: "extrapolation_value" value { f: 0 } } attr { key: "method" value { s: "bilinear" } } inputs { dtype: DT_FLOAT shape { dim { size: -1204 } dim { size: -1205 } dim { size: -1206 } dim { size: 1088 } } } inputs { dtype: DT_FLOAT shape { dim { size: -25 } dim { size: 4 } } } inputs { dtype: DT_INT32 shape { dim { size: -25 } } } inputs { dtype: DT_INT32 shape { dim { size: 2 } } value { dtype: DT_INT32 tensor_shape { dim { size: 2 } } int_val: 17 } } device { type: "GPU" vendor: "NVIDIA" model: "NVIDIA GeForce GTX 1050 Ti" frequency: 1392 num_cores: 6 environment { key: "architecture" value: "6.1" } environment { key: "cuda" value: "11020" } environment { key: "cudnn" value: "8100" } num_registers: 65536 l1_cache_size: 24576 l2_cache_size: 1048576 shared_memory_size_per_multiprocessor: 98304 memory_size: 2917584078 bandwidth: 112128000 } outputs { dtype: DT_FLOAT shape { dim { size: -25 } dim { size: 17 } dim { size: 17 } dim { size: 1088 } } }

How to use this with TF Serving?

Hi, I'm a seasoned JS/web/full stack developer but fairly new to ML and almost no Python experience.

I'm trying to setup an endpoint (similar to your API endpoint), but running on Google Cloud Infrastructure, where I can submit an image (or several for batching) and get json results back.

From the documentation I understood that I can download this model (https://lilablobssc.blob.core.windows.net/models/camera_traps/megadetector/md_v4.1.0/md_v4.1.0_saved_model.zip), and exposing it up using TFServing.
However I'm hitting some dead ends on how to input the image as an argument.

I set the model up using tf version 1.15 on Google's Vertex AI Prediction service (https://cloud.google.com/vertex-ai/docs/general/import-model), and tried this for an input argument:

{"instances": [{"b64": ""}]}

But I get this error back:
{ "error": "Failed to process element: 0 of \'instances\' list. Error: Invalid argument: JSON Value: {\n \"b64\": \"\"\n} Type: Object is not of expected type: uint8" }

I'm thinking that maybe the Vertex AI input parameters need to be formatted somehow to match what the model expects, but I can't find the documentation on what arguments the model expects when used with TF Serving?

Any pointers is much appreciated! Am I even on the right path to try and setup the service using Vertex AI's services? Is there maybe a tutorial on how to use the model using TF serving?

MegaDetector crashes on VM instance

I'm trying to run the MegaDetector over some images on a GCP VM instance with a Tesla T4 GPU. I created a conda virtual environment using the environment-detector.yml file with the tensorflow-gpu version set to 1.13.1, and I am trying to run the run_tf_detector_batch.py script as instructed in the MegaDetector page. However, the process crashes while processing the first image (below is a screenshot of my terminal window when that happens):

Screen Shot 2021-03-25 at 1 34 27 AM

I've tried to trace the program but haven't been able to figure out what the issue is. Any idea why this might be happening and how to solve it? Thanks!

Confused about language for how to set-up a local batch process with the pre-trained Megadetector_v3

Hi!

I am confused about how to initiate the batch processing functionality of the megadetector. I've now successfully called the single image detection script, but am not completely understanding what phrasing to use when designating a multiple images to be sifted through the run_tf_detector_batch.py

I'm new to all of this, I've learned what a json file is and see that designating one is necessary for the batch process. Would someone be able to point me in the right direction for how to phrase the command to designate multiple images to the batch process script?

Thanks for your time!

edit: Correction, I now understand the purpose of the json file and it's role with using the visualization tool. My main confusion is how do you communicate to the script that an entire directory of images is to be classified as opposed to just one.

ModuleNotFoundError: No module named 'visualization'

Hi there,
I've probably done something stupid, but can't figure out what. I've run this on two different computers (but within the same university, so maybe there are some rights issues) and get to the same error.

Running Windows 10, Anaconda and Git installed fine. I ran the following lines of code without error:

mkdir c:\git
cd c:\git
git clone https://github.com/Microsoft/cameratraps
git clone https://github.com/Microsoft/ai4eutils
cd c:\git\cameratraps
conda env create --file environment-detector.yml
conda activate cameratraps-detector
set PYTHONPATH=%PYTHONPATH%;c:\git\cameratraps;c:\git\ai4eutils;c:\git\yolov5

The git folder is in C:\git and the megadetector file is in C:\megadetector. (i.e. using the suggested defaults).

But when I run the following code, I get the error below.

(cameratraps-detector) PS C:\git\CameraTraps> python detection\run_detector.py "c:\megadetector\md_v5a.0.0.pt" --image_file ".\test_images\test_images\caltech_camera_traps_58a022a4-23d2-11e8-a6a3-ec086b02610b.jpg" --threshold 0.2

Traceback (most recent call last):
File "detection\run_detector.py", line 53, in
import visualization.visualization_utils as viz_utils
ModuleNotFoundError: No module named 'visualization'

I did try do some trouble shooting, and two websites suggested that I run the following code:
pip install tf-keras-vis
pip install keras-vis

I did this. But no change.

What I am I missing? Thank you in advance for your time, and please let me know if there is any additional information that you may need.

Successful run of visualize_detector_output.py outputs empty file "anno_"

After successfully running the run_tf_detector_batch.py on a test-subset of 10 images I'm getting stuck after running visualize_detector_output.py.

I'm able to successfully run visualize_detector_output.py as referenced by this log:

C:\git\cameratraps>python visualization\visualize_detector_output.py G:\Datasets\Test\test.json G:\Datasets\Annotations.jpg -i G:\Datasets\Test\test.json

Options to the script: Namespace(confidence=0.8, detector_output_path='G:\Datasets\Test\test.json', images_dir='G:\Datasets\Test\test.json', out_dir='G:\Datasets\Annotations\.jpg', output_image_width=700, random_seed=None, sample=-1, sas_url=None)
detection_categories provided
Detector output file contains 10 entries.
Starting to annotate the images...
100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████| 10/10 [00:00<00:00, 17.83it/s]
Rendered detection results on 10 images, saved to G:\Datasets\Annotations.jpg.

But the output of the script is an empty file "anno_G" with no discernible extension.

Where have I gone wrong!

ModuleNotFoundError: No module named 'ct_utils'

Hello
I'm new to using Tensorflow and Python, and have possibly made a silly mistake, but I keep getting this error when I try and run the run_tf_detector.py script:

python CameraTraps/detection/run_tf_detector.py md_v4.1.0.pb --image_file IMG0027.JPG
Traceback (most recent call last):
File "CameraTraps/detection/run_tf_detector.py", line 54, in
from ct_utils import truncate_float
ModuleNotFoundError: No module named 'ct_utils'

Setup is:
Ubuntu 18.04
python --version
Python 3.6.8

echo $PYTHONPATH
~/ml/microsoft_mega/CameraTraps;~/ml/microsoft_mega/ai4eutils
which was created via
export PYTHONPATH="~/ml/microsoft_mega/CameraTraps;~/ml/microsoft_mega/ai4eutils"
so it is possible I've made a mistake here as the instructions are for Windows.

I'm not using Anaconda, but a Python virtual environment. Tensorflow and other pip libraries all installed without problems.
Many thanks, and apologies if I've overlooked something simple.
Roy

File "path/run_tf_detector.py", line 54, in <module> from ct_utils import truncate_float ModuleNotFoundError: No module named 'ct_utils'

Hi, I'm a wildlife researcher who is very new to scripting/command line interfaces, so I'm sorry if this seems simple!

I'm kind of confused, I was trying to run a quick demo of the megadetector_v3 on some camera trap imagery but can't seem to get the run_tf_detector.py script working due to 'ct_utils' in the root folder not being successfully referenced. I've made sure that my working directory is the root file of the git clone (CameraTraps) and I've tried running it in both 'cameratraps' and 'cameratraps-detector' environments. I've tried moving ct_utils into the detector sub-folder and vice-versa with no success.

Any advice on how to get the ct_utils script referenced when running the run_tf_detector.py script?

I'm following the directions found on: https://github.com/microsoft/CameraTraps/blob/master/megadetector.md

Thanks!

eliminating repeated detection

Hi Dan,
thank you for your quick reply! (Issue #290 )

after running the pipeline:

  • find_repeat_detections.py
  • manually removing false posittives
  • remove_repeat_detections.py
    I end up with a json file.
    when running this json file in the script postprocess_batch_results.py, I get the following error:
    File "load_api_results.py", line 93, in load_api_results
    assert s in detection_results, 'Missing field {} in detection results'.format(s)
    AssertionError: Missing field info in detection results

Looking at the json file, it doesn't have the same fields as in the output json file of the run_detector_batch.py.

Is there a way use the postprocess_batch_results.py on the json file of the repeat detections?

Thanks

[building a potential Edge AI solution] As an ecologist, what are you looking for to extract from camera traps?

Hey all,

Not sure if it falls under an issue (I'm happy to post or x-post it somewhere more suitable). This repo may be the best place to ask some questions about what's needed to be integrated in camera traps, since individuals with technical backgrounds, with some understanding on research needs, are likely to stumble upon the repo.

I'm an AI for Earth grantee (agritech and their integration within smart cities). I learned about camera traps during the AI for Earth Summit, and coincidentally, as a part of an online competition (Microsoft's Discover AI Upskilling Journey) where organizers have asked participants to look into helping out with camera traps.

However, what is not particularly clear is, what exactly are ecologists looking to extract out of camera trap images?
Animals (detection and classifying which animal)?
Other objects related to poaching (people, cars, trucks)?
Other features (weather conditions, time stamps of when detection happens)?

Understanding what needs to be extracted can help us (researchers and hobbyists) understand what needs to be developed, as well as tailor the hardware and integration to better fit what is needed.

Overview of my project:

The goal of this project is to highlight how effective Edge AI can be in eliminating/filtering out a significant number of inter-steps between data collection and obtaining classification and detection results, potentially along with other data/features that need to be collected or extracted.

Presentation1
Picture legend:
green arrows -> working
red arrows -> not working as intended
blue arrows -> have not tried it yet int this project, but tested out and working in other projects

More information:
The project aims to tie-in features from machine learning and internet of things with hardware devices, similar in some aspects to what is currently used on the ground, in order to reduce the overall time spent ecologists spend on classifications, collecting data, etc... As someone who has never interacted with ecologists, it's difficult to understand what is currently needed for them to better allocate their time and focus on research aspects, rather than countless hours on manual data labelling.

Model: I trained a custom object detection and classification (Yolov4) model for 17 classes, which are all the classes of animals that I was able to manually make out in the MegaDetector demo video. The classes are Armadillo, Bear, Bird, Bull, [Car], Cat, Cattle, Deer, Dog, Fox, Monkey, [Person], Pig, Raccoon, Sheep, Tiger, [Truck]. I added a few classes in [brackets], just in case someone would find the model useful as is.

The images used for training are obtained from Open Images Dataset V6.

Then, I used the model to run object detection and classification on some test images (the sheep image in the overview is one of the test images used) and the MegaDetector demo video.

Testing images (after ~5hrs training):
Yolov4 TFLite (TensorFlow Lite model, converted from yolov4)
yolov4 TFLite

Yolov4
yolov4

Video:
Running the yolov4 model on the MegaDetector demo video (less than 2 minutes for the model to detect and classify the video).

It seems to be pretty ok for some parts, needs more training for other parts.
Some examples for detection/classification:
Screen Shot 2020-06-25 at 10 56 42 AM
Screen Shot 2020-06-25 at 10 56 16 AM
Screen Shot 2020-06-25 at 10 55 52 AM
Screen Shot 2020-06-25 at 10 58 01 AM

Some examples for bad detection/classification (model needs more training?):
Screen Shot 2020-06-25 at 10 59 07 AM
Screen Shot 2020-06-25 at 10 57 10 AM

Extracting more information from images

I was reading that ecologists also look at camera trap images to extract more information about the ecosystem, such as weather conditions.

My hand-waving approach was to use Custom Vision to train weather data. I stumbled upon a Multi-class Weather Dataset for Image Classification, which contains 4 classes of weather conditions. The dataset was used for training.

Screen Shot 2020-06-25 at 11 18 21 AM

Screen Shot 2020-06-25 at 11 19 00 AM

However, testing the model on some images, the model does not seem to be the best for camera-traps. Even though some classifications were on point, the accuracy is really low.

Screen Shot 2020-06-25 at 11 19 22 AM

Screen Shot 2020-06-25 at 11 25 14 AM

Another hand-waving approach to extract weather data from camera-trap images (haven't implemented this yet):

It seems that some images display temperature data in each image. Another approach is to extract temperature information as well as time/date stamps from images (OCR). Then use historical weather datasets (such as openweather API) to fetch other weather features such as humidity, wind direction/speed, weather conditions (rainy, cloudy, partially sunny, sunny), etc..

Raspberry pi as an Edge AI Device

Screen Shot 2020-06-22 at 5 38 48 PM

As implied earlier, the main reason to convert the Yolov4 model to a TensorFlow Lite model is for it to be run on the raspberry pi. This means that as an image or video is taken, object detection and classification happens on the device itself, where the results can be either uploaded directly to the internet or saved on an SD card.

As the weather information is relatively more challenging to extract directly from images and videos, integrating relatively cheap sensors may provide a more custom overview of the ecosystem. For example, it is possible to add sensors to collect data about emissions such as COx, NOx, NH3, VOCs as well as air quality. It would also allow for more customizability when it comes to choosing a camera that may be a better fit for custom applications, as well as detection sensors (in this example, pyroelectric/ infrared sensor in addition to an ultrasonic sensor).

No module named 'detection'

when i try to run the code on local pc i got this error
Traceback (most recent call last): File "CameraTraps\detection/run_detector_batch.py", line 79, in <module> from detection.run_detector import ImagePathUtils, is_gpu_available, load_detector ModuleNotFoundError: No module named 'detection'

Detection Issues

Dear authors of MegaDetector,

I would be very grateful, if you could help me and my colleagues with a very persistent issue:
We applied MegaDetector (run_tf_detector_batch.py) for a large amount of camera trap imagery. Yet we noticed that for many cases, the locations of the individual bounding boxes, stored in the resulting JSON-file, do not represent the animals’ actual position within the image. Thus, we ran the test version (run_tf_detector.py) for a collection of images. Yet the bounding boxes in the resulting “..._detections”-images just matched perfectly with the animal’s location. Consequently, we performed the batch-version on folders only containing one of these test images and the resulting JSON-files contained the exact same values as after the first run while still not matching the animals position.
We performed these steps on several different PCs (computers and laptops) as well as with different operating systems (Ubuntu and Windows). However, the issue persists.

We would be extremely grateful for your advice and help.

Best regards,
Tobias Pelchen (e-mail: [email protected])

detection issue.pdf

cannot import name 'TFDetector' from 'detection.run_detector

using colab

Traceback (most recent call last):
  File "run_tf_detector_batch.py", line 66, in <module>
    from detection.run_detector import ImagePathUtils, TFDetector
ImportError: cannot import name 'TFDetector' from 'detection.run_detector' (/content/CameraTraps/detection/run_detector.py)

Apple M1 Support

Hi All - very excited about this release!

I know this is probably deep down in the dependencies but wanted to raise given that M1 chips are becoming more common and that Mac instructions are given in the README.

The following error occurred running run_detector.py

Intel MKL FATAL ERROR: This system does not meet the minimum requirements for use of the Intel(R) Math Kernel Library.
The processor must support the Intel(R) Supplemental Streaming SIMD Extensions 3 (Intel(R) SSSE3) instructions.
The processor must support the Intel(R) Streaming SIMD Extensions 4.2 (Intel(R) SSE4.2) instructions.
The processor must support the Intel(R) Advanced Vector Extensions (Intel(R) AVX) instructions.

Failed to load the native TensorFlow runtime.

I'm a little familiar with Python and Anaconda, and have followed the steps for installation on a windows machine. However, I get the error "Failed to load the native TensorFlow runtime" when I try to run_tf_detector.py - can anyone point me to where I am going wrong?

Training megadetector on my own images

Hello,
Thank you so much for this great program!

I have thousands of images from trap cameras, and while megadetector works wonderfully on some landscapes, it fails in other (it detects trees and stones as animals).

What is a useful way to train megadetector on my own set of images?

Cheers,
Bar

CameraTraps/api/batch_processing/api_core/ readme not specific enough

I am attempting to set up the API in my own subscription and I have never used Azure Machine Learning workspace before. In the CameraTraps/api/batch_processing/api_core/README.md it states: to set up a new "AML workspace" but there are two resource types on azure that have similar names to this and it is not clear which I should use. The two names are:

  • Azure Machine Learning service workspace
  • Azure Machine Learning Studio workspace

After reading the descriptions, I assume it is the Azure Machine Learning service workspace but I would like clarification.

MegaDetector: broken data stream when reading image file / cannot join current thread

Before I start digging into it further, has anyone come across the following problem? I've run the detector twice and got the same result... thinking along the lines of corrupt file or faulty disk?

Potentially a duplicate of #94 but doesn't contain any logs so unsure.

$ PYTHONPATH=$PYTHONPATH:$(pwd) python3 detection/run_tf_detector_batch.py --recursive --forceCpu --checkpointFrequency 1000
 --outputRelativeFilenames ./detection/megadetector_v3.pb ../borneo-dataset/release/0.5/SAFE/SAFE_2/ ../Borneo-0.5-SAFE2.txt
                                                                                                             
tensorflow tf version: 1.14.0                                     
2019-10-03 12:28:29.084524: I tensorflow/core/platform/cpu_feature_guard.cc:142] Your CPU supports instructions that this TensorFlow binary was not compiled to use: FMA
2019-10-03 12:28:29.381379: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2400065000 Hz
2019-10-03 12:28:29.386639: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x3d43f60 executing computations on platform Host. Devices:
2019-10-03 12:28:29.386727: I tensorflow/compiler/xla/service/service.cc:175]   StreamExecutor device (0): <undefined>, <undefined>
tf_detector.py, tf.test.is_gpu_available: False                                               
WARNING: Logging before flag parsing goes to stderr.                            
W1003 12:28:29.404660 140666144171840 deprecation_wrapper.py:119] From detection/run_tf_detector_batch.py:51: The name tf.logging.set_verbosity is deprecated. Please us
e tf.compat.v1.logging.set_verbosity instead.                                                                 
                
W1003 12:28:29.405052 140666144171840 deprecation_wrapper.py:119] From detection/run_tf_detector_batch.py:51: The name tf.logging.ERROR is deprecated. Please use tf.com
pat.v1.logging.ERROR instead. 
                                                                                                                      
Running detector on 57170 images                       
Loading model...                                                                                             
tf_detector.py: Loading graph...                                                
tf_detector.py: Detection graph loaded.                     
Loaded model in 15.1 seconds                                                                                  
Running detector...                     
0it [00:00, ?it/s]2019-10-03 12:28:55.971887: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because env
var TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set.  If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU.  To confirm that XL
A is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile.
var TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set.  If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU.  To confirm [0/498]
A is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile.
2019-10-03 12:29:01.395446: W tensorflow/core/framework/allocator.cc:107] Allocation of 377318400 exceeds 10% of system memory.
2019-10-03 12:29:01.487368: W tensorflow/core/framework/allocator.cc:107] Allocation of 377318400 exceeds 10% of system memory.
2019-10-03 12:29:01.982219: W tensorflow/core/framework/allocator.cc:107] Allocation of 99878400 exceeds 10% of system memory.
2019-10-03 12:29:02.555869: W tensorflow/core/framework/allocator.cc:107] Allocation of 159744000 exceeds 10% of system memory.
2019-10-03 12:29:02.715443: W tensorflow/core/framework/allocator.cc:107] Allocation of 159744000 exceeds 10% of system memory.
Checkpointing 1 images to /tmp/detector_batch/tmpud3njeui......done
1000it [2:11:24,  7.87s/it]Checkpointing 1001 images to /tmp/detector_batch/tmp6wgeh416......done                                                                       
2000it [4:23:28,  7.90s/it]Checkpointing 2001 images to /tmp/detector_batch/tmp_6tnl91e......done                                                                       
3000it [6:36:44,  8.07s/it]Checkpointing 3001 images to /tmp/detector_batch/tmpnwm51sek......done                                                       
3344it [7:23:24,  8.08s/it]Traceback (most recent call last):                                                                  
  File "detection/run_tf_detector_batch.py", line 559, in <module>                                                             
    main()                                                                                                                    
  File "detection/run_tf_detector_batch.py", line 554, in main                                                                 
    load_and_run_detector(options)                                                                                             
  File "detection/run_tf_detector_batch.py", line 437, in load_and_run_detector
    boxes,scores,classes,imageFileNames = generate_detections(detector,imageFileNames,options)   
  File "detection/run_tf_detector_batch.py", line 167, in generate_detections                    
    imageNP = PIL.Image.open(image).convert("RGB"); imageNP = np.array(imageNP)                  
  File "/home/bencevans/.local/lib/python3.6/site-packages/PIL/Image.py", line 912, in convert
    self.load()                                                   
  File "/home/bencevans/.local/lib/python3.6/site-packages/PIL/ImageFile.py", line 261, in load
    raise_ioerror(err_code)                                   
  File "/home/bencevans/.local/lib/python3.6/site-packages/PIL/ImageFile.py", line 58, in raise_ioerror
    raise IOError(message + " when reading image file")                        
OSError: broken data stream when reading image file                                           
Exception ignored in: <bound method tqdm.__del__ of 3344it [7:23:25,  8.08s/it]>
Traceback (most recent call last):                                             
  File "/home/bencevans/.local/lib/python3.6/site-packages/tqdm/_tqdm.py", line 931, in __del__
    self.close()
  File "/home/bencevans/.local/lib/python3.6/site-packages/tqdm/_tqdm.py", line 1133, in close 
    self._decr_instances(self)
  File "/home/bencevans/.local/lib/python3.6/site-packages/tqdm/_tqdm.py", line 496, in _decr_instances
    cls.monitor.exit()                                 
  File "/home/bencevans/.local/lib/python3.6/site-packages/tqdm/_monitor.py", line 52, in exit
    self.join()                                                                 
  File "/usr/lib/python3.6/threading.py", line 1053, in join
    raise RuntimeError("cannot join current thread")                                                          
RuntimeError: cannot join current thread

pretrained weights for megadetector v5?

Thanks for continuing to maintain this awesome set of tools! My question is about where to find the latest release. I see megadetector v5 referenced here: https://github.com/microsoft/CameraTraps/blob/main/detection/README.md

But the latest release is for 4.1 https://github.com/microsoft/CameraTraps/releases/tag/v4.1

Are the weights available somewhere else other than the release page? We might be using Megadetector soon in addition to https://github.com/microsoft/SpeciesClassification to detect people, mountain lions, foxes, coyotes, and deer and I was looking to use the most up to date models published here.

Error installing tensorflow 1.13.1

I couldn't install tensorflow using the readme instructions:

$ pip install tensorflow==1.13.1
ERROR: Could not find a version that satisfies the requirement tensorflow==1.13.1 (from versions: 2.2.0rc1, 2.2.0rc2, 2.2.0rc3, 2.2.0rc4, 2.2.0, 2.2.1, 2.2.2, 2.3.0rc0, 2.3.0rc1, 2.3.0rc2, 2.3.0, 2.3.1, 2.3.2, 2.4.0rc0, 2.4.0rc1, 2.4.0rc2, 2.4.0rc3, 2.4.0rc4, 2.4.0)
ERROR: No matching distribution found for tensorflow==1.13.1

But after some googling this method worked for me:

python -m pip install --upgrade https://storage.googleapis.com/tensorflow/mac/cpu/tensorflow-1.13.1-py3-none-any.whl

Model doesn't learn to classify

Hey guys!

So, I've been trying to work with this amazing repo you guys put out there but, even though I can run the scripts and execute training, the model simply doesn't seem to be learning how to classify. My goal is to train a classifier on my own species.

I will try to describe the path I made to get where I am now.
I'm on Ubuntu 18.04, tensorflow-gpu=1.15 and have installed all necessary dependencies to run your repository.
I mainly followed this to try and achieve what I needed: https://github.com/microsoft/CameraTraps/blob/master/classification/TUTORIAL.md

First, I started with a VOC format dataset (consists of images + xml with the detections). Because here we need to use the COCO json format, I used the following script to make such changes:
https://github.com/Tony607/voc2coco/blob/master/voc2coco.py
And also, here's an example of an original XML file.
image
Now I bring up the question: the "path" field is not correct, as this file is not in the original place anymore. However, it doesn't seem to affect anything, because I can still generate the COCO json. Here's an example extracted from said json:
image
Which seems ok, since it got the file name right (this is the name it'll search when I point to the images folder, right?) and as I understand COCO json doesn't use xmax and ymax but rather width and height, right as well? The category id also matches the one expected.
Alright, the next step was generating the tf records.
I started trying by using your script, but it seemed to be trying to import a file that did not exist, so instead I went in and used this one (from TF OD API):
https://github.com/tensorflow/models/blob/master/research/object_detection/dataset_tools/create_coco_tf_record.py
As for the test and validation folders, I pointed it to the same and simply ignored the validation files (since here only train and test matters). Everything went well and I got my folder full of tf records (I just had to rename them because the training script here searches for "train-" and "test-" so I add these to the beginning of my file name.
So, with everything in hand, I went to the training part.
I had already the tf records and model checkpoint (inception v4), just needed to prepare some files for training. First copy and modify the dataset python script. I changed the "SPLIT_TO_SIZE" variable to: SPLITS_TO_SIZES = {'train': 7624, 'test': 1885} and "_NUM_CLASSES = 17". Furthermore, I added my dataset to "dataset_factory.py". Finally, I modify the .sh script to adapt and create the one for my own dataset.
DATASET_DIR points to the folder containing the tf records. TRAIN_DIR to the output path (log/date...). CHECKPOINT_PATH to where I had my inception model checkpoint. And MODEL_NAME was still "inception_v4". For better visualization:
image

Finally, I executed the script by typing: bash ../training_scripts/train_deep_inception_v4.sh
It trained reeeeally slowly, with the loss decreasing .0001 per 300~400 steps. It seemed ok though cause it was actually going down. I left the model training overnight, unfortunately it trained only for a few hours.
However, when I woke up and came to check the model, it had stopped immediately after an evaluation step which outputed accuracy and recall equal to 1, which I knew wasn't right. I then exported the graph to see an actual inference and this is what I got:
image
So, the detection is working well, since I use the megadetector model you guys provide, however, classification absolutely sucks. What it is doing is simply labeling all animals as the first label it sees on my "label.txt" file. If I change which class comes first, classification changes as well. So, it doesn't like the model is learning to classify anything, and I simply can't find a reason why.
I'm working on this problem for quite a while now, have tried to use TF OD API directly but with no success as well.

Any tips? Directions? Questions?
Please any help is welcome!

Is the batch processing API and the real-time API running the same classifiers ?

Looking into classifying some thermal images to filter out empty images and roughly label when there is are different objects in the pictures. Will process around 3000 images per day and need to label 3-4 different categories - empty , animal, human

Wondering if the live web API or the batch processing would be a better option?
The readme says that the web API runs species classifiers so wondering if they are the same for the batch processing

Also do you use some public discord or slack?

Colab notebook: annotated images not available in [Visualization_Folder]

Thanks for sharing the colab notebook. As a newbie in this, it's very helpful to be able to test your script before investing time into deploying it at full scale on all the camera traps photos in our project (nest success and invasive predator in an endangered Caribbean seabird).

I would like to share with collaborators the annotated images resulting from the script but the Google Drive folder used as [Visualization_Folder] does not contain any file. Also, Colab's runtime keeps disconnecting and cannot make through the annotation/display of 25 samples photos, hence the need to be able to save those images that have gone through the process.
Thus, my question may be very basic: how to save (or access) the images that are annotated and displayed in the last step (cell below)?

Here are the steps of interest:

images_dir = '/content/drive/My Drive/ctrap'

# choose a location for the output JSON file
output_file_path = '/content/drive/My Drive/ctrap/detector_2020-09-28.json'

Here we use the visualize_detector_output.py in the visualization folder of the Camera Traps repo to see the output of the MegaDetector visualized on our images. It will save images annotated with the results (original images will not be modified) to the [Visualization_Folder] you specify here.

visualization_dir = '/content/My Drive/ctrap/visualize_2020-09-28'  # pick a location for annotated images
!python visualize_detector_output.py "$output_file_path" "$visualization_dir" --confidence 0.8 --images_dir "$images_dir"
import os
from PIL import Image
for viz_file_name in os.listdir(visualization_dir):
  print(viz_file_name)
  im = Image.open(os.path.join(visualization_dir, viz_file_name))
  display(im)  # display() is an iPython method that comes with the notebook

I understand that the images are supposed to be saved in the [Visualization_Folder] but this folder is empty (see edited screeenshots below - note that the visualize_2020-09-28 folder is open).
Untitled

Error when image file is corrupted

Some images, especially from Bushnell cameras, are corrupted (there is missing data). An example is: Snapshot Serengeti\S1\I03\I03_R1\S1_I03_R1_PICT3967.JPG
The run_tf_detector_batch.py script crashes on those images. The only way to process the images is to open them in an image editor and save them again, replacing the bad data with some background value.

Adding the following code to run_tf_detector_batch.py will replace the bad data automatically and allows the batch detector to run over all images without an error;

from PIL import ImageFile
ImageFile.LOAD_TRUNCATED_IMAGES = True

Continuing the training of MegaDetector

Hi all! That's a lot for making this fantastic work available, the collective usefulness of the tools provided is truly staggering.

I was trying out MegaDetector and it's working really well. However, it consistently misses some smaller animals that are partially obscured in the grass (so, false negatives), and sometimes has a tendency to pick up inanimate objects like tree stumps as animals (false positives). Confidence thresholding doesn't help to weed out those false positives as it comes at a high price - basically, I'd have to set a threshold close to unity and miss on a lot of robust animal detection that way.

I see that there are checkpoints available for the 4.1 detector version, and was wondering if there's a code snippet one can use for continuing the training?

Alternatively (a bit of a general question, sorry), are they any good strategies to mitigate the issues above?

Limit number of cores used

I'm using the batch detector on a HPC and need to limit the number of cores used to the number requested. If I used the --cores the batch detector creates a pool of workers which uses a lot more memory and does not support logging or checkpointing. Is there a way to limit the number of cores used without creating a workerpool? I think it's related to set_inter_op_parallelism_threads, but I'm not sure how to set tensorflow config from run_tf_detector_batch.py.

Prediction results from locally loaded SavedModel significantly differ from TF Serving model

Hi,

I'm working with MegaDetector v3, 2019.05.30 and noticed that the prediction results for a given image significantly differ between the locally loaded SavedModel and the TFServing model -> where:

locally loaded SavedModel = saved_model_normalized_megadetector_v3_tf19.tar.gz
TFServing model = saved_model_megadetector_v3_tf19.zip

The locally loaded quite underperforms vs. the TFServing model. Is this something you have seen before? Happy to share my code (am on TF2.1)

Thanks & best regards,

Mike

Crop MegaDetector json

Hi,
I've just started using megadetector and I was wondering if you could recommend any good code to crop my images using the boxes identified?
Thanks!

tensorflow-gpu version

Hello, just a quick question about the supported tensorflow-gpu version. I am having installation issues with version 1.13.1 and was wondering if there is a more recent version that works, or if it must necessarily be 1.13.1. Thanks!

tflite version

Hi, that you for all the work done on this model and for sharing it with freely. I am wanting to use it in a real-time situation i.e. to spot monkeys in my vegetable patch. I was hoping to get it working on a rasp pi 4 with Coral Edge TPU. The Coral can only use tensorflow lite and so I paid someone to try convert the MegaDetector into a version that could run on the Coral TPU but that was unfortunately unsuccessful. My question is where you think there is any change or releasing a version that might run on a pi / Coral TPU / tflite?

Batch inference with classifier?

What is the appropriate way to do batch inference using a trained classifier? The classifier documentation lists three methods:

  1. predict_image.py
  2. detect_and_predict_image.py
  3. generate_sample_predictions.py

The first appears not to support batch inference. #2 and #3 do batch inference, but running each of them resulted in the entire data set attempting to be loaded into memory. What is the correct way to perform inference on a large number of images using this repo?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.