GithubHelp home page GithubHelp logo

Comments (8)

anson0910 avatar anson0910 commented on June 8, 2024

I think the detect_face_12c_net function is meant to feedforward through the fully convolutional version of the net, so if you did the training of 12-net by yourself, you can use the face_net_surgery/face_12_surgery.py script to convert face_12c to a fully convolutional network!

Otherwise, if you wish to adopt the sliding window technique, you need to crop appropriate sizes of windows from the input image, and feedfoward them once at a time.

from cnn_face_detection.

hiyijian avatar hiyijian commented on June 8, 2024

Thanks a lot.
I should transfer model trained from train_val.prototxt using face_12_surgery.py. I tried this just now, detection works without any error. However, the result seems wrong. Is it possible that my model didnt converge? I think my trained model of face_12c has converged accoding to LOSS(=0.00003) and Accury(=1) of caffe report. Is it overfitted? I used about 20000 positive sample frome AFLW and 60000 negative samples crop from some backgroud images.
BTW: when I use your face12c_full_conv.caffemodel, it works like a charm

from cnn_face_detection.

hiyijian avatar hiyijian commented on June 8, 2024

I observed that the detection rectangles using face12c_full_conv.caffemodel and face_12c_train_iter_400000.caffemodel are exactly the same.
In other words, face_12_surgery.py didnt make any diffrience.

from cnn_face_detection.

anson0910 avatar anson0910 commented on June 8, 2024

Sorry, I did not encounter such a problem... not sure how to solve it

from cnn_face_detection.

hiyijian avatar hiyijian commented on June 8, 2024

Would like to tell me how to determin the threshold of 12-net. yours is 0.01, which I think is very very low

from cnn_face_detection.

anson0910 avatar anson0910 commented on June 8, 2024

Quoting the original paper :
"We then apply a 2-stage cascade consists of the 12-net and 12-calibration-net on a subset of the AFLW images to choose a threshold T 1 at 99% recall rate. Then we densely scan all background images with the 2- stage cascade. All detection windows with confidence score larger than T 1 become the negative training samples for the 24-net."

Basically, if you wish to have higher recall but do not care about precision, then the lower the better!

from cnn_face_detection.

hiyijian avatar hiyijian commented on June 8, 2024

Thanks @anson0910 .I fixed the problem by replace caffe from the buggy-vesion to official-version

from cnn_face_detection.

anson0910 avatar anson0910 commented on June 8, 2024

Great!

from cnn_face_detection.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.