GithubHelp home page GithubHelp logo

chou141253 / fgvc-herbs Goto Github PK

View Code? Open in Web Editor NEW
78.0 78.0 10.0 2.2 MB

Pytorch implementation of "Fine-grained Visual Classification with High-temperature Refinement and Background Suppression"

Python 100.00%

fgvc-herbs's People

Contributors

chou141253 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

fgvc-herbs's Issues

resnet

I noticed that your article mentions about resnet, can you provide the content about resnet?

no eval.py file

There is no eval.py file, and there is no 'suppression' function in you eval.py file from PIM project

The method of inference

How do you obtain the final inference result?I find your best_ Top1 is the highest accuracy value from all classifiers.

Missing backbone and return_nodes at run_evaluation.py

Hi,
I tried to reproduce your results and encountered an error when running run_evaluation.py.
The backbone and return_nodes parameters are missing.

If someone encountered the same problem too, change the "build_model" function to be:

def build_model(pretrainewd_path: str,
                img_size: int,
                fpn_size: int,
                num_classes: int,
                num_selects: dict,
                use_fpn: bool = True,
                use_selection: bool = True,
                use_combiner: bool = True,
                comb_proj_size: int = None):
    from models.pim_module.pim_module import PluginMoodel

    # swin_base_patch4_window12_384_in22k
    backbone = create_model('swin_large_patch4_window12_384_in22k', pretrained=True)

    model = \
        PluginMoodel(
            backbone=backbone,
            return_nodes=None,
            img_size=img_size,
            use_fpn=use_fpn,
            fpn_size=fpn_size,
            proj_type="Linear",
            upsample_type="Conv",
            use_selection=use_selection,
            num_classes=num_classes,
            num_selects=num_selects,
            use_combiner=use_combiner,
            comb_proj_size=comb_proj_size)

    if pretrainewd_path != "":
        print("loaded model")
        ckpt = torch.load(pretrainewd_path)
        model.load_state_dict(ckpt['model'])

    model.eval()

    return model

Run time

Hi, why does it take me hours to run one epoch.

CUB-200-2011 dataset

Hi, thank you very much for the code. When I want to run python main.py with CUB-200-2011 dataset --c . /configs/config.yaml, I realized that the CUB-200-2011 dataset does not have a
tain/
│ ├── class1/
│ │ ├── img001.jpg
│ │ ├── img002.jpg
│ │ └── ....
│ ├── class2/ │ ├── class2/
│ │ ├── img001.jpg
│ │ ├── img002.jpg
I would like to ask if these training images are processed by myself, or where can I download them directly?

How to predict

Hello, author. Which one should be selected as the output?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.