Hello, what image sizes are supported by ram and tag2text models?</p

Image sizes about recognize-anything HOT 4 OPEN

shersoni610 commented on May 28, 2024

Image sizes

from recognize-anything.

Comments (4)

xinyu1205 commented on May 28, 2024

Hi, the default input size is 384, and you can also specify the input size at will when model initialization.

from recognize-anything.

shersoni610 commented on May 28, 2024

Hello Xinyu,

After looking at the code it seems only two sizes are suported 224 and 384 and there is assert statement to check the image size. How should we handle 2k or 4K images?

from recognize-anything.

shersoni610 commented on May 28, 2024

This is error message with RAM:
(ramrecog) $ python inference_ram.py --image ~/Downloads/dot-and-mom-aziza-2-649f050319cb3.jpg --image-size 1024
Traceback (most recent call last):
File "/Users/Projects/SegmentAnything/RAM/recognize-anything/inference_ram.py", line 42, in
model = ram(pretrained=args.pretrained, image_size=args.image_size, vit='swin_l')
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/Users/Projects/SegmentAnything/RAM/recognize-anything/ram/models/ram.py", line 263, in ram
model = RAM(**kwargs)
^^^^^^^^^^^^^
File "/Users/Projects/SegmentAnything/RAM/recognize-anything/ram/models/ram.py", line 77, in init
vision_config = read_json(vision_config_path)
^^^^^^^^^^^^^^^^^^
UnboundLocalError: cannot access local variable 'vision_config_path' where it is not associated with a value

from recognize-anything.

xinyu1205 commented on May 28, 2024

Hi, the reason should be that we only include the Swin config for 224&384 in ram/configs/swin. Perhaps you should check how Swin adapts to larger image resolutions.

from recognize-anything.

Recommend Projects

Image sizes about recognize-anything HOT 4 OPEN

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs