Subgroups can substantially enhance performance and adaptability for machine learning

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Hi <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="

Use subgroup operations when possible about web-llm HOT 5 OPEN

beaufortfrancois commented on September 27, 2024 1

Use subgroup operations when possible

from web-llm.

Comments (5)

beaufortfrancois commented on September 27, 2024

@CharlieFRuan @tqchen What are your thoughts on this?

from web-llm.

tqchen commented on September 27, 2024

This is great, subgroup shuffle can be useful for reduction operations. We did have warp shuffle support for metal backend, so maybe we can try add codegen backend for webgpu

from web-llm.

beaufortfrancois commented on September 27, 2024

The following subgroup shuffle functions are actually in Chrome 129 (currently beta):

subgroupShuffle(value, id): Returns value from the active invocation whose subgroup_invocation_id matches id.
subgroupShuffleXor(value, mask): Returns value from the active invocation whose subgroup_invocation_id matches subgroup_invocation_id ^ mask. mask must be dynamically uniform.
subgroupShuffleUp(value, delta): Returns value from the active invocation whose subgroup_invocation_id matches subgroup_invocation_id - delta.
subgroupShuffleDown(value, delta): Returns value from the active invocation whose subgroup_invocation_id matches subgroup_invocation_id + delta.

from web-llm.

beaufortfrancois commented on September 27, 2024

@tqchen @CharlieFRuan Is this being implemented in Apache TVM?

from web-llm.

CharlieFRuan commented on September 27, 2024

Hi @beaufortfrancois Really appreciate the info and suggestions! We think it is a good idea to have it implemented in the TVM flow. Unfortunately, we are a bit out of bandwidth as of now. We'll revisit in the future!

from web-llm.

Recommend Projects