```python
import torch.nn as nn

class CrossAttention(nn.Module):
    def __init__(self, query_dim, context_dim=None, heads=8, dim_head=64, dropout=0.):
        super().__init__()
        ...
```
Hi, I am having some trouble setting the dimensions for the modules in my code.
Could you please tell me how I should set `query_dim` and `context_dim` if I want to use this module to compute cross-attention between a feature map X and a text?
Is `query_dim` the channel dimension of the feature map?
Is `context_dim` the dimension of the tensor I get after feeding the text into a text encoder?
Thank you so much!
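
For reference, here is a sketch of what I am currently trying, so you can see my understanding. The concrete shapes (a 320-channel feature map, a CLIP-style text embedding of shape `(batch, 77, 768)`) and the `attn(x, context=...)` call are just my assumptions; please correct me if the forward signature or the shapes are wrong:

```python
import torch

B, C, H, W = 2, 320, 32, 32
x = torch.randn(B, C, H, W)        # feature map X
context = torch.randn(B, 77, 768)  # text encoder output (assumed shape)

# my guess: query_dim = channels of X, context_dim = text embedding dim
attn = CrossAttention(query_dim=C, context_dim=768, heads=8, dim_head=64)

# flatten the spatial dims so each pixel becomes a query token: (B, H*W, C)
x_seq = x.reshape(B, C, H * W).permute(0, 2, 1)

out = attn(x_seq, context=context)  # expecting output of shape (B, H*W, C)
```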