Idea
Append two types of attention modules on top of a dilated FCN, which model the semantic interdependencies in the spatial and channel dimensions respectively.
Architecture
Dual Attention Module
Positional Attention Module
Just like the attention mechanism in the Transformer, we may consider $B$ as the key, $C$ as the query and $D$ as the value. The difference is that convolution layers, rather than linear layers, are applied to convert the input feature $A$ into $B$, $C$ and $D$. The final self-attention score is

$$s_{ji} = \frac{\exp(B_i \cdot C_j)}{\sum_{i=1}^{N} \exp(B_i \cdot C_j)},$$

and the module output is $E_j = \alpha \sum_{i=1}^{N} s_{ji} D_i + A_j$, where $N = H \times W$ is the number of spatial positions and $\alpha$ is a learnable scale initialized to zero.
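A minimal PyTorch sketch of the position attention module, assuming 1x1 convolutions for the query/key/value projections and the channel-reduction factor of 8 used in the authors' released code; the class and variable names here are my own, not the paper's:

```python
import torch
import torch.nn as nn

class PositionAttention(nn.Module):
    """Position attention module (sketch following the DANet formulation)."""
    def __init__(self, in_channels):
        super().__init__()
        self.query_conv = nn.Conv2d(in_channels, in_channels // 8, kernel_size=1)
        self.key_conv = nn.Conv2d(in_channels, in_channels // 8, kernel_size=1)
        self.value_conv = nn.Conv2d(in_channels, in_channels, kernel_size=1)
        self.alpha = nn.Parameter(torch.zeros(1))  # learnable scale, starts at 0
        self.softmax = nn.Softmax(dim=-1)

    def forward(self, x):                                              # x: (B, C, H, W)
        b, c, h, w = x.size()
        q = self.query_conv(x).view(b, -1, h * w).permute(0, 2, 1)     # (B, N, C')
        k = self.key_conv(x).view(b, -1, h * w)                        # (B, C', N)
        attn = self.softmax(torch.bmm(q, k))                           # (B, N, N) spatial attention map
        v = self.value_conv(x).view(b, -1, h * w)                      # (B, C, N)
        out = torch.bmm(v, attn.permute(0, 2, 1)).view(b, c, h, w)     # weighted sum over positions
        return self.alpha * out + x                                    # scaled residual connection


# Example: the output keeps the input shape.
feat = torch.randn(2, 512, 32, 32)
print(PositionAttention(512)(feat).shape)  # torch.Size([2, 512, 32, 32])
```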
Channel Attention Module
Apply the same self-attention mechanism channel-wise. Here the attention map is computed directly from the original feature $A$ (no convolution layers are used to embed it): $x_{ji} = \frac{\exp(A_i \cdot A_j)}{\sum_{i=1}^{C} \exp(A_i \cdot A_j)}$, and the output is $E_j = \beta \sum_{i=1}^{C} x_{ji} A_i + A_j$ with a learnable scale $\beta$.
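A corresponding sketch of the channel attention module under the same assumptions (names are illustrative; the softmax is taken over the channel affinity as in the paper's equation):

```python
import torch
import torch.nn as nn

class ChannelAttention(nn.Module):
    """Channel attention module (sketch): attention over channels of the raw feature."""
    def __init__(self):
        super().__init__()
        self.beta = nn.Parameter(torch.zeros(1))  # learnable scale, starts at 0
        self.softmax = nn.Softmax(dim=-1)

    def forward(self, x):                              # x: (B, C, H, W)
        b, c, h, w = x.size()
        q = x.view(b, c, -1)                           # (B, C, N)
        k = x.view(b, c, -1).permute(0, 2, 1)          # (B, N, C)
        attn = self.softmax(torch.bmm(q, k))           # (B, C, C) channel attention map
        v = x.view(b, c, -1)                           # (B, C, N)
        out = torch.bmm(attn, v).view(b, c, h, w)      # re-weight channels
        return self.beta * out + x                     # scaled residual connection
```

In the full architecture, the outputs of the two modules are each passed through a convolution layer, summed element-wise, and followed by a final convolution to produce the prediction map.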