Sanity checks for saliency maps, Equation sheets, [XAI-6 (1)]

Notice

Recent Posts

Recent Comments

Link

« 2025/04 »
일	월	화	수	목	금	토
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30

Tags more

Archives

Today

Total

관리 메뉴

iMTE

Sanity checks for saliency maps, Equation sheets, [XAI-6 (1)] 본문

Deep learning study/Explainable AI, 설명가능한 AI

Sanity checks for saliency maps, Equation sheets, [XAI-6 (1)]

Wonju Seo 2021. 4. 20. 11:35

논문 제목 : Sanity checks for saliency maps

논문 주소 : arxiv.org/abs/1810.03292

Sanity Checks for Saliency Maps

Saliency methods have emerged as a popular tool to highlight features in an input deemed relevant for the prediction of a learned model. Several saliency methods have been proposed, often guided by visual appeal on image data. In this work, we propose an a

arxiv.org

주요 수식 정리:

0) Definition

input : $x \in \mathbb{R}^d$

model : $S : \mathbb{R}^d -> \mathbb{R}^C$ , C : the number of classes

1) Gradient with respect to input

$Egrad(x)=∂S∂x<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><msub><mi>E</mi><mrow data-mjx-texclass="ORD"><mi>g</mi><mi>r</mi><mi>a</mi><mi>d</mi></mrow></msub><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mfrac><mrow><mi>∂</mi><mi>S</mi></mrow><mrow><mi>∂</mi><mi>x</mi></mrow></mfrac></math>$

2) Gradient $\odot$ Input (Gradient element-wise product with the input)

$EGrad⊙input(x)=x⊙∂S∂x<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><msub><mi>E</mi><mrow data-mjx-texclass="ORD"><mi>G</mi><mi>r</mi><mi>a</mi><mi>d</mi><mo>⊙</mo><mi>i</mi><mi>n</mi><mi>p</mi><mi>u</mi><mi>t</mi></mrow></msub><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi>x</mi><mo>⊙</mo><mfrac><mrow><mi>∂</mi><mi>S</mi></mrow><mrow><mi>∂</mi><mi>x</mi></mrow></mfrac></math>$

3) Guided Backpropagation (GBP)

Feature maps derived during the forward pass : $\{f^l, f^{l-1},...,f^0\}$

Intermediate representations obtained during the backward pass : $\{R^l,R^{l-1},...,R^0\}$

$f l = r e l u (f l - 1) <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><msup><mi>f</mi><mi>l</mi></msup><mo>=</mo><mi>r</mi><mi>e</mi><mi>l</mi><mi>u</mi><mo stretchy="false">(</mo><msup><mi>f</mi><mrow data-mjx-texclass="ORD"><mi>l</mi><mo>-</mo><mn>1</mn></mrow></msup><mo stretchy="false">)</mo></math>$

$Rl+1=∂fout∂fl+1<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><msup><mi>R</mi><mrow data-mjx-texclass="ORD"><mi>l</mi><mo>+</mo><mn>1</mn></mrow></msup><mo>=</mo><mfrac><mrow><mi>∂</mi><msup><mi>f</mi><mrow data-mjx-texclass="ORD"><mi>o</mi><mi>u</mi><mi>t</mi></mrow></msup></mrow><mrow><mi>∂</mi><msup><mi>f</mi><mrow data-mjx-texclass="ORD"><mi>l</mi><mo>+</mo><mn>1</mn></mrow></msup></mrow></mfrac></math>$

GBP aims to zero out negative gradients during computation of R.

$R l = 1 R l + 1 > 0 1 f l > 0 R l + 1 <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><msup><mi>R</mi><mi>l</mi></msup><mo>=</mo><msub><mn>1</mn><mrow data-mjx-texclass="ORD"><msup><mi>R</mi><mrow data-mjx-texclass="ORD"><mi>l</mi><mo>+</mo><mn>1</mn></mrow></msup><mo>></mo><mn>0</mn></mrow></msub><msub><mn>1</mn><mrow data-mjx-texclass="ORD"><msup><mi>f</mi><mi>l</mi></msup><mo>></mo><mn>0</mn></mrow></msub><msup><mi>R</mi><mrow data-mjx-texclass="ORD"><mi>l</mi><mo>+</mo><mn>1</mn></mrow></msup></math>$

위 식의 $1_{R^{l+1}>0}$ 는 positive gradient만 전달, $1_{f^l>0}$ 은 positive activation만 전달을 의미한다.

4) Integrated Gradients (IG)

$EIG(x)=(x−ˉx)× ∫10∂S(ˉx+α(x−ˉx)∂xdα<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><msub><mi>E</mi><mrow data-mjx-texclass="ORD"><mi>I</mi><mi>G</mi></mrow></msub><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mo stretchy="false">(</mo><mi>x</mi><mo>−</mo><mrow data-mjx-texclass="ORD"><mover><mi>x</mi><mo stretchy="false">¯</mo></mover></mrow><mo stretchy="false">)</mo><mo>×</mo><mtext> </mtext><msubsup><mo data-mjx-texclass="OP">∫</mo><mn>0</mn><mn>1</mn></msubsup><mfrac><mrow><mi>∂</mi><mi>S</mi><mo stretchy="false">(</mo><mrow data-mjx-texclass="ORD"><mover><mi>x</mi><mo stretchy="false">¯</mo></mover></mrow><mo>+</mo><mi>α</mi><mo stretchy="false">(</mo><mi>x</mi><mo>−</mo><mrow data-mjx-texclass="ORD"><mover><mi>x</mi><mo stretchy="false">¯</mo></mover></mrow><mo stretchy="false">)</mo></mrow><mrow><mi>∂</mi><mi>x</mi></mrow></mfrac><mi>d</mi><mi>α</mi></math>$

$\bar x$ 는 baseline input으로 주로 zero로 set이 된다.

5) SmoothGrad

$Esg(x)=1NN∑i=1E(x+gi),gi∼N(0,σ2)<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><msub><mi>E</mi><mrow data-mjx-texclass="ORD"><mi>s</mi><mi>g</mi></mrow></msub><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mfrac><mn>1</mn><mi>N</mi></mfrac><munderover><mo data-mjx-texclass="OP">∑</mo><mrow data-mjx-texclass="ORD"><mi>i</mi><mo>=</mo><mn>1</mn></mrow><mi>N</mi></munderover><mi>E</mi><mo stretchy="false">(</mo><mi>x</mi><mo>+</mo><msub><mi>g</mi><mi>i</mi></msub><mo stretchy="false">)</mo><mo>,</mo><mstyle scriptlevel="0"><mspace width="1em"></mspace></mstyle><msub><mi>g</mi><mi>i</mi></msub><mo>∼</mo><mi>N</mi><mo stretchy="false">(</mo><mn>0</mn><mo>,</mo><msup><mi>σ</mi><mn>2</mn></msup><mo stretchy="false">)</mo></math>$

6) VarGrad

$V$ : the variance.

$E v g (x) = V (E (x + g i)), g i \sim N (0, σ 2) <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><msub><mi>E</mi><mrow data-mjx-texclass="ORD"><mi>v</mi><mi>g</mi></mrow></msub><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi>V</mi><mo stretchy="false">(</mo><mi>E</mi><mo stretchy="false">(</mo><mi>x</mi><mo>+</mo><msub><mi>g</mi><mi>i</mi></msub><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mo>,</mo><mstyle scriptlevel="0"><mspace width="1em"></mspace></mstyle><msub><mi>g</mi><mi>i</mi></msub><mo>\sim</mo><mi>N</mi><mo stretchy="false">(</mo><mn>0</mn><mo>,</mo><msup><mi>σ</mi><mn>2</mn></msup><mo stretchy="false">)</mo></math>$

7) GradCAM and Guided GradCAM

$A^k$ : last convolutional layer에서 추출된 feature map

$αkc=1Z∑i∑j∂S∂Akij<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><msubsup><mi>α</mi><mi>c</mi><mi>k</mi></msubsup><mo>=</mo><mfrac><mn>1</mn><mi>Z</mi></mfrac><munder><mo data-mjx-texclass="OP">∑</mo><mi>i</mi></munder><munder><mo data-mjx-texclass="OP">∑</mo><mi>j</mi></munder><mfrac><mrow><mi>∂</mi><mi>S</mi></mrow><mrow><mi>∂</mi><msubsup><mi>A</mi><mrow data-mjx-texclass="ORD"><mi>i</mi><mi>j</mi></mrow><mi>k</mi></msubsup></mrow></mfrac></math>$

$E g r a d = R e L U (\sum k α k c A k) <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><msub><mi>E</mi><mi>g</mi></msub><mi>r</mi><mi>a</mi><mi>d</mi><mo>=</mo><mi>R</mi><mi>e</mi><mi>L</mi><mi>U</mi><mo stretchy="false">(</mo><munder><mo data-mjx-texclass="OP">\sum</mo><mi>k</mi></munder><msubsup><mi>α</mi><mi>c</mi><mi>k</mi></msubsup><msup><mi>A</mi><mi>k</mi></msup><mo stretchy="false">)</mo></math>$

$E g u i d e d - g r a d c a m (x) = E g r a d ⊙ E g b p <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><msub><mi>E</mi><mrow data-mjx-texclass="ORD"><mi>g</mi><mi>u</mi><mi>i</mi><mi>d</mi><mi>e</mi><mi>d</mi><mo>-</mo><mi>g</mi><mi>r</mi><mi>a</mi><mi>d</mi><mi>c</mi><mi>a</mi><mi>m</mi></mrow></msub><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><msub><mi>E</mi><mrow data-mjx-texclass="ORD"><mi>g</mi><mi>r</mi><mi>a</mi><mi>d</mi></mrow></msub><mo>⊙</mo><msub><mi>E</mi><mrow data-mjx-texclass="ORD"><mi>g</mi><mi>b</mi><mi>p</mi></mrow></msub></math>$

나중에 쉽게 보려고 정리해놨다. (언제 논문 켜서 확인하니..)

저작자표시

'Deep learning study > Explainable AI, 설명가능한 AI' 카테고리의 다른 글

Interpretable and fine-grained visual explanations for CNNs 내용 정리 [XAI-7] (0)	2021.04.23
Sanity checks for saliency maps 내용정리 [XAI-6 (2)] (0)	2021.04.20
SmoothGrad : removing noise by adding noise 내용 정리 [XAI-5] (0)	2021.04.15
Smooth Grad-CAM++ 내용 정리 [XAI-4] (0)	2021.04.14
Grad-CAM++ 내용 정리 [XAI-3] (0)	2021.04.09

'Deep learning study/Explainable AI, 설명가능한 AI' Related Articles

Comments

내 블로그 - 관리자 홈 전환	`Q` `Q`
새 글 쓰기	`W` `W`

글 수정 (권한 있는 경우)	`E` `E`
댓글 영역으로 이동	`C` `C`

이 페이지의 URL 복사	`S` `S`
맨 위로 이동	`T` `T`
티스토리 홈 이동	`H` `H`
단축키 안내	`Shift` + `/` `⇧` + `/`

iMTE

iMTE

Sanity checks for saliency maps, Equation sheets, [XAI-6 (1)] 본문

Sanity checks for saliency maps, Equation sheets, [XAI-6 (1)]

논문 제목 : Sanity checks for saliency maps

논문 주소 : arxiv.org/abs/1810.03292

'Deep learning study > Explainable AI, 설명가능한 AI' 카테고리의 다른 글

티스토리툴바

개인정보

단축키

내 블로그

블로그 게시글

모든 영역