Parameters of Dense Layer
The input vector $(\overrightarrow{x})^T = (x_1 \dots x_i)$ goes through the layers $\dots \: L_i \: \dots$, each of which is composed of neurons $\dots \: \nu_{i}^{\:[i_L]} \: \dots$, and each neuron is in turn composed of a weight column vector $\overrightarrow{w}_{i,\: i_{\nu}}^{[i_L]} \in \R^{l_w \times 1}$ and a bias $b_{\: i_{\nu}}^{[i_L]} \in \R^{1 \times 1}$.
There are two reasons why the weights are arranged as a column vector: one is that linear algebra takes column vectors as the default; the other is that a dense layer in particular reads its weight vectors in column form.
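As a rough sketch of a single neuron's parameters (the use of NumPy and the toy sizes below are illustrative assumptions, not taken from this post), the weight column vector and the scalar bias look like this:

```python
# A minimal sketch: one neuron of a dense layer holds a weight column vector
# w in R^{l_w x 1} and a bias b in R^{1 x 1}. Sizes here are arbitrary examples.
import numpy as np

l_w = 4                            # length of the input (and of each weight vector)
w = np.random.randn(l_w, 1)        # weight column vector, shape (l_w, 1)
b = np.random.randn(1, 1)          # bias, shape (1, 1)

x_T = np.random.randn(1, l_w)      # input as a row vector (x)^T, shape (1, l_w)
a = x_T @ w + b                    # this neuron's output, shape (1, 1)
print(w.shape, b.shape, a.shape)   # (4, 1) (1, 1) (1, 1)
```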
Weight Matrix and Bias Vector
Combine the column vectors of the weights into a matrix.
The shape of the weight matrix is (length of input) × (length of output).
$${W}^{[i_L]} \in \R^{l_w \times l_{\nu}}$$
$$\overrightarrow{b}^{[i_L]} \in \R^{1 \times l_{\nu}}$$
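A minimal sketch of how the weight matrix and bias vector can be assembled from the per-neuron columns (again, NumPy and the sizes are assumptions for illustration):

```python
# Stacking the per-neuron weight columns side by side gives W in R^{l_w x l_nu};
# the biases form a (1, l_nu) row vector.
import numpy as np

l_w, l_nu = 4, 3                                           # input length, number of neurons (output length)
columns = [np.random.randn(l_w, 1) for _ in range(l_nu)]   # one weight column per neuron
W = np.hstack(columns)                                     # weight matrix, shape (l_w, l_nu)
b_T = np.random.randn(1, l_nu)                             # bias row vector, shape (1, l_nu)
print(W.shape, b_T.shape)                                  # (4, 3) (1, 3)
```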
Forward Propagation of Dense Layer
$${a}_i^{[i_L]} = \nu_i^{\:[i_L]}\big((\overrightarrow{x})^T; \: \overrightarrow{w}_{i,\: i_{\nu}}^{[i_L]}, \: b_{\: i_{\nu}}^{[i_L]}\big)$$
$$(\overrightarrow{a}^{[i_L]})^T = (\overrightarrow{x})^T \cdot {W}^{[i_L]} + \overrightarrow{b}^{[i_L]}$$
$(\overrightarrow{x})^T \in {\R}^{1 \times {l_x}}$ becomes $(\overrightarrow{a})^T \in {\R}^{1 \times {l_{\nu}}}$.
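A short sketch of the forward propagation formula above (NumPy and the sizes are illustrative assumptions), confirming that a $(1, l_x)$ row times a $(l_x, l_{\nu})$ matrix plus a $(1, l_{\nu})$ bias row yields a $(1, l_{\nu})$ output row:

```python
# Forward propagation of one dense layer: (a)^T = (x)^T . W + (b)^T.
import numpy as np

l_x, l_nu = 4, 3
x_T = np.random.randn(1, l_x)        # (x)^T, shape (1, l_x)
W = np.random.randn(l_x, l_nu)       # weight matrix, shape (l_x, l_nu)
b_T = np.random.randn(1, l_nu)       # bias row vector, shape (1, l_nu)

a_T = x_T @ W + b_T                  # (a)^T, shape (1, l_nu)
print(a_T.shape)                     # (1, 3)
```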