๐Ÿ‘๏ธ Computer Vision

๐Ÿ‘๏ธ Computer Vision/Convolutional Neural Network

[Paper Review] Visualizing and Understanding Convolution Networks(ECCV 2013)

Visualizing and Understanding Convolutional Networks(2013) 0. Abstract AlexNet ์ดํ›„ `Large Convolutional Network` ๋ชจ๋ธ๋“ค์ด ImageNet bechmark์—์„œ ์ธ์ƒ์ ์ธ classification ์„ฑ๋Šฅ์„ ๋ณด์ž„ ๊ทธ๋Ÿฌ๋‚˜ ์™œ ์„ฑ๋Šฅ์ด ์ข‹์€์ง€, ์–ด๋–ป๊ฒŒ ์„ฑ๋Šฅ์„ ๊ฐœ์„ ์‹œ์ผฐ๋Š”์ง€์— ๋Œ€ํ•ด์„  ๋ช…ํ™•ํ•˜๊ฒŒ ์ดํ•ดํ•˜์ง€ ๋ชปํ•จ ๋ณธ ๋…ผ๋ฌธ์—์„  Large Convolutional Network์˜ ์ค‘๊ฐ„์— ์žˆ๋Š” feature layers์˜ ๊ธฐ๋Šฅ๊ณผ classifier์˜ ์ž‘๋™ ๊ณผ์ •์„ ํ™•์ธํ•˜๋Š” ์ƒˆ๋กœ์šด ์‹œ๊ฐํ™” ๊ธฐ๋ฒ•์„ ์ œ์•ˆ ํ•ด๋‹น visualization ๊ธฐ์ˆ ์ด diagnostic role์„ ์ˆ˜ํ–‰ํ•˜์—ฌ AlexNet๋ณด๋‹ค ImageNet benchmark์—์„œ ๋” ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ์„ ..

๐Ÿ‘๏ธ Computer Vision/Convolutional Neural Network

[Paper Review] ImageNet Classification with Deep Convolutional Neural Networks(NIPS 2012)

ImageNet Classification with Deep Convolutional Neural Networks 0. Abstract ImageNet LSVRC-2010์—์„œ 1.2m๊ฐœ์˜ ๊ณ ํ•ด์ƒ๋„ ์ด๋ฏธ์ง€์— ๋Œ€ํ•ด 1000๊ฐœ์˜ ํด๋ž˜์Šค๋กœ ๋ถ„๋ฅ˜ํ•˜๊ธฐ ์œ„ํ•ด large, deep convolutional neural network๋ฅผ ํ•™์Šต test data์—์„œ ์ด์ „ SOTA ๋ชจ๋ธ๋ณด๋‹ค ์ข‹์€ top-1๊ณผ top-5 error rates์—์„œ 37.5%์™€ 17.0%๋ฅผ ๊ธฐ๋ก `AlexNet`์—๋Š” ์•ฝ 6,000๋งŒ๊ฐœ์˜ ํŒŒ๋ผ๋ฏธํ„ฐ์™€ 65๋งŒ ๊ฐœ์˜ neurons๊ฐ€ 5๊ฐœ์˜ convolutional layers๋กœ ๊ตฌ์„ฑ ์ด์— ๋”ํ•ด max-pooling layers์™€ 1000-way softmax๋กœ ๊ตฌ์„ฑ๋œ 3๊ฐœ์˜ fully-connected l..

๐Ÿ‘๏ธ Computer Vision/Convolutional Neural Network

[DL] CNN(Image classification) - LeNet-5, AlexNet, VGGNet, GoogleNet, ResNet

`์ด๋ฏธ์ง€ ๋ถ„๋ฅ˜(classification)`๋Š” ํŠน์ • ๋Œ€์ƒ์ด ์˜์ƒ ๋‚ด์— ์กด์žฌํ•˜๋Š”์ง€ ์—ฌ๋ถ€๋ฅผ ํŒ๋‹จํ•˜๋Š” ๊ฒƒ์ด๋‹ค. ์ด๋ฏธ์ง€ ๋ถ„๋ฅ˜์—์„œ ์ฃผ๋กœ ์‚ฌ์šฉ๋˜๋Š” ํ•ฉ์„ฑ๊ณฑ ์‹ ๊ฒฝ๋ง์˜ ์œ ํ˜•์€ ๋‹ค์–‘ํ•˜๋‹ค. LeNet-5 `LeNet-5`๋Š” ํ•ฉ์„ฑ๊ณฑ ์‹ ๊ฒฝ๋ง์ด๋ผ๋Š” ๊ฐœ๋…์„ ์ตœ์ดˆ๋กœ ๊ฐœ๋ฐœํ•œ ๊ตฌ์กฐ๋กœ, ํ˜„์žฌ CNN์˜ ์ดˆ์„์ด ๋˜์—ˆ๋‹ค. LeNet-5๋Š” `ํ•ฉ์„ฑ๊ณฑ(convolutional)`๊ณผ `๋‹ค์šด ์ƒ˜ํ”Œ๋ง(sub-sampling)`(ํ˜น์€ ํ’€๋ง)์„ ๋ฐ˜๋ณต์ ์œผ๋กœ ๊ฑฐ์น˜๋ฉด์„œ ๋งˆ์ง€๋ง‰์— ์™„์ „์—ฐ๊ฒฐ์ธต์—์„œ ๋ถ„๋ฅ˜๋ฅผ ์ˆ˜ํ–‰ํ•œ๋‹ค. LeNet-5์˜ ์‹ ๊ฒฝ๋ง ๊ตฌ์กฐ๋Š” ๋‹ค์Œ๊ณผ ๊ฐ™๋‹ค. (32 x 32 x 1) ํฌ๊ธฐ์˜ ์ด๋ฏธ์ง€์— ํ•ฉ์„ฑ๊ณฑ์ธต๊ณผ ์ตœ๋Œ€ ํ’€๋ง์ธต์ด ์Œ์œผ๋กœ ๋‘ ๋ฒˆ ์ ์šฉ๋œ ํ›„ ์™„์ „์—ฐ๊ฒฐ์ธต์„ ๊ฑฐ์ณ ์ด๋ฏธ์ง€๊ฐ€ ๋ถ„๋ฅ˜๋˜๋Š” ์‹ ๊ฒฝ๋ง์ด๋‹ค. ์ด๋Ÿฌํ•œ ์‹ ๊ฒฝ๋ง ๊ตฌ์กฐ๋ฅผ ํŒŒ์ดํ† ์น˜๋ฅผ ํ†ตํ•ด ๊ตฌํ˜„ํ•˜๋ฉด ๋‹ค์Œ๊ณผ ๊ฐ™๋‹ค. ์ž…๋ ฅ ์ด๋ฏธ์ง€..

๐Ÿ‘๏ธ Computer Vision/Convolutional Neural Network

[DL] CNN - convolutional layer, pooling, fully connected, transfer learning, feature extractor, fine-tuning

ํ•ฉ์„ฑ๊ณฑ ์‹ ๊ฒฝ๋ง ํ•ฉ์„ฑ๊ณฑ์ธต์˜ ํ•„์š”์„ฑ ํ•ฉ์„ฑ๊ณฑ ์‹ ๊ฒฝ๋ง์€ ์ด๋ฏธ์ง€ ์ „์ฒด๋ฅผ ํ•œ ๋ฒˆ์— ๊ณ„์‚ฐํ•˜๋Š” ๊ฒƒ์ด ์•„๋‹Œ ์ด๋ฏธ์ง€์˜ ๊ตญ์†Œ์  ๋ถ€๋ถ„์„ ๊ณ„์‚ฐํ•จ์œผ๋กœ์จ ์‹œ๊ฐ„๊ณผ ์ž์›์„ ์ ˆ์•ฝํ•˜์—ฌ ์ด๋ฏธ์ง€์˜ ์„ธ๋ฐ€ํ•œ ๋ถ€๋ถ„๊นŒ์ง€ ๋ถ„์„ํ•  ์ˆ˜ ์žˆ๋Š” ์‹ ๊ฒฝ๋ง์ด๋‹ค. ๋งŒ์•ฝ (3 x 3) ๋ฐฐ์—ด์„ ํŽผ์ณ์„œ(flattening) ๊ฐ ํ”ฝ์…€์— ๊ฐ€์ค‘์น˜๋ฅผ ๊ณฑํ•˜์—ฌ ์€๋‹‰์ธต์— ์ „๋‹ฌํ•˜๋Š” ๊ฒฝ์šฐ ๋ฐ์ดํ„ฐ์˜ ๊ณต๊ฐ„์  ๊ตฌ์กฐ๋ฅผ ๋ฌด์‹œํ•˜๊ฒŒ ๋œ๋‹ค. ์ด๊ฒƒ์„ ๋ฐฉ์ง€ํ•˜๊ธฐ ์œ„ํ•ด ๋„์ž…๋œ ๊ฒƒ์ด ํ•ฉ์„ฑ๊ณฑ์ธต์ด๋‹ค. ํ•ฉ์„ฑ๊ณฑ ์‹ ๊ฒฝ๋ง์˜ ๊ตฌ์กฐ ํ•ฉ์„ฑ๊ณฑ ์‹ ๊ฒฝ๋ง์€ ํ•ฉ์„ฑ๊ณฑ์ธต๊ณผ ํ’€๋ง์ธต์„ ๊ฑฐ์น˜๋ฉด์„œ ์ž…๋ ฅ ์ด๋ฏธ์ง€์˜ ์ฃผ์š” `ํŠน์„ฑ ๋ฒกํ„ฐ(feature vector)`๋ฅผ ์ถ”์ถœํ•œ๋‹ค. ์ดํ›„ ์ถ”์ถœ๋œ ์ฃผ์š” ํŠน์„ฑ ๋ฒกํ„ฐ๋“ค์€ ์™„์ „์—ฐ๊ฒฐ์ธต์„ ๊ฑฐ์ณ 1์ฐจ์› ๋ฒกํ„ฐ๋กœ ๋ณ€ํ™˜๋˜๋ฉฐ, ๋งˆ์ง€๋ง‰์œผ๋กœ ์ถœ๋ ฅ์ธต์—์„œ ํ™œ์„ฑํ™” ํ•จ์ˆ˜์ธ `softmax` ํ•จ์ˆ˜๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์ตœ์ข… ๊ฒฐ๊ณผ๊ฐ€ ์ถœ๋ ฅ๋œ๋‹ค. ์ž…๋ ฅ์ธต..

๐Ÿ‘๏ธ Computer Vision/Convolutional Neural Network

[DL] ์ปจ๋ฒŒ๋ฃจ์…˜ ์‹ ๊ฒฝ๋ง ๋ชจ๋ธ - LeNet-5, AlexNet, ZFNet, VGGNet, GoogleNet, ResNet

LeNet-5 ์–€ ๋ฅด์ฟค์€ 1998๋…„ ์šฐํŽธ๋ฌผ์— ํ•„๊ธฐ์ฒด๋กœ ์“ฐ์ธ ์šฐํŽธ๋ณ€ํ˜ธ๋ฅผ ์ธ์‹ํ•˜๋Š” ์ตœ์ดˆ์˜ ์ปจ๋ฒŒ๋ฃจ์…˜ ์‹ ๊ฒฝ๋ง์ธ `LeNet-5`๋ฅผ ์„ ๋ณด์˜€๋‹ค. ๋ชจ๋ธ ๊ตฌ์กฐ LeNet-5๋Š” ์ปจ๋ฒŒ๋ฃจ์…˜ ๊ณ„์ธต๊ณผ ํ’€๋ง ๊ณ„์ธต์„ 2๋ฒˆ ๋ฐ˜๋ณตํ•œ ๋’ค ์™„์ „ ์—ฐ๊ฒฐ 3๊ณ„์ธต์œผ๋กœ ์—ฐ๊ฒฐ๋˜๋Š” ๊ตฌ์กฐ์ด๋‹ค. ์ปจ๋ฒŒ๋ฃจ์…˜ ๊ณ„์ธต๊ณผ ํ’€๋ง ๊ณ„์ธต์€ ์ƒ์ฒด ์‹ ๊ฒฝ๋ง์˜ ์‹œ๊ฐ ์˜์—ญ์„, ์™„์ „ ์—ฐ๊ฒฐ ๊ณ„์ธต์€ ์—ฐ๊ด€ ์˜์—ญ์„ ๋ชจ๋ธ๋งํ–ˆ๋‹ค. ์ž…๋ ฅ ๋ฐ์ดํ„ฐ๋Š” (32 x 32 x 1) ์ด๋ฏธ์ง€ ์ปจ๋ฒŒ๋ฃจ์…˜ ๊ณ„์ธต์€ (5 x 5) ํ•„ํ„ฐ(stride=1) ํ™œ์„ฑ ํ•จ์ˆ˜๋Š” ์‹œ๊ทธ๋ชจ์ด๋“œ ํ˜น์€ ํ•˜์ดํผ๋ณผ๋ฆญ ํƒ„์  ํŠธ ํ’€๋ง ๊ณ„์ธต์€ (2 x 2) ํ•„ํ„ฐ(stride=2)๋กœ ํ‰๊ท  ํ’€๋ง์„ ํ†ตํ•ด ์•กํ‹ฐ๋ฒ ์ด์…˜ ๋งต์˜ ํฌ๊ธฐ๋ฅผ ์ ˆ๋ฐ˜์œผ๋กœ ๊ฐ์†Œ ์™„์ „ ์—ฐ๊ฒฐ ๊ณ„์ธต์€ ์•กํ‹ฐ๋ฒ ์ด์…˜ ๋งต๊ณผ ๊ฐ™์€ ํฌ๊ธฐ์˜ (5 x 5 x 16) ์ปจ๋ฒŒ๋ฃจ์…˜ ํ•„ํ„ฐ 120๊ฐœ๋กœ ํฌ๊ธฐ๊ฐ€ 120์ธ ..

๐Ÿ‘๏ธ Computer Vision/Convolutional Neural Network

[DL] ์ปจ๋ฒŒ๋ฃจ์…˜ ์‹ ๊ฒฝ๋ง - convolution, cross correlation, activation map, pooling, stride, padding, infinitely strong prior

์‹œ๊ฐ ํŒจํ„ด ์ธ์‹์„ ์œ„ํ•œ ์‹ ๊ฒฝ๋ง ๋ชจ๋ธ ์ˆœ๋ฐฉํ–ฅ ์‹ ๊ฒฝ๋ง์œผ๋กœ MNIST ํ•„๊ธฐ์ฒด ์ˆซ์ž๋ฅผ ์ธ์‹ํ•˜๋ ค๋ฉด 28x28 ์ด๋ฏธ์ง€๋ฅผ 784 ํฌ๊ธฐ์˜ 1์ฐจ์› ๋ฒกํ„ฐ๋กœ ๋ณ€ํ™˜ํ•ด์„œ ๋ชจ๋ธ์— ์ž…๋ ฅํ•ด์•ผ ํ•œ๋‹ค. 2์ฐจ์› ์ด๋ฏธ์ง€๋ฅผ 1์ฐจ์›์œผ๋กœ ํŽผ์น˜๋ฉด ์–ด๋–ค ์ˆซ์ž์ธ์ง€ ์ธ์‹ํ•˜๊ธฐ ์–ด๋ ต๋‹ค. ์ด๋Š” ๊ณต๊ฐ„ ๋ฐ์ดํ„ฐ๋ฅผ 1์ฐจ์›์œผ๋กœ ๋ณ€ํ™˜ํ•˜๋Š” ์ˆœ๊ฐ„ ํ˜•์ƒ ์ •๋ณด๊ฐ€ ๋ถ„์‚ฐ๋˜๊ธฐ ๋•Œ๋ฌธ์— ์ •ํ™•ํ•œ ํŒจํ„ด์„ ์ธ์‹ํ•˜๊ธฐ ์–ด๋ ค์šด ๊ฒƒ์ด๋‹ค. ๋˜ํ•œ ์ด๋ฏธ์ง€์™€ ๊ฐ™์€ ๊ณ ์ฐจ์› ๋ฐ์ดํ„ฐ๋Š” ์ฐจ์›๋ณ„๋กœ ํฌ๊ธฐ๊ฐ€ ์กฐ๊ธˆ์”ฉ๋งŒ ์ปค์ ธ๋„ ์ „์ฒด ๋ฐ์ดํ„ฐ ํฌ๊ธฐ๊ฐ€ ๊ธฐํ•˜๊ธ‰์ˆ˜์ ์œผ๋กœ ์ฆ๊ฐ€ํ•œ๋‹ค. ์ž…๋ ฅ ๋ฐ์ดํ„ฐ๊ฐ€ ์ปค์งˆ์ˆ˜๋ก ๋” ๋งŽ์€ ํŠน์ง•์„ ํฌํ•จํ•˜๊ธฐ ๋•Œ๋ฌธ์— ๊ทธ์— ๋”ฐ๋ผ ๋ชจ๋ธ์˜ ํŒŒ๋ผ๋ฏธํ„ฐ ์ˆ˜๊ฐ€ ๊ธ‰๊ฒฉํžˆ ์ฆ๊ฐ€ํ•œ๋‹ค. ๋”ฐ๋ผ์„œ ์ˆœ๋ฐฉํ–ฅ ์‹ ๊ฒฝ๋ง์€ ์ด๋ฏธ์ง€ ๋ฐ์ดํ„ฐ๋ฅผ ์ฒ˜๋ฆฌํ•˜๊ธฐ์— ํšจ์œจ์ ์ด์ง€ ์•Š๋‹ค. CNN์˜ ๊ตฌ์กฐ ํ˜„๋Œ€์˜ CNN๋„ ์ตœ์ดˆ์˜ CNN ๋ชจ๋ธ์ธ `LeNet-5`์™€ ๊ฐ™์ด ..

๐Ÿ‘๏ธ Computer Vision/Convolutional Neural Network

[Paper Review] ImageNet Classification with Deep Convolutional Neural Networks(NIPS 2012)

Paper ImageNet Classification with Deep Convolutional Neural Networks(NIPS 2012) ๋ณธ ๋…ผ๋ฌธ์˜ ์ฝ”๋“œ ๊ตฌํ˜„์€ ๊นƒํ—ˆ๋ธŒ์—์„œ ํ™•์ธ ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค. 0. Abstract ImageNet LSVRC-2010 ๋Œ€ํšŒ์—์„œ 120๋งŒ ๊ฐœ์˜ ๊ณ ํ•ด์ƒ๋„ ์ด๋ฏธ์ง€๋ฅผ 1000๊ฐœ์˜ ๋‹ค๋ฅธ ์ด๋ฏธ์ง€๋กœ ๋ถ„๋ฅ˜ํ•˜๊ธฐ ์œ„ํ•ด ํฌ๊ณ  ๊นŠ์€ convolutional neural network๋ฅผ ํ›ˆ๋ จ์‹œ์ผฐ๋‹ค. ํ…Œ์ŠคํŠธ ๋ฐ์ดํ„ฐ์—์„œ ์ด์ „์˜ SOTA ๋ชจ๋ธ๋ณด๋‹ค ๋‚˜์€ 37.5%, 17.0%์˜ top-1 error rate์™€ top-5 error rate๋ฅผ ๋‹ฌ์„ฑํ–ˆ๋‹ค. 6์ฒœ๋งŒ ๊ฐœ์˜ ํŒŒ๋ผ๋ฏธํ„ฐ๋“ค๊ณผ 650,000๊ฐœ์˜ ๋‰ด๋Ÿฐ์œผ๋กœ ๊ตฌ์„ฑ๋œ neural network๋Š” 5๊ฐœ์˜ convolutional layers๋กœ ๊ตฌ์„ฑ๋˜๋ฉฐ, ๊ทธ ์ค‘..

๐Ÿ‘๏ธ Computer Vision/GAN

[GAN] ์ƒ์„ฑ์  ์ ๋Œ€ ์‹ ๊ฒฝ๋ง(Generative Adversarial Networks)์ด๋ž€?

Generative Adversarial Networks Generative Adversarial Networks์€ ๋‘ ๊ฐœ ์ด์ƒ์˜ ์‹ ๊ฒฝ๋ง์ด ์„œ๋กœ๋ฅผ ํ–ฅํ•˜๊ฒŒ ํ•˜๊ณ , ์„œ๋กœ ๋Œ€ํ•ญํ•˜๋“ฏ์ด ํ›ˆ๋ จํ•˜๊ฒŒ ํ•จ์œผ๋กœ์จ, ๊ฒฐ๊ณผ์ ์œผ๋กœ ์ƒ์„ฑ ๋ชจ๋ธ(generative model)์„ ์‚ฐ์ถœํ•ด๋‚ธ๋‹ค. GAN์˜ ์ด์  ๋ฐ์ดํ„ฐ๊ฐ€ ํ•œ์ •๋œ ์ƒํ™ฉ์—์„œ๋„ ์ผ๋ฐ˜ํ™”(Generalization)๋ฅผ ํ•  ์ˆ˜ ์žˆ๋‹ค. ์ž‘์€ ๋ฐ์ดํ„ฐ์…‹์„ ๊ฐ€์ง€๊ณ ๋„ ์ƒˆ๋กœ์šด ์žฅ๋ฉด์„ ์ƒ๊ฐํ•  ์ˆ˜ ์žˆ๋‹ค. ๋ชจ์กฐ ๋ฐ์ดํ„ฐ(simulated data)๋ฅผ ๋”์šฑ ์ง„์งœ์ฒ˜๋Ÿผ ๋ณด์ด๊ฒŒ ํ•  ์ˆ˜ ์žˆ๋‹ค. Generative Modeling & Discriminative Modeling ํŒ๋ณ„ ๋ชจ๋ธ๋ง(Discriminative Modeling) ๊ทธ๋ฆผ์„ ์‚ดํŽด๋ณธ ๋‹ค์Œ์— ํ•ด๋‹น ๊ทธ๋ฆผ์˜ style์„ ์ •ํ•˜๋Š” ์ผ์€ ๋ฌด์—‡์ธ๊ฐ€๋ฅผ ํŒ๋‹จํ•˜๋Š” ..

๐Ÿ‘๏ธ Computer Vision/GAN

[GAN] GAN(Generative Adversarial Network) ๊ธฐ์ดˆ ๊ฐœ๋…

Supervised Learning vs Unsupervised Learning ์ง€๋„ ํ•™์Šต Supervised Learning์—๋Š” ๋Œ€ํ‘œ์ ์œผ๋กœ Discriminative Model์ด ์žˆ๋‹ค. ์ด์—๋Š” ๋กœ์ง€์Šคํ‹ฑ ํšŒ๊ท€๋ถ„์„, ๋‰ด๋Ÿด ๋„คํŠธ์›Œํฌ ๋“ฑ์ด ํ•ด๋‹น๋œ๋‹ค. Discriminative Model์€ Input์ด ์ฃผ์–ด์ง€๋ฉด Input์ด ํ•ด๋‹นํ•˜๋Š” ํด๋ž˜์Šค๋ฅผ ๋งž์ถ”๊ธฐ ์œ„ํ•ด ํ•™์Šตํ•˜๊ฒŒ ๋œ๋‹ค. ๋น„์ง€๋„ ํ•™์Šต Unsupervised Learning์—๋Š” ๋Œ€ํ‘œ์ ์œผ๋กœ Generative Model์ด ์žˆ๋‹ค. Generative ๋ชจ๋ธ์€ Label์ด ์—†์ด ํ•™์Šตํ•˜๊ฒŒ ๋˜๋ฉฐ, ํ•™์Šต ๋ฐ์ดํ„ฐ์˜ ๋ถ„ํฌ๋ฅผ ํ•™์Šตํ•˜๋Š” ๊ฒƒ์ด ๋ชฉ์ ์ด๋‹ค. GAN(Generative Adversarial Network)์ด๋ž€? GAN(Generative Adversarial Network)์ด..

๐Ÿ‘๏ธ Computer Vision/Convolutional Neural Network

[Paper] ๋”ฅ๋Ÿฌ๋‹์„ ํ™œ์šฉํ•œ ๋ฐ˜๋„์ฒด ์›จ์ดํผ ๋ถˆ๋Ÿ‰ ์œ ํ˜• ๊ตฌ๋ถ„ ๋ชจ๋ธ์— ๊ด€ํ•œ ์—ฐ๊ตฌ

Paper ๋”ฅ๋Ÿฌ๋‹์„ ํ™œ์šฉํ•œ ๋ฐ˜๋„์ฒด ์›จ์ดํผ ๋ถˆ๋Ÿ‰ ์œ ํ˜• ๊ตฌ๋ถ„ ๋ชจ๋ธ์— ๊ด€ํ•œ ์—ฐ๊ตฌ(๋ฐฑ์„ ์žฌ, ์ด๋ฏผํ˜) Summary 0. Abstract ๊ธฐ์กด ์‚ฐ์—…ํ˜„์žฅ์—์„œ๋Š” ๋ฐ˜๋„์ฒด ์›จ์ดํผ ๋งต์„ ์ง์ ‘ ํ™•์ธํ•˜์—ฌ ๋ถˆ๋Ÿ‰์„ ์„ ๋ณ„ํ•œ๋‹ค. ์œก์•ˆ์„ ํ†ตํ•œ ์›จ์ดํผ ์„ ๋ณ„๊ณผ์ •์€ ํญ์ฆํ•˜๋Š” ์‹œ์žฅ์˜ ์ˆ˜์š”๋ฅผ ์ถฉ์กฑ์‹œํ‚ฌ ์ˆ˜ ์—†๋‹ค. ๋”ฐ๋ผ์„œ ์ธ๊ฐ„๋ณด๋‹ค ์‹ ์†, ์ •ํ™•ํ•œ ๋ฐ˜๋„์ฒด ์›จ์ดํผ ๋ถˆ๋Ÿ‰์„ ๊ฒ€์ถœํ•˜์—ฌ ์ž๋™ํ™”์— ๊ธฐ์—ฌํ•  ์ˆ˜ ์žˆ๋Š” AI ๊ธฐ์ˆ ์„ ์ œ์‹œํ•œ๋‹ค. ์ด๋ฅผ ์œ„ํ•ด ๋‹ค์ธตํผ์…‰ํŠธ๋ก (MLP)๊ณผ ํ•ฉ์„ฑ๊ณฑ ์‹ ๊ฒฝ๋ง(CNN)์„ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•œ 2๊ฐ€์ง€ ์ธ๊ณต์ง€๋Šฅ ๋ชจ๋ธ์„ ๊ณ ์•ˆํ•˜์˜€๊ณ , ์‹คํ—˜ ๊ฒฐ๊ณผ CNN ๋ชจ๋ธ์ด ์ •ํ™•๋„๊ฐ€ ํ‰๊ท  6.4% ๋” ๋†’์•˜์Œ์„ ํ™•์ธํ–ˆ๋‹ค. 1. Introduction ๋ฐ˜๋„์ฒด ์นฉ์€ ์ˆ˜๋งŽ์€ ์ œ์กฐ๊ณต์ •์„ ๊ฑฐ์นœ ๋’ค ๋งˆ์ง€๋ง‰ ์ ˆ์ฐจ์ธ ํ…Œ์ŠคํŠธ๋ฅผ ํ†ตํ•ด ์–‘ํ’ˆ, ๋ถˆ๋Ÿ‰ํ’ˆ์„ ์„ ๋ณ„ํ•œ๋‹ค. ๋ฐ˜๋„์ฒด ์ˆ˜์œจ ํ–ฅ์ƒ๊ณผ ์ง๊ฒฐ๋œ ..

Junyeong Son
'๐Ÿ‘๏ธ Computer Vision' ์นดํ…Œ๊ณ ๋ฆฌ์˜ ๊ธ€ ๋ชฉ๋ก