[Week 7-3] ๐Ÿ‘€Object detection

Jadeยท2021๋…„ 3์›” 10์ผ
0

๋ถ€์ŠคํŠธ์บ ํ”„ AI Tech

๋ชฉ๋ก ๋ณด๊ธฐ
30/54

7์ฃผ์ฐจ ์ˆ˜์š”์ผ

  • Object detection
  • Two-stage detector
  • One-stage detector

๐Ÿ“[Object detection]

classification๊ณผ Box localization์„ ๋™์‹œ์— ์ˆ˜ํ–‰ํ•˜๋Š” ์ž‘์—…์œผ๋กœ, ์ด๋ฏธ์ง€ ๋‚ด์—์„œ ๋ฐ”์šด๋”ฉ ๋ฐ•์Šค๋ฅผ ์ฐพ๊ณ (์œ„์น˜ ์ •๋ณด), ๊ทธ ์•ˆ์— ์–ด๋–ค ๋ฌผ์ฒด๊ฐ€ ์žˆ๋Š”์ง€ ๋ถ„๋ฅ˜ํ•œ๋‹ค(์นดํ…Œ๊ณ ๋ฆฌ ์ •๋ณด). ์ž์œจ์ฃผํ–‰์ด๋‚˜ OCR(Optical Character Recognition) ๋“ฑ์— ํ•„์š”ํ•œ ๊ธฐ์ˆ ์ด๋‹ค.

๊ธฐ์กด์˜ object detection ๋ฐฉ์‹์€ ์˜์ƒ ๋‚ด์˜ ๊ฒฝ๊ณ„์„ (gradient)์„ ์ฐพ์•„๋‚ด์–ด ๊ด€์‹ฌ ๋ฌผ์ฒด์ธ์ง€ ํŒŒ์•…ํ•˜๋Š” Gradient-based detector, ์ƒ‰์ด๋‚˜ gradient์˜ ๋ถ„ํฌ๊ฐ€ ๋น„์Šทํ•œ ์˜์—ญ์„ ๋ฌถ์–ด์„œ ๋ฐ”์šด๋”ฉ ๋ฐ•์Šค(๋ฌผ์ฒด์˜ ํ›„๋ณด๊ตฐ)์„ ์ œ์•ˆํ•˜๋Š” Selective search ๋“ฑ ์‚ฌ๋žŒ์˜ ์ง๊ด€์„ ํ†ตํ•ด ์„ค๊ณ„๋œ ์•Œ๊ณ ๋ฆฌ์ฆ˜์ด์—ˆ๋‹ค.


๐Ÿ“[Two-stage detector]

selective search ๋“ฑ์„ ์ด์šฉํ•ด ๋ฐ”์šด๋”ฉ ๋ฐ•์Šค๋ฅผ ์ถ”์ถœํ•˜๋Š” ์ž‘์—…์„ region proposal์ด๋ผ๊ณ  ๋ถ€๋ฅธ๋‹ค. two-stage detector ๋ชจ๋ธ์€ region proposal์„ ํ†ตํ•ด ์ถ”์ถœํ•œ ๊ด€์‹ฌ ์˜์—ญ(Region Of Interest) ์ •๋ณด๋ฅผ ์‚ฌ์šฉํ•œ๋‹ค. ROI๋ฅผ ์ถ”์ถœํ•˜๋Š” ๊ณผ์ •์„ ๊ฑฐ์ณ์•ผ ํ•˜๊ธฐ ๋•Œ๋ฌธ์— ์†๋„๊ฐ€ ๋Š๋ฆฌ๋‹ค.

  • R-CNN
    ์ดˆ๊ธฐ ๋”ฅ ๋Ÿฌ๋‹ object detection ์•Œ๊ณ ๋ฆฌ์ฆ˜์ธ R-CNN์—์„œ๋Š” ์ถ”์ถœํ•œ ๋ฐ”์šด๋”ฉ ๋ฐ•์Šค๊ฐ€ CNN์˜ ์ž…๋ ฅ์œผ๋กœ ์ ์ ˆํ•œ ํฌ๊ธฐ๊ฐ€ ๋˜๋„๋ก wrapingํ•œ ๋‹ค์Œ, fine tuning์„ ๊ฑฐ์นœ ์‚ฌ์ „ ํ›ˆ๋ จ๋œ CNN ๋ชจ๋ธ์— ์ž…๋ ฅํ•ด ์นดํ…Œ๊ณ ๋ฆฌ๋ฅผ ํŒ๋ณ„ํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ์‚ฌ์šฉํ•œ๋‹ค. ์ถ”์ถœ๋œ ๋ชจ๋“  ๋ฐ”์šด๋”ฉ ๋ฐ•์Šค์— ๋Œ€ํ•ด ์ž‘์—…์„ ์ˆ˜ํ–‰ํ•˜๋ฏ€๋กœ ์‹œ๊ฐ„์ด ๋Š๋ฆฌ๊ณ  ํ•™์Šต์„ ํ†ตํ•œ ์„ฑ๋Šฅ ํ–ฅ์ƒ์— ํ•œ๊ณ„๊ฐ€ ์žˆ๋‹ค๋Š” ๋‹จ์ ์ด ์žˆ๋‹ค.

  • Fast R-CNN
    R-CNN์˜ ์†๋„๋ฅผ ๊ฐœ์„ ํ•˜๊ธฐ ์œ„ํ•œ ์•Œ๊ณ ๋ฆฌ์ฆ˜์œผ๋กœ, CNN์˜ ๊ฒฐ๊ณผ๋กœ ์–ป์€ ํŠน์„ฑ ๋งต์„ ์žฌํ™œ์šฉํ•˜๋Š” ๋ฐฉ์‹์„ ์‚ฌ์šฉํ•œ๋‹ค. ๋จผ์ € CNN ๋ ˆ์ด์–ด๋ฅผ ํ†ตํ•ด ํŠน์„ฑ ๋งต์„ ์–ป๋Š”๋‹ค. ๊ทธ๋ฆฌ๊ณ  selective search ๋“ฑ์„ ํ†ตํ•ด ๋ฐ”์šด๋”ฉ ๋ฐ•์Šค๋“ค์„ ์ถ”์ถœํ•œ๋‹ค. ๋ฐ”์šด๋”ฉ ๋ฐ•์Šค ์ค‘ ROI์— ํ•ด๋‹นํ•˜๋Š” ๊ฒƒ์„ ์•„๊นŒ ๊ตฌํ•œ ์ „์ฒด ์ด๋ฏธ์ง€์˜ ํŠน์„ฑ ๋งต์— ํˆฌ์˜ํ•˜๋ฉด ROI์— ๋Œ€ํ•œ ํŠน์„ฑ ๋งต์„ ์–ป์„ ์ˆ˜ ์žˆ๋‹ค. ์ดํ›„์— ๋ฐ”์šด๋”ฉ ๋ฐ•์Šค๋ฅผ ๋” ์ž์„ธํžˆ ์ถ”์ •ํ•˜๊ธฐ ์œ„ํ•œ regressor์™€ ROI์˜ ์นดํ…Œ๊ณ ๋ฆฌ ๋ถ„๋ฅ˜๋ฅผ ์œ„ํ•œ classifier๋ฅผ ํ†ต๊ณผํ•˜๊ฒŒ ๋œ๋‹ค. ํŠน์„ฑ ๋งต์„ ์žฌํ™œ์šฉํ•˜์—ฌ ์†๋„๊ฐ€ ํ–ฅ์ƒ๋˜์—ˆ์ง€๋งŒ ์—ฌ์ „ํžˆ ์‚ฌ๋žŒ์˜ ์ง๊ด€์— ์˜ํ•ด ์„ค๊ณ„๋œ selective search ์•Œ๊ณ ๋ฆฌ์ฆ˜์— ์˜์ง€ํ•œ๋‹ค. ์‚ฌ๋žŒ์— ์˜ํ•œ ์‚ฌ์ „ ์ž‘์—…์ด ํ•„์š”ํ•˜๊ธฐ ๋•Œ๋ฌธ์— ๋ฐ์ดํ„ฐ๋งŒ ๊ฐ€์ง€๊ณ  ์„ฑ๋Šฅ์„ ์˜ฌ๋ฆฌ๊ธฐ์—๋Š” ํ•œ๊ณ„๊ฐ€ ์žˆ๋‹ค.

  • Faster R-CNN
    Fast R-CNN์˜ ๋‹จ์ ์„ ๊ฐœ์„ ํ•˜๊ธฐ ์œ„ํ•ด ๋ฐ”์šด๋”ฉ ๋ฐ•์Šค๋ฅผ ์ถ”์ถœํ•˜๋Š” ์ž‘์—…์ธ region proposal์„ neural network ๊ธฐ๋ฐ˜์œผ๋กœ ๋Œ€์ฒดํ•˜์—ฌ ๋ชจ๋“  ๋ถ€๋ถ„์ด neural network ๊ธฐ๋ฐ˜์ธ ์ตœ์ดˆ์˜ object detection ์•Œ๊ณ ๋ฆฌ์ฆ˜์ด๋‹ค.

    ๐Ÿ“ŒIoU(Intersection over Union)
    object detection์— ํ”ํžˆ ์“ฐ์ด๋Š” ์ฒ™๋„๋กœ, Area of Overlap / Area of Union์œผ๋กœ ๊ณ„์‚ฐํ•œ๋‹ค.


๐Ÿ“[One-stage detector]

region proposal์„ ํ†ตํ•ด ROI๋ฅผ ์ฐพ๋Š” ๊ณผ์ •์„ ๊ฑฐ์น˜์ง€ ์•Š๊ณ  ๋ฐ”๋กœ regression์„ ํ†ตํ•ด ๋ฐ”์šด๋”ฉ ๋ฐ•์Šค๋ฅผ ์ฐพ๋Š”๋‹ค. ์ฐพ์•„๋‚ธ ๋ฐ”์šด๋”ฉ ๋ฐ•์Šค ์˜์—ญ์— ๋Œ€ํ•ด ๋‚ด์šฉ๋ฌผ์„ ๋ถ„๋ฅ˜ํ•˜๋Š” classification๊ณผ ๋ฐ•์Šค ์˜์—ญ์„ ๋” ์ •ํ™•ํ•˜๊ฒŒ ๋งŒ๋“œ๋Š” refinement๋ฅผ ์ˆ˜ํ–‰ํ•œ๋‹ค. region proposal ๋‹จ๊ณ„๋ฅผ ์ƒ๋žตํ–ˆ๊ธฐ ๋•Œ๋ฌธ์— ์†๋„๊ฐ€ ๋น ๋ฅด๊ณ , ์‹ค์‹œ๊ฐ„ ์˜์ƒ์—๋„ ์ ์šฉ ๊ฐ€๋Šฅํ•˜๋‹ค.

  • YOLO(you only look once)
    ์ž…๋ ฅ๋˜๋Š” ์ด๋ฏธ์ง€๋ฅผ ์ผ์ • ๊ฐ„๊ฒฉ์˜ ๊ทธ๋ฆฌ๋“œ๋กœ ๋‚˜๋ˆ  ๋ฐ”์šด๋”ฉ ๋ฐ•์Šค์™€ ๊ฐ ๋ฐ•์Šค์˜ confidence ์ ์ˆ˜๋ฅผ ๊ตฌํ•œ๋‹ค.

  • SSD(single shot multibox detector)
    ์–ด๋ ค์›Œ...

profile
๋ฐ˜๊ฐ€์›Œ์šฉ

0๊ฐœ์˜ ๋Œ“๊ธ€