OCR

주제무 · June 14, 2022

OCR

Optical Character Recognition

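OCR consists of two stages:

1. Text Detection: locate the regions of an image that contain text
2. Text Recognition: read the characters inside each detected region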

Text Detection

Scene Text Detection with Polygon Offsetting and Border Augmentation

Regression-based

SSD: Single Shot MultiBox Detector https://arxiv.org/abs/1512.02325

TextBoxes https://arxiv.org/abs/1611.06779

Segmentation-based

This approach is popular in recent papers and can be trained stably.
PixelLink
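
To make the segmentation-based idea concrete, here is a minimal sketch of the post-processing step: threshold a per-pixel text score map and group neighboring text pixels into boxes. This is a simplification, not PixelLink itself. PixelLink also predicts learned links between neighboring pixels, whereas this sketch assumes plain 4-connectivity. The function name `boxes_from_score_map` and the `threshold` value are hypothetical.

```python
from collections import deque


def boxes_from_score_map(score_map, threshold=0.5):
    """Group above-threshold pixels into 4-connected components.

    score_map: 2D list of text probabilities, one per pixel (assumed input).
    Returns one axis-aligned box (x_min, y_min, x_max, y_max) per component.
    """
    height = len(score_map)
    width = len(score_map[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    boxes = []

    for y in range(height):
        for x in range(width):
            if seen[y][x] or score_map[y][x] < threshold:
                continue
            # Breadth-first search over neighboring text pixels.
            queue = deque([(x, y)])
            seen[y][x] = True
            x_min = x_max = x
            y_min = y_max = y
            while queue:
                cx, cy = queue.popleft()
                x_min, x_max = min(x_min, cx), max(x_max, cx)
                y_min, y_max = min(y_min, cy), max(y_max, cy)
                for nx, ny in ((cx + 1, cy), (cx - 1, cy), (cx, cy + 1), (cx, cy - 1)):
                    if (0 <= nx < width and 0 <= ny < height
                            and not seen[ny][nx]
                            and score_map[ny][nx] >= threshold):
                        seen[ny][nx] = True
                        queue.append((nx, ny))
            boxes.append((x_min, y_min, x_max, y_max))
    return boxes
```

In a real detector the score map would come from a segmentation network, and the boxes would usually be fitted as rotated rectangles or polygons rather than axis-aligned ones.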

End to End

A single network handles both Text Detection and Text Recognition.
FOTS

vs Object Detection

OCR is harder than general object detection, for three reasons:

1. High density: text instances are packed closely together
2. Similar-looking characters, e.g. i, I, T, l
3. Many languages to support

Text Recognition

CRNN
GRCNN
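
CRNN feeds convolutional features through a recurrent layer and turns the per-frame predictions into text with a CTC transcription step. The sketch below shows only that last step as greedy CTC decoding: collapse repeated labels, then drop blanks. The function name `ctc_greedy_decode`, the `alphabet` string, and the choice of index 0 as the blank are assumptions for illustration.

```python
def ctc_greedy_decode(frame_labels, alphabet, blank=0):
    """Decode per-frame argmax labels into a string.

    frame_labels: the most likely class index at each time step.
    alphabet: string where alphabet[i] is the character for class i;
              the character at the blank index is never emitted.
    """
    decoded = []
    previous = None
    for label in frame_labels:
        # Emit a character only when the label changes and is not blank.
        if label != previous and label != blank:
            decoded.append(alphabet[label])
        previous = label
    return "".join(decoded)
```

For example, with `alphabet = "-helo"` the frame sequence `[1, 1, 0, 2, 3, 3, 0, 3, 4, 4]` decodes to `"hello"`: the blank between the two runs of `l` is what lets the double letter survive.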

Code

keras-ocr

https://keras-ocr.readthedocs.io/en/latest/index.html

However, I recommend the GitHub repository below.
https://github.com/faustomorales/keras-ocr
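
A minimal usage sketch, assuming keras-ocr is installed (`pip install keras-ocr`) and can download its pretrained weights on first use. The calls `keras_ocr.pipeline.Pipeline`, `keras_ocr.tools.read`, and `recognize` follow the project's README. The names `run_keras_ocr`, `words_in_reading_order`, and `row_height` are hypothetical helpers introduced here.

```python
def run_keras_ocr(image_sources):
    """Run detection and recognition on a list of image paths or URLs."""
    import keras_ocr  # imported lazily; requires `pip install keras-ocr`

    # Builds the pretrained detector + recognizer (weights download on first use).
    pipeline = keras_ocr.pipeline.Pipeline()
    images = [keras_ocr.tools.read(source) for source in image_sources]
    # One list per image; each item is a (word, box) pair,
    # where box holds the four corner points of the word.
    return pipeline.recognize(images)


def words_in_reading_order(predictions, row_height=20):
    """Sort one image's (word, box) pairs top-to-bottom, then left-to-right.

    row_height is an assumed line height in pixels used to bucket words
    into rows; tune it to the image resolution.
    """
    def sort_key(prediction):
        _, box = prediction
        x_left = min(point[0] for point in box)
        y_top = min(point[1] for point in box)
        return (int(y_top // row_height), x_left)

    return [word for word, _ in sorted(predictions, key=sort_key)]
```

Typical use would be `prediction_groups = run_keras_ocr(["sample.jpg"])` followed by `words_in_reading_order(prediction_groups[0])`, where `sample.jpg` is a placeholder path.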

tesseract

References

https://tv.naver.com/v/4578167
https://github.com/clovaai/CRAFT-pytorch
https://blogs.sas.com/content/saskorea/2018/12/21/딥러닝을-활용한-객체-탐지-알고리즘-이해하기/
