Korean OCR with opencv, pytesseract - Improving the Recognition Rate

yun · September 6, 2023

OCR contour opencv pytesseract tesseract

Contour

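A contour is a line connecting points that share the same value (for example, the same color or intensity).
It is used to detect the outlines of objects in an image.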

Hands-on

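import cv2
import imutils

# `edged` is assumed to be the edge map (e.g. Canny output) computed in an earlier step.
# Find the contours and sort them by area, largest first (reverse=True).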
cnts = cv2.findContours(edged.copy(), cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
cnts = imutils.grab_contours(cnts)
cnts = sorted(cnts, key=cv2.contourArea, reverse=True)

photo_cnt = None

# Loop over the sorted contours and approximate each one as a polygon
for c in cnts:
    peri = cv2.arcLength(c, True)
    approx = cv2.approxPolyDP(c, 0.02 * peri, True)

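    print(len(approx))  # vertex counts observed for this image: 5, 3, 11, 16

    # We want the outline of the whole image: the largest contour, which has 5 vertices here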
    if len(approx) == 5:
        photo_cnt = approx
        break

# Raise an error if no matching outline was found
if photo_cnt is None:
    raise Exception("Could not find receipt outline.")