Layout Parser

김형진·2022년 9월 21일

Layout Parser는

DIA(Document Image Analysis) Task에 사용된다

pip install layoutparser
Install the base layoutparser library with

pip install "layoutparser[layoutmodels]"
Install DL layout model toolkit

pip install "layoutparser[ocr]"
Install OCR toolkit

사용자를 위해 API도 제공한다.

import layoutparser as lp
model = lp.AutoLayoutModel('lp://EfficientDete/PubLayNet')
image = Image.open("path/to/image")
layout = model.detect(image)

문서 내에서 탐색 section을 지정해 줄 수 있다

image_width = image.size[0]
left_column = lp.Interval(0, image_width/2, axis = 'x')
layout.filter_by(left_column, center=True)

탐지된 Layout 구역에서 OCR을 수행할 수 있다.

ocr_agent = lp.TesseractAgent()
for layout_region in layout:
	image_segment = layout_region.crop(image)
    text = ocr_agent.detect(image_segment)

탐지된 Layout을 시각화 하기위한 API도 제공한다.

lp.draw_box(image,layout,box_width=1, show_element_id = True, box_alpha=0.25)

json, csv, pdf로 저장된 layout 데이터를 불러온다

layout = lp.load_json("path/to/json")
layout = lp.load_csv("path/to/csv")
pdf_layout = lp.load_pdf("path/to/pdf")

Example

코더