[2025/W43] ๐Ÿค— Weekly AI Research

Skyยท2025๋…„ 10์›” 24์ผ

Weekly AI Research Digest

๋ชฉ๋ก ๋ณด๊ธฐ
70/89

๋กฑ์ปจํ…์ŠคํŠธ ์–ดํ…์…˜ ๋ถ„๋ฆฌ, ๊ฒฝ๋Ÿ‰ ๋ฉ”๋ชจ๋ฆฌ, RL ์•ˆ์ •ํ™”๋กœ LLM ํ›ˆ๋ จยท์ถ”๋ก  ์ตœ์ ํ™”
์ž์œจ ๋ฐ์ดํ„ฐ ๊ณผํ•™, ์˜ด๋‹ˆ๋ชจ๋‹ฌ, ํ›ˆ๋ จ ์—†๋Š” 3D ํŽธ์ง‘ ๋ฐ ์ƒˆ๋กœ์šด ํ‰๊ฐ€ ๋ฒค์น˜๋งˆํฌ

A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning

Paper, Project
์ด ๋…ผ๋ฌธ์€ LLM ์ถ”๋ก  ์„ฑ๋Šฅ์„ ๋†’์ด๋Š” ์ƒ˜ํ”Œ๋ง ๊ธฐ๋ฐ˜ ํ…Œ์ŠคํŠธ ์‹œ๊ฐ„ ์Šค์ผ€์ผ๋ง ๋ฐฉ๋ฒ•์˜ ์ด๋ก ์  ํ† ๋Œ€๋ฅผ ์ตœ์ดˆ๋กœ ์ œ์‹œํ•œ๋‹ค. ๊ธฐ์กด์˜ Self-Consistency๋Š” ์ถ”์ • ์˜ค๋ฅ˜๊ฐ€ ๋†’๊ณ  Perplexity๋Š” ๋ชจ๋ธ๋ง ์˜ค๋ฅ˜๊ฐ€ ํฌ๋‹ค๋Š” ํ•œ๊ณ„๋ฅผ ์ด๋ก ์ ์œผ๋กœ ๋ถ„์„ํ•˜๋ฉฐ, ์ด ๋‘ ๋ฐฉ๋ฒ•์˜ ์žฅ์ ์„ ๊ฒฐํ•ฉํ•œ RPC(Reasoning Pruning and Perplexity Consistency)๋ผ๋Š” ์ƒˆ๋กœ์šด ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ•œ๋‹ค. RPC๋Š” ๋‚ฎ์€ ํ™•๋ฅ ์˜ ์ถ”๋ก  ๊ฒฝ๋กœ๋ฅผ ์ œ๊ฑฐํ•˜๊ณ  ์ˆ˜๋ ด ์†๋„๋ฅผ ๋†’์—ฌ, ๊ธฐ์กด Self-Consistency์™€ ๋น„์Šทํ•œ ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ•˜๋ฉด์„œ๋„ ์ƒ˜ํ”Œ๋ง ๋น„์šฉ์„ 50% ์ ˆ๊ฐํ•˜๊ณ  ์‹ ๋ขฐ๋„๋ฅผ ํ–ฅ์ƒ์‹œํ‚จ๋‹ค.

LightMem: Lightweight and Efficient Memory-Augmented Generation

Paper, Project
์ด ๋…ผ๋ฌธ์€ LLM์ด ๊ณผ๊ฑฐ ์ƒํ˜ธ์ž‘์šฉ ์ •๋ณด๋ฅผ ํšจ์œจ์ ์œผ๋กœ ํ™œ์šฉํ•˜๋„๋ก ๋•๋Š” ๊ฒฝ๋Ÿ‰ ๋ฉ”๋ชจ๋ฆฌ ์‹œ์Šคํ…œ LightMem์„ ์ œ์•ˆํ•œ๋‹ค. ์ธ๊ฐ„์˜ ๊ธฐ์–ต ๋ชจ๋ธ(๊ฐ๊ฐ-๋‹จ๊ธฐ-์žฅ๊ธฐ)์— ์ฐฉ์•ˆํ•œ 3๋‹จ๊ณ„ ๋ฉ”๋ชจ๋ฆฌ ๊ตฌ์กฐ๋ฅผ ํŠน์ง•์œผ๋กœ ํ•˜๋ฉฐ, ํŠนํžˆ '์ˆ˜๋ฉด ์‹œ๊ฐ„ ์—…๋ฐ์ดํŠธ' ๋ฐฉ์‹์„ ๋„์ž…ํ•ด ์˜จ๋ผ์ธ ์ถ”๋ก ๊ณผ ๋ฉ”๋ชจ๋ฆฌ ํ†ตํ•ฉ ๊ณผ์ •์„ ๋ถ„๋ฆฌํ•จ์œผ๋กœ์จ ๊ธฐ์กด ๋ฉ”๋ชจ๋ฆฌ ์‹œ์Šคํ…œ์˜ ๋†’์€ ์‹œ๊ฐ„ ๋ฐ ๊ณ„์‚ฐ ์˜ค๋ฒ„ํ—ค๋“œ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•œ๋‹ค. LightMem์€ ์‹คํ—˜์—์„œ ๋†’์€ ์ •ํ™•๋„ ํ–ฅ์ƒ๊ณผ ๋”๋ถˆ์–ด ํ† ํฐ ์‚ฌ์šฉ๋Ÿ‰, API ํ˜ธ์ถœ, ๋Ÿฐํƒ€์ž„์„ ์ตœ๋Œ€ 100๋ฐฐ ์ด์ƒ ํš๊ธฐ์ ์œผ๋กœ ์ ˆ๊ฐ์‹œํ‚จ๋‹ค.

Efficient Long-context Language Model Training by Core Attention Disaggregation

Paper
์ด ๋…ผ๋ฌธ์€ ๊ธด ์ปจํ…์ŠคํŠธ LLM ํ›ˆ๋ จ ์‹œ, ๊ณ„์‚ฐ๋Ÿ‰์ด 2์ฐจ(quadratic)๋กœ ์ฆ๊ฐ€ํ•˜๋Š” 'ํ•ต์‹ฌ ์–ดํ…์…˜'(softmax(QKT)Vsoftmax(QK^T)V) ์—ฐ์‚ฐ์ด ๋กœ๋“œ ๋ถˆ๊ท ํ˜•๊ณผ ์ง€์—ฐ(straggler)์„ ์œ ๋ฐœํ•˜๋Š” ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๋Š” CAD(Core Attention Disaggregation) ๊ธฐ์ˆ ์„ ์ œ์‹œํ•œ๋‹ค. CAD๋Š” ํ›ˆ๋ จ ๊ฐ€๋Šฅํ•œ ํŒŒ๋ผ๋ฏธํ„ฐ๊ฐ€ ์—†๋Š” ํ•ต์‹ฌ ์–ดํ…์…˜ ๊ณ„์‚ฐ์„ ๋‚˜๋จธ์ง€ ๋ชจ๋ธ ๊ณ„์ธต๊ณผ ๋ถ„๋ฆฌํ•˜์—ฌ ๋ณ„๋„์˜ '์ „์šฉ ์–ดํ…์…˜ ์„œ๋ฒ„' ํ’€์—์„œ ์‹คํ–‰ํ•˜๋Š” ๋ฐฉ์‹์ด๋‹ค. ์ด๋ฅผ ๊ตฌํ˜„ํ•œ DistCA ์‹œ์Šคํ…œ์€ ์–ดํ…์…˜ ์ž‘์—…์„ ํ† ํฐ ๋ ˆ๋ฒจ๋กœ ๋ถ„ํ• ํ•˜๊ณ  ๋™์ ์œผ๋กœ ์žฌ๋ฐฐ์น˜ํ•˜์—ฌ ์™„๋ฒฝํ•œ ๋กœ๋“œ ๋ฐธ๋Ÿฐ์‹ฑ์„ ๋‹ฌ์„ฑํ•˜๋ฉฐ, ์ตœ๋Œ€ 512k ํ† ํฐ ๊ธธ์ด์—์„œ ํ›ˆ๋ จ ์ฒ˜๋ฆฌ๋Ÿ‰์„ 1.35๋ฐฐ๊นŒ์ง€ ํ–ฅ์ƒ์‹œํ‚จ๋‹ค.

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper, Project
์ด ๋…ผ๋ฌธ์€ ๊ธด ์ปจํ…์ŠคํŠธ ์ถ”๋ก  ์‹œ ๋ฐœ์ƒํ•˜๋Š” I/O ๋ฐ ๊ณ„์‚ฐ ์˜ค๋ฒ„ํ—ค๋“œ๋ฅผ ํš๊ธฐ์ ์œผ๋กœ ์ค„์ด๋Š” Ring-linear ๋ชจ๋ธ ์‹œ๋ฆฌ์ฆˆ๋ฅผ ์†Œ๊ฐœํ•œ๋‹ค. ์ด ๋ชจ๋ธ๋“ค์€ ์„ ํ˜• ์–ดํ…์…˜(Linear Attention)๊ณผ ์†Œํ”„ํŠธ๋งฅ์Šค ์–ดํ…์…˜(Softmax Attention)์„ ํšจ๊ณผ์ ์œผ๋กœ ํ†ตํ•ฉํ•œ ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ์•„ํ‚คํ…์ฒ˜๋ฅผ ์ฑ„ํƒํ•˜์—ฌ, ๋ฐ€์ง‘(dense) ๋ชจ๋ธ ๋Œ€๋น„ ์ถ”๋ก  ๋น„์šฉ์„ 1/10 ์ˆ˜์ค€์œผ๋กœ ์ ˆ๊ฐํ•œ๋‹ค. ๋˜ํ•œ ์ž์ฒด ๊ฐœ๋ฐœํ•œ ๊ณ ์„ฑ๋Šฅ FP8 ์—ฐ์‚ฐ์ž ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ 'linghe'๋ฅผ ํ™œ์šฉํ•ด ํ›ˆ๋ จ ํšจ์œจ์„ 50% ํ–ฅ์ƒ์‹œ์ผฐ์œผ๋ฉฐ, ์ด ์•„ํ‚คํ…์ฒ˜๋ฅผ ํ†ตํ•ด ๋ณต์žกํ•œ ์ถ”๋ก  ๋ฒค์น˜๋งˆํฌ์—์„œ SOTA ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ•œ๋‹ค.

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper, Project
์ด ๋…ผ๋ฌธ์€ LLM์„ ์˜คํ”„-ํด๋ฆฌ์‹œ(off-policy) ๊ฐ•ํ™”ํ•™์Šต์œผ๋กœ ํ›ˆ๋ จํ•  ๋•Œ ๋ฐœ์ƒํ•˜๋Š” ๋ถˆ์•ˆ์ •์„ฑ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๋Š” BAPO ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ œ์•ˆํ•œ๋‹ค. ๊ธฐ์กด ๋ฐฉ์‹์€ '์Œ์˜ ์ด์ (negative-advantage)' ์ƒ˜ํ”Œ์ด ์ตœ์ ํ™”๋ฅผ ์ง€๋ฐฐํ•˜๋Š” ๋ถˆ๊ท ํ˜•๊ณผ ๊ณ ์ •๋œ ํด๋ฆฌํ•‘ ๋ฉ”์ปค๋‹ˆ์ฆ˜์ด ์—”ํŠธ๋กœํ”ผ ์ฆ๊ฐ€๋ฅผ ๋ง‰๋Š” ๋ฌธ์ œ๋กœ ์ธํ•ด ํ›ˆ๋ จ์ด ๋ถ•๊ดด๋  ์ˆ˜ ์žˆ์Œ์„ ๋ฐํ˜€๋‚ธ๋‹ค. BAPO๋Š” ํด๋ฆฌํ•‘ ๊ฒฝ๊ณ„๋ฅผ ๋™์ ์œผ๋กœ ์กฐ์ •ํ•˜๋Š” '์ ์‘ํ˜• ํด๋ฆฌํ•‘'์„ ๋„์ž…ํ•˜์—ฌ ๊ธ์ •์  ๊ธฐ์—ฌ์™€ ๋ถ€์ •์  ๊ธฐ์—ฌ์˜ ๊ท ํ˜•์„ ๋งž์ถ”๊ณ  ์—”ํŠธ๋กœํ”ผ๋ฅผ ๋ณด์กดํ•จ์œผ๋กœ์จ, ๋น ๋ฅด๊ณ  ์•ˆ์ •์ ์ธ ๋ฐ์ดํ„ฐ ํšจ์œจ์  ํ›ˆ๋ จ์„ ๋‹ฌ์„ฑํ•œ๋‹ค.

World-in-World: World Models in a Closed-Loop World

Paper, Project
์ด ๋…ผ๋ฌธ์€ ์ƒ์„ฑํ˜• ์›”๋“œ ๋ชจ๋ธ(WM)์ด ์‹ค์ œ '์ฒดํ™”๋œ ์—์ด์ „ํŠธ'์˜ ์ž„๋ฌด ์„ฑ๊ณต์— ๊ธฐ์—ฌํ•˜๋Š”์ง€(์ฒดํ™”๋œ ์œ ์šฉ์„ฑ) ํ‰๊ฐ€ํ•˜๋Š” ์ƒˆ๋กœ์šด ๋ฒค์น˜๋งˆํฌ ํ”Œ๋žซํผ 'World-in-World'๋ฅผ ์ œ์•ˆํ•œ๋‹ค. ๊ธฐ์กด ๋ฒค์น˜๋งˆํฌ๊ฐ€ ์‹œ๊ฐ์  ํ’ˆ์งˆ์—๋งŒ ์ดˆ์ ์„ ๋งž์ถ˜ '์˜คํ”ˆ ๋ฃจํ”„' ๋ฐฉ์‹์ด์—ˆ๋˜ ๊ฒƒ๊ณผ ๋‹ฌ๋ฆฌ, ์ด ํ”Œ๋žซํผ์€ ์‹ค์ œ ์—์ด์ „ํŠธ-ํ™˜๊ฒฝ ์ƒํ˜ธ์ž‘์šฉ์„ ๋ฐ˜์˜ํ•˜๋Š” 'ํ์‡„ ๋ฃจํ”„' ํ™˜๊ฒฝ์—์„œ ์ž„๋ฌด ์„ฑ๊ณต์„ ํ•ต์‹ฌ ์ง€ํ‘œ๋กœ ์‚ผ๋Š”๋‹ค. ์—ฐ๊ตฌ ๊ฒฐ๊ณผ, ์‹œ๊ฐ์  ํ’ˆ์งˆ๋ณด๋‹ค ์ œ์–ด ๊ฐ€๋Šฅ์„ฑ(controllability)์ด ์ž„๋ฌด ์„ฑ๊ณต์— ๋” ์ค‘์š”ํ•˜๋ฉฐ, ์ถ”๋ก  ์‹œ ๋” ๋งŽ์€ ๊ณ„์‚ฐ ์ž์›์„ ํ• ๋‹นํ•˜๋Š” ๊ฒƒ์ด ์„ฑ๋Šฅ ํ–ฅ์ƒ์— ํฐ ๋„์›€์ด ๋จ์„ ๋ฐํ˜€๋‚ธ๋‹ค.

DeepAnalyze: Agentic Large Language Models for Autonomous Data Science

Paper, Project
์ด ๋…ผ๋ฌธ์€ ์›์‹œ ๋ฐ์ดํ„ฐ๋ถ€ํ„ฐ ๋ถ„์„๊ฐ€ ์ˆ˜์ค€์˜ ์‹ฌ์ธต ์—ฐ๊ตฌ ๋ณด๊ณ ์„œ ์ž‘์„ฑ๊นŒ์ง€ ์ „ ๊ณผ์ •์„ ์ž์œจ์ ์œผ๋กœ ์ˆ˜ํ–‰ํ•˜๋Š” ์ตœ์ดˆ์˜ ์—์ด์ „ํŠธํ˜• LLM 'DeepAnalyze-8B'๋ฅผ ์ œ์•ˆํ•œ๋‹ค. ๊ธฐ์กด ์›Œํฌํ”Œ๋กœ์šฐ ๊ธฐ๋ฐ˜ ์—์ด์ „ํŠธ์˜ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•˜๊ธฐ ์œ„ํ•ด, ์ธ๊ฐ„ ๋ฐ์ดํ„ฐ ๊ณผํ•™์ž์˜ ํ•™์Šต ๊ถค์ ์„ ๋ชจ๋ฐฉํ•˜์—ฌ ์ ์ง„์ ์œผ๋กœ ์—ญ๋Ÿ‰์„ ์Šต๋“ํ•˜๋Š” '์ปค๋ฆฌํ˜๋Ÿผ ๊ธฐ๋ฐ˜ ์—์ด์ „ํŠธ ํ›ˆ๋ จ' ํŒจ๋Ÿฌ๋‹ค์ž„์„ ๋„์ž…ํ•œ๋‹ค. 8B ํŒŒ๋ผ๋ฏธํ„ฐ์˜ ์ด ๋ชจ๋ธ์€ ๊ณ ํ’ˆ์งˆ์˜ ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ๋ฅผ ์ƒ์„ฑํ•˜๋Š” '๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜ ๊ถค์  ํ•ฉ์„ฑ' ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ํ†ตํ•ด ํ•™์Šตํ–ˆ์œผ๋ฉฐ, ๊ฐ€์žฅ ์ง„๋ณด๋œ ์ƒ์šฉ LLM ๊ธฐ๋ฐ˜์˜ ๊ธฐ์กด ์›Œํฌํ”Œ๋กœ์šฐ ์—์ด์ „ํŠธ๋“ค์„ ๋Šฅ๊ฐ€ํ•˜๋Š” ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ•œ๋‹ค.

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

Paper, Project
์ด ๋…ผ๋ฌธ์€ ์—ฌ๋Ÿฌ ์–‘์‹์„ ๋™์‹œ์— ์ธ์‹ํ•˜๊ณ  ์ดํ•ดํ•˜๋Š” ๊ฐ•๋ ฅํ•œ ์˜คํ”ˆ์†Œ์Šค ์˜ด๋‹ˆ๋ชจ๋‹ฌ LLM์ธ OmniVinci๋ฅผ ์†Œ๊ฐœํ•œ๋‹ค. ์ด ๋ชจ๋ธ์€ ๋น„์ „๊ณผ ์˜ค๋””์˜ค ์ž„๋ฒ ๋”ฉ ์ •๋ ฌ์„ ๊ฐ•ํ™”ํ•˜๋Š” 'OmniAlignNet', ์‹ ํ˜ธ ๊ฐ„์˜ ์ƒ๋Œ€์  ์‹œ๊ฐ„ ์ˆœ์„œ๋ฅผ ํฌ์ฐฉํ•˜๋Š” 'Temporal Embedding Grouping', ์ ˆ๋Œ€์  ์‹œ๊ฐ„ ์ •๋ณด๋ฅผ ์ธ์ฝ”๋”ฉํ•˜๋Š” 'Constrained Rotary Time Embedding'์ด๋ผ๋Š” ์„ธ ๊ฐ€์ง€ ์ƒˆ๋กœ์šด ์•„ํ‚คํ…์ฒ˜๋ฅผ ํŠน์ง•์œผ๋กœ ํ•œ๋‹ค. 2,400๋งŒ ๊ฐœ์˜ ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ๋ฅผ ํฌํ•จํ•œ ํ๋ ˆ์ด์…˜๋œ ๋ฐ์ดํ„ฐ๋ฅผ ํ™œ์šฉํ•˜์—ฌ, ๊ฒฝ์Ÿ ๋ชจ๋ธ(Qwen2.5-Omni)๋ณด๋‹ค 6๋ฐฐ ์ ์€ ํ›ˆ๋ จ ํ† ํฐ์œผ๋กœ๋„ ๊ต์ฐจ ๋ชจ๋‹ฌ ์ดํ•ด, ์˜ค๋””์˜ค, ๋น„์ „ ๋ฒค์น˜๋งˆํฌ์—์„œ ๋” ๋›ฐ์–ด๋‚œ ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ•œ๋‹ค.

UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation

Paper, Project
์ด ๋…ผ๋ฌธ์€ ํ…์ŠคํŠธ-์ด๋ฏธ์ง€(T2I) ์ƒ์„ฑ ๋ชจ๋ธ์ด ํ”„๋กฌํ”„ํŠธ์˜ ์˜๋ฏธ๋ฅผ ์–ผ๋งˆ๋‚˜ ์ •ํ™•ํ•˜๊ฒŒ ๋ฐ˜์˜ํ•˜๋Š”์ง€ ํ‰๊ฐ€ํ•˜๋Š” ํ†ตํ•ฉ ๋ฒค์น˜๋งˆํฌ UniGenBench++๋ฅผ ์ œ์•ˆํ•œ๋‹ค. ๊ธฐ์กด ๋ฒค์น˜๋งˆํฌ๊ฐ€ ํ”„๋กฌํ”„ํŠธ ๋‹ค์–‘์„ฑ, ๋‹ค๊ตญ์–ด ์ง€์›, ์„ธ๋ถ„ํ™”๋œ ํ‰๊ฐ€ ๊ธฐ์ค€์ด ๋ถ€์กฑํ–ˆ๋˜ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•˜๊ธฐ ์œ„ํ•ด, 5๊ฐœ ๋ฉ”์ธ ํ…Œ๋งˆ์™€ 20๊ฐœ ํ•˜์œ„ ํ…Œ๋งˆ๋กœ ๊ตฌ์„ฑ๋œ 600๊ฐœ์˜ ๊ณ„์ธต์  ํ”„๋กฌํ”„ํŠธ๋ฅผ ์ œ๊ณตํ•œ๋‹ค. ๋˜ํ•œ, 10๊ฐœ์˜ ๊ธฐ๋ณธ ๊ธฐ์ค€๊ณผ 27๊ฐœ์˜ ํ•˜์œ„ ๊ธฐ์ค€์œผ๋กœ ์˜๋ฏธ ์ผ๊ด€์„ฑ์„ ์„ธ๋ฐ€ํ•˜๊ฒŒ ํ‰๊ฐ€ํ•˜๋ฉฐ, ์˜์–ด/์ค‘๊ตญ์–ด ๋ฐ ๋‹จ๋ฌธ/์žฅ๋ฌธ ๋ฒ„์ „์„ ๋ชจ๋‘ ํฌํ•จํ•˜์—ฌ ๋ชจ๋ธ์˜ ๊ฒฌ๊ณ ์„ฑ์„ ํ…Œ์ŠคํŠธํ•œ๋‹ค.

NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks

Paper, Project
์ด ๋…ผ๋ฌธ์€ ๋น„ํšจ์œจ์ ์ด๊ณ  ๋ถˆ์ผ์น˜ํ•˜๋ฉฐ ์›๋ณธ ํ›ผ์† ๋ฌธ์ œ๊ฐ€ ์žˆ๋˜ ๊ธฐ์กด 3D ๊ฐ์ฒด ํŽธ์ง‘ ๋ฐฉ์‹์„ ๊ฐœ์„ ํ•˜๊ธฐ ์œ„ํ•ด, ๋ณ„๋„์˜ ํ›ˆ๋ จ์ด๋‚˜ ๋งˆ์Šคํฌ๊ฐ€ ํ•„์š” ์—†๋Š”(training-free, mask-free) Nano3D ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•œ๋‹ค. Nano3D๋Š” FlowEdit ๊ธฐ์ˆ ์„ TRELLIS์— ํ†ตํ•ฉํ•˜์—ฌ ์ „๋ฉด ๋ทฐ ๋ Œ๋”๋ง์„ ๊ธฐ๋ฐ˜์œผ๋กœ ๊ตญ์†Œ์  ํŽธ์ง‘์„ ์ˆ˜ํ–‰ํ•˜๋ฉฐ, '์˜์—ญ ์ธ์‹ ๋ณ‘ํ•ฉ(Voxel/Slat-Merge)' ์ „๋žต์„ ํ†ตํ•ด ํŽธ์ง‘๋œ ์˜์—ญ๊ณผ ํŽธ์ง‘๋˜์ง€ ์•Š์€ ์˜์—ญ ๊ฐ„์˜ ๊ตฌ์กฐ์  ์ผ๊ด€์„ฑ์„ ๊ฐ•๋ ฅํ•˜๊ฒŒ ๋ณด์กดํ•œ๋‹ค. ๋˜ํ•œ, ์ด ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ํ™œ์šฉํ•˜์—ฌ 10๋งŒ ๊ฐœ ์ด์ƒ์˜ ๊ณ ํ’ˆ์งˆ 3D ํŽธ์ง‘ ์Œ์œผ๋กœ ๊ตฌ์„ฑ๋œ ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ์…‹ Nano3D-Edit-100k๋ฅผ ๊ตฌ์ถ•ํ•œ๋‹ค.

profile
XR๊ณผ AI์— ๊ด€์‹ฌ์ด ๋งŽ์€ Sky ์ž…๋‹ˆ๋‹ค.

0๊ฐœ์˜ ๋Œ“๊ธ€