[2025/W45] ๐Ÿค— Weekly AI Research

Skyยท2025๋…„ 11์›” 8์ผ

Weekly AI Research Digest

๋ชฉ๋ก ๋ณด๊ธฐ
74/89

ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ์˜ ์ง„ํ™”: ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ์ถ”๋ก ๊ณผ ์ƒํ˜ธ์ž‘์šฉํ˜• Physical AI๋กœ์˜ ํ™•์žฅ ํ™•์‚ฐ ๋ชจ๋ธ
๊ณ ํฌ์†Œ์„ฑ MoE, ํ•˜๋“œ์›จ์–ด ์–‘์žํ™” ํ˜์‹ ์„ ํ†ตํ•œ AI ํšจ์œจ์„ฑ ๋ฐ ์•ˆ์ „์„ฑ ํ™•๋ณด

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper, Project
'Thinking with Video'๋Š” ํ…์ŠคํŠธ์™€ ์ด๋ฏธ์ง€ ๊ธฐ๋ฐ˜ ์ถ”๋ก ์˜ ์ •์ ์ธ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•˜๊ธฐ ์œ„ํ•ด Sora-2์™€ ๊ฐ™์€ ๋น„๋””์˜ค ์ƒ์„ฑ ๋ชจ๋ธ์„ ํ™œ์šฉํ•˜๋Š” ์ƒˆ๋กœ์šด ํŒจ๋Ÿฌ๋‹ค์ž„์„ ์ œ์•ˆํ•œ๋‹ค. ์ด ์ ‘๊ทผ๋ฒ•์€ ๋™์ ์ธ ํ”„๋กœ์„ธ์Šค์™€ ์—ฐ์†์ ์ธ ๋ณ€ํ™”๋ฅผ ํ†ต์ผ๋œ ์‹œ๊ฐ„์  ํ”„๋ ˆ์ž„์›Œํฌ ์•ˆ์—์„œ ํ†ตํ•ฉ์ ์œผ๋กœ ๋‹ค๋ฃจ๋ฉฐ, ์ด๋ฅผ ๊ฒ€์ฆํ•˜๊ธฐ ์œ„ํ•ด VideoThinkBench๋ผ๋Š” ๋ฒค์น˜๋งˆํฌ๋ฅผ ๊ฐœ๋ฐœํ–ˆ๋‹ค. ์ด ๋ฒค์น˜๋งˆํฌ์—์„œ Sora-2๋Š” ๋น„์ „ ๋ฐ ํ…์ŠคํŠธ ์ค‘์‹ฌ ์ž‘์—… ๋ชจ๋‘์—์„œ ๊ฐ•๋ ฅํ•œ ์ถ”๋ก  ๋Šฅ๋ ฅ์„ ์ž…์ฆํ•˜๋ฉฐ, ๋น„๋””์˜ค ์ƒ์„ฑ ๋ชจ๋ธ์ด ํ…์ŠคํŠธ์™€ ๋น„์ „์„ ์•„์šฐ๋ฅด๋Š” ํ†ตํ•ฉ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ์ถ”๋ก ๊ธฐ๋กœ์„œ์˜ ์ž ์žฌ๋ ฅ์„ ์ง€๋‹ˆ๊ณ  ์žˆ์Œ์„ ๋ณด์—ฌ์ค€๋‹ค.

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

Paper, Project
VCode๋Š” ๊ธฐ์กด AI ์—ฐ๊ตฌ๊ฐ€ ์†Œํ™€ํžˆ ๋‹ค๋ฃฌ '์‹œ๊ฐ ์ค‘์‹ฌ ์ฝ”๋”ฉ' ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด, ์ด๋ฏธ์ง€๋ฅผ ํ•ด์„ ๊ฐ€๋Šฅํ•˜๊ณ  ์‹คํ–‰ ๊ฐ€๋Šฅํ•œ SVG ์ฝ”๋“œ๋กœ ๋ณ€ํ™˜ํ•˜๋Š” ์ƒˆ๋กœ์šด ๋ฒค์น˜๋งˆํฌ๋ฅผ ์ œ์‹œํ•œ๋‹ค. ์ด ๋…ผ๋ฌธ์€ ์ƒ์„ฑ๋œ SVG๊ฐ€ ์›๋ณธ์˜ ์ƒ์ง•์  ์˜๋ฏธ๋ฅผ ์–ผ๋งˆ๋‚˜ ์ž˜ ๋ณด์กดํ•˜๋Š”์ง€ ํ‰๊ฐ€ํ•˜๋Š” CodeVQA ํ”„๋กœํ† ์ฝœ์„ ํ•จ๊ป˜ ์ œ์•ˆํ•˜๋ฉฐ, ๊ธฐ์กด VLM์˜ SVG ์ƒ์„ฑ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•˜๊ธฐ ์œ„ํ•ด ๋ฐ˜๋ณต์ ์ธ '์ˆ˜์ •(Revision)'๊ณผ '์‹œ๊ฐ์  ๋„๊ตฌ(Visual Tools)'๋ฅผ ํ™œ์šฉํ•˜๋Š” ์—์ด์ „ํŠธ ํ”„๋ ˆ์ž„์›Œํฌ VCoder๋ฅผ ๊ฐœ๋ฐœํ•ด ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ•œ๋‹ค.

Don't Blind Your VLA: Aligning Visual Representations for OOD Generalization

Paper, Project
'Don't Blind Your VLA'๋Š” VLM(์‹œ๊ฐ-์–ธ์–ด ๋ชจ๋ธ)์„ ํ–‰๋™(Action) ๋ฐ์ดํ„ฐ๋กœ ๋ฏธ์„ธ ์กฐ์ •ํ•ด VLA(์‹œ๊ฐ-์–ธ์–ด-ํ–‰๋™) ๋ชจ๋ธ์„ ๋งŒ๋“ค ๋•Œ, VLM์ด ์›๋ž˜ ๊ฐ€์ง€๊ณ  ์žˆ๋˜ ๊ฐ•๋ ฅํ•œ ์‹œ๊ฐ-์–ธ์–ด ํ‘œํ˜„๋ ฅ์ด ์ €ํ•˜๋˜๋Š” ๋ฌธ์ œ๋ฅผ ์ฒด๊ณ„์ ์œผ๋กœ ๊ทœ๋ช…ํ•œ๋‹ค. ์ด๋Ÿฌํ•œ ํ‘œํ˜„๋ ฅ ์ €ํ•˜๋Š” ํŠนํžˆ ํ•™์Šต๋˜์ง€ ์•Š์€ ์ƒˆ๋กœ์šด ํ™˜๊ฒฝ(OOD)์—์„œ์˜ ์ผ๋ฐ˜ํ™” ์„ฑ๋Šฅ์„ ์‹ฌ๊ฐํ•˜๊ฒŒ ๋–จ์–ด๋œจ๋ฆฌ๋ฉฐ, ์ด ์—ฐ๊ตฌ๋Š” ๋‚ด๋ถ€ ํ‘œํ˜„ ๋ถ„์„์„ ํ†ตํ•ด ์ด๋ฅผ ์ž…์ฆํ•˜๊ณ  ํ–‰๋™ ํ•™์Šต ์ค‘์—๋„ ๊ธฐ์กด VLM์˜ ์ง€์‹์„ ๋ณด์กดํ•˜๋„๋ก ์‹œ๊ฐ์  ํ‘œํ˜„์„ ์ •๋ ฌํ•˜๋Š” ํšจ๊ณผ์ ์ธ ์™„ํ™” ์ „๋žต์„ ์ œ์•ˆํ•œ๋‹ค.

Diffusion Language Models are Super Data Learners

Paper, Project
'Diffusion Language Models are Super Data Learners'๋Š” ํ™•์‚ฐ(Diffusion) ์–ธ์–ด ๋ชจ๋ธ(DLM)์ด ์ž๊ธฐํšŒ๊ท€(AR) ๋ชจ๋ธ๋ณด๋‹ค ๋ฐ์ดํ„ฐ ํ•™์Šต ํšจ์œจ์ด ๋›ฐ์–ด๋‚จ์„ ์ž…์ฆํ•œ๋‹ค. ์ด ์—ฐ๊ตฌ๋Š” ๊ณ ์œ ํ•œ ๋ฐ์ดํ„ฐ๊ฐ€ ์ œํ•œ๋œ ํ™˜๊ฒฝ์—์„œ ๋ฐ์ดํ„ฐ๋ฅผ ๋ฐ˜๋ณต ํ•™์Šต์‹œํ‚ฌ ๊ฒฝ์šฐ, DLM์ด AR ๋ชจ๋ธ์˜ ์„ฑ๋Šฅ์„ ์ผ๊ด€๋˜๊ฒŒ ๋Šฅ๊ฐ€ํ•˜๋Š” '๊ต์ฐจ(Crossover)' ํ˜„์ƒ์„ ๋ฐœ๊ฒฌํ–ˆ์œผ๋ฉฐ, ์ด๋Š” DLM์˜ ์ˆœ์„œ ๋ฌด๊ด€(any-order) ๋ชจ๋ธ๋ง๊ณผ ๊ณ ๋ฐ€๋„ ์—ฐ์‚ฐ ํŠน์„ฑ ๋•๋ถ„์ด๋ผ๊ณ  ๋ถ„์„ํ•œ๋‹ค. ๊ฒฐ๋ก ์ ์œผ๋กœ DLM์€ ์ œํ•œ๋œ ๋ฐ์ดํ„ฐ๋ฅผ '์งœ๋‚ด์–ด' ํ•™์Šตํ•˜๋Š” ๋ฐ ๋งค์šฐ ๊ฐ•๋ ฅํ•œ '์Šˆํผ ๋ฐ์ดํ„ฐ ํ•™์Šต์ž'์ž„์„ ์‹œ์‚ฌํ•œ๋‹ค.

Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation

Paper, Project
'Every Activation Boosted'๋Š” '๋ชจ๋“  ํ™œ์„ฑํ™”๊ฐ€ ์ถ”๋ก  ๋Šฅ๋ ฅ์„ ์ฆํญ์‹œํ‚จ๋‹ค'๋Š” ์›์น™์— ๊ธฐ๋ฐ˜ํ•˜์—ฌ 160์–ต ๊ฐœ๋ถ€ํ„ฐ 1์กฐ ๊ฐœ ํŒŒ๋ผ๋ฏธํ„ฐ๊นŒ์ง€ ํ™•์žฅ๋˜๋Š” ์ถ”๋ก  ์ง€ํ–ฅ ์–ธ์–ด ๋ชจ๋ธ ์‹œ๋ฆฌ์ฆˆ Ling 2.0์„ ์ œ์•ˆํ•œ๋‹ค. ์ด ๋ชจ๋ธ์€ ๊ณ ํฌ์†Œ์„ฑ MoE ์•„ํ‚คํ…์ฒ˜๋ฅผ ์ฑ„ํƒํ•˜์—ฌ ๊ธฐ์กด ๋ด์Šค ๋ชจ๋ธ ๋Œ€๋น„ ์••๋„์ ์ธ ์—ฐ์‚ฐ ํšจ์œจ์„ ๋‹ฌ์„ฑํ–ˆ์œผ๋ฉฐ, ์ถ”๋ก  ์ง€ํ–ฅ ๋ฐ์ดํ„ฐ(CoT) ํ•™์Šต๊ณผ ๊ฐ•ํ™” ํ•™์Šต ๊ธฐ๋ฐ˜ ๋ฏธ์„ธ ์กฐ์ •, FP8 ์ธํ”„๋ผ ๋“ฑ ํ˜์‹ ์„ ํ†ตํ•ฉํ–ˆ๋‹ค. ๊ทธ ๊ฒฐ๊ณผ, 1์กฐ ํŒŒ๋ผ๋ฏธํ„ฐ ๋ชจ๋ธ์ธ Ling-1T๋Š” ์ถ”๋ก  ์ •ํ™•๋„์™€ ๊ณ„์‚ฐ ํšจ์œจ์„ฑ ์ธก๋ฉด์—์„œ ์ƒˆ๋กœ์šด ํŒŒ๋ ˆํ†  ํ”„๋ก ํ‹ฐ์–ด๋ฅผ ํ™•๋ฆฝํ•œ๋‹ค.

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

Paper, Project
'ThinkMorph'๋Š” ์ง„์ •ํ•œ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ์ถ”๋ก ์ด๋ž€ ํ…์ŠคํŠธ์™€ ์ด๋ฏธ์ง€๊ฐ€ ์ค‘๋ณต๋˜๋Š” ๊ฒƒ์ด ์•„๋‹ˆ๋ผ, ์ถ”๋ก ์„ ํ•จ๊ป˜ ๋ฐœ์ „์‹œํ‚ค๋Š” '์ƒํ˜ธ ๋ณด์™„์ ' ๊ด€๊ณ„์—ฌ์•ผ ํ•œ๋‹ค๊ณ  ์ฃผ์žฅํ•œ๋‹ค. ์ด ์›์น™์— ๋”ฐ๋ผ ๊ฐœ๋ฐœ๋œ ThinkMorph ๋ชจ๋ธ์€ 24,000๊ฐœ์˜ ๊ณ ํ’ˆ์งˆ ์ธํ„ฐ๋ฆฌ๋ธŒ๋“œ(interleaved) ์ถ”๋ก  ๋ฐ์ดํ„ฐ๋กœ ํ•™์Šตํ–ˆ์œผ๋ฉฐ, ์–ธ์–ด์  ๋…ผ๋ฆฌ๋ฅผ ์œ ์ง€ํ•˜๋ฉด์„œ ์‹œ๊ฐ์  ์ฝ˜ํ…์ธ ๋ฅผ ๊ตฌ์ฒด์ ์œผ๋กœ '์กฐ์ž‘'ํ•˜๋Š” ํ…์ŠคํŠธ-์ด๋ฏธ์ง€ ์‚ฌ๊ณ  ๋‹จ๊ณ„๋ฅผ ์ƒ์„ฑํ•œ๋‹ค. ๊ทธ ๊ฒฐ๊ณผ, ๋น„์ „ ์ค‘์‹ฌ ๋ฒค์น˜๋งˆํฌ์—์„œ ๋ฒ ์ด์Šค ๋ชจ๋ธ ๋Œ€๋น„ ํ‰๊ท  34.7%์˜ ํฐ ์„ฑ๋Šฅ ํ–ฅ์ƒ์„ ๋ณด์˜€์„ ๋ฟ๋งŒ ์•„๋‹ˆ๋ผ, ํ•™์Šตํ•˜์ง€ ์•Š์€ ์‹œ๊ฐ ์กฐ์ž‘ ๊ธฐ์ˆ ์„ ์„ ๋ณด์ด๋Š” '์ฐฝ๋ฐœ์ (emergent)' ํŠน์„ฑ์„ ๋“œ๋Ÿฌ๋‚ธ๋‹ค.

OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows

Paper
'OS-Sentinel'์€ VLM ๊ธฐ๋ฐ˜ ๋ชจ๋ฐ”์ผ GUI ์—์ด์ „ํŠธ๊ฐ€ ์‹œ์Šคํ…œ์„ ์†์ƒ์‹œํ‚ค๊ฑฐ๋‚˜ ๊ฐœ์ธ ์ •๋ณด๋ฅผ ์œ ์ถœํ•˜๋Š” ๋“ฑ์˜ '์•ˆ์ „ํ•˜์ง€ ์•Š์€ ์ž‘๋™'์„ ํƒ์ง€ํ•˜๊ธฐ ์œ„ํ•œ ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•œ๋‹ค. ์ด ์—ฐ๊ตฌ๋Š” ์‹ค์ œ์ ์ธ ์œ„ํ˜‘ ์‹œ๋‚˜๋ฆฌ์˜ค๋ฅผ ํฌํ•จํ•˜๋Š” ์ƒŒ๋“œ๋ฐ•์Šค ํ™˜๊ฒฝ 'MobileRisk-Live'๋ฅผ ๊ตฌ์ถ•ํ–ˆ์œผ๋ฉฐ, OS-Sentinel์€ ๋ช…์‹œ์ ์ธ ์‹œ์Šคํ…œ ๊ทœ์น™ ์œ„๋ฐ˜์„ ํƒ์ง€ํ•˜๋Š” '์ •ํ˜• ๊ฒ€์ฆ๊ธฐ'์™€ ๋ฌธ๋งฅ์  ์œ„ํ—˜์„ ํŒ๋‹จํ•˜๋Š” 'VLM ๊ธฐ๋ฐ˜ ๋ฌธ๋งฅ ํŒ๋‹จ๊ธฐ'๋ฅผ ๊ฒฐํ•ฉํ•œ๋‹ค. ์ด ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ์ ‘๊ทผ๋ฒ•์€ ๊ธฐ์กด ํƒ์ง€ ๋ฐฉ์‹๋ณด๋‹ค 10~30% ํ–ฅ์ƒ๋œ ์ •ํ™•๋„๋กœ ๋” ์•ˆ์ „ํ•œ ๋ชจ๋ฐ”์ผ ์—์ด์ „ํŠธ ๊ฐœ๋ฐœ์˜ ๊ธฐ๋ฐ˜์„ ๋งˆ๋ จํ•œ๋‹ค.

V-Thinker: Interactive Thinking with Images

Paper, Project
'V-Thinker'๋Š” LMM(๋Œ€ํ˜• ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๋ชจ๋ธ)์ด ์ด๋ฏธ์ง€์™€ ๊นŠ๊ฒŒ '์ƒํ˜ธ์ž‘์šฉ'ํ•˜๋ฉฐ ์žฅ๊ธฐ์  ์ถ”๋ก ์„ ์ˆ˜ํ–‰ํ•˜๋„๋ก ์„ค๊ณ„๋œ ๋ฒ”์šฉ ์ถ”๋ก  ๋ณด์กฐ ๋ชจ๋ธ์ด๋‹ค. ์ด ๋ชจ๋ธ์€ '๋ฐ์ดํ„ฐ ์ง„ํ™” ํ”Œ๋ผ์ดํœ '์„ ํ†ตํ•ด ๊ณ ํ’ˆ์งˆ์˜ ์ƒํ˜ธ์ž‘์šฉ ์ถ”๋ก  ๋ฐ์ดํ„ฐ์…‹์„ ์ž๋™์œผ๋กœ ์ƒ์„ฑ ๋ฐ ๊ฒ€์ฆํ•˜๋ฉฐ, '์‹œ๊ฐ์  ์ ์ง„์  ํ•™์Šต ์ปค๋ฆฌํ˜๋Ÿผ'์ด๋ผ๋Š” 2๋‹จ๊ณ„ ๊ฐ•ํ™” ํ•™์Šต ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ํ†ตํ•ด ์ƒํ˜ธ์ž‘์šฉ ๋Šฅ๋ ฅ์„ ํ•™์Šตํ•œ๋‹ค. V-Thinker๋Š” ์ƒˆ๋กญ๊ฒŒ ๋„์ž…๋œ VTBench ๋ฒค์น˜๋งˆํฌ์—์„œ ๊ธฐ์กด LMM ๊ธฐ๋ฐ˜ ๋ชจ๋ธ๋“ค์„ ์ผ๊ด€๋˜๊ฒŒ ๋Šฅ๊ฐ€ํ•˜๋ฉฐ, ์ด๋ฏธ์ง€ ์ƒํ˜ธ์ž‘์šฉ ์ถ”๋ก ์˜ ์ƒˆ๋กœ์šด ๊ฐ€๋Šฅ์„ฑ์„ ์ œ์‹œํ•œ๋‹ค.

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper, Project
'INT v.s. FP'๋Š” ์ตœ์‹  AI ํ•˜๋“œ์›จ์–ด๊ฐ€ ์ €์ •๋ฐ€ ๋ถ€๋™ ์†Œ์ˆ˜์ (FP) ํ˜•์‹์„ ์„ ํ˜ธํ•˜๋Š” ์ถ”์„ธ ์†์—์„œ, ์ €์ •๋ฐ€ ์ •์ˆ˜(INT)์™€ FP ํ˜•์‹์˜ ์„ฑ๋Šฅ์„ ์„ธ๋ถ„ํ™”๋œ(fine-grained) ์ˆ˜์ค€์—์„œ ์ฒด๊ณ„์ ์œผ๋กœ ๋น„๊ต ๋ถ„์„ํ•œ๋‹ค. ์ด ์—ฐ๊ตฌ๋Š” 8๋น„ํŠธ ์ˆ˜์ค€์—์„œ๋Š” ์„ธ๋ถ„ํ™”๋œ ์ •์ˆ˜ ํ˜•์‹์ธ MXINT8์ด FP๋ณด๋‹ค ์•Œ๊ณ ๋ฆฌ์ฆ˜ ์ •ํ™•๋„์™€ ํ•˜๋“œ์›จ์–ด ํšจ์œจ์„ฑ ๋ชจ๋‘์—์„œ ์šฐ์ˆ˜ํ•˜๋‹ค๋Š” '์„ฑ๋Šฅ ๊ต์ฐจ(crossover)'๋ฅผ ๋ฐํ˜€๋ƒˆ๋‹ค. 4๋น„ํŠธ์—์„œ๋Š” FP๊ฐ€ ์ข…์ข… ์šฐ์„ธํ•˜์ง€๋งŒ, ์ด๋Š” ํ•˜๋“œ์›จ์–ด ์„ค๊ณ„๊ฐ€ FP ์ผ๋ณ€๋„๋กœ ๊ฐ€๋Š” ๊ฒƒ์ด ์ตœ์ ์ด ์•„๋‹ ์ˆ˜ ์žˆ์œผ๋ฉฐ, MXINT8๊ณผ ๊ฐ™์€ ์„ธ๋ถ„ํ™”๋œ INT ํ˜•์‹์ด ๋” ๋‚˜์€ ๊ท ํ˜•์ ์„ ์ œ๊ณตํ•  ์ˆ˜ ์žˆ์Œ์„ ์‹œ์‚ฌํ•œ๋‹ค.

World Simulation with Video Foundation Models for Physical AI

Paper, Project
'World Simulation with Video Foundation Models for Physical AI'๋Š” Physical AI ์—ฐ๊ตฌ๋ฅผ ์œ„ํ•œ ์ฐจ์„ธ๋Œ€ ์›”๋“œ ํŒŒ์šด๋ฐ์ด์…˜ ๋ชจ๋ธ [Cosmos-Predict2.5]๋ฅผ ์†Œ๊ฐœํ•œ๋‹ค. ์ด ๋ชจ๋ธ์€ Flow ๊ธฐ๋ฐ˜ ์•„ํ‚คํ…์ฒ˜๋ฅผ ํ†ตํ•ด ํ…์ŠคํŠธ, ์ด๋ฏธ์ง€, ๋น„๋””์˜ค๋กœ๋ถ€ํ„ฐ ์ผ๊ด€๋œ ์›”๋“œ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ์ƒ์„ฑ์„ ๋‹จ์ผ ๋ชจ๋ธ๋กœ ํ†ตํ•ฉํ–ˆ์œผ๋ฉฐ, 2์–ต ๊ฐœ์˜ ๋น„๋””์˜ค ๋ฐ์ดํ„ฐ์™€ ๊ฐ•ํ™” ํ•™์Šต์œผ๋กœ ์ •์ œ๋˜์–ด ๋น„๋””์˜ค ํ’ˆ์งˆ๊ณผ ๋ช…๋ น์–ด ์ •๋ ฌ ๋Šฅ๋ ฅ์ด ํฌ๊ฒŒ ํ–ฅ์ƒ๋˜์—ˆ๋‹ค. ๋˜ํ•œ, Sim2Real ๋ณ€ํ™˜์„ ์œ„ํ•œ [Cosmos-Transfer2.5]์™€ ํ•จ๊ป˜ ๊ณต๊ฐœ๋˜์–ด ๋กœ๋ณดํ‹ฑ์Šค ๋ฐ ์ž์œจ ์‹œ์Šคํ…œ์„ ์œ„ํ•œ ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ ์ƒ์„ฑ๊ณผ ์ •์ฑ… ํ‰๊ฐ€๋ฅผ ๊ฐ€์†ํ™”ํ•œ๋‹ค.

profile
XR๊ณผ AI์— ๊ด€์‹ฌ์ด ๋งŽ์€ Sky ์ž…๋‹ˆ๋‹ค.

0๊ฐœ์˜ ๋Œ“๊ธ€