💡The AI Scientist

oceann · August 30, 2024

Image source: Sakana AI

Let an ultraintelligent machine be defined as a machine that can far surpass all the intellectual activities of any man, however clever. Since the design of machines is one of these intellectual activities, an ultraintelligent machine could design even better machines; there would then unquestionably be an ‘intelligence explosion,’ and the intelligence of man would be left far behind. Thus the first ultraintelligent machine is the last invention that man need ever make, provided that the machine is docile enough to tell us how to keep it under control.
- I. J. Good, 1965


Invention and Discovery, AGI

Invention and Discovery

According to the Standard Korean Language Dictionary, invention means thinking up and creating a technology or object that did not exist before, while discovery means finding an object, phenomenon, or fact that had not yet been found or was not yet known.
Edison's light bulb and Einstein's theory of relativity are examples of a great invention and a great discovery, respectively. Humans invent and discover out of curiosity, building on knowledge they have acquired through study.
One result is the AI that is so popular today. Can AI itself invent and discover?

AGI, Artificial General Intelligence

AGI, Artificial General Intelligence, is a research field that aims to advance AI to the point where it has intellectual abilities similar to a human's and can learn on its own. Leopold Aschenbrenner, who used to work at OpenAI, wrote a series on AGI (or is it a book?) called Situational Awareness, and the figure below, taken from it, illustrates an explosion in the level of knowledge.

Source: situational-awareness.ai

The figure predicts that, taking the arrival of GPT-4 as the starting point, knowledge will grow rapidly around 2027 because of Automated AI Research.
That said, putting everything before GPT-4 below 10^0 stings a little..? lol


The AI Scientist

Sakana AI, a lab in Japan, has developed a program whose work comes closest to what an ideal AGI would do.
The full paper surely contains more detail, but since I have not read it yet, this review is based on the official website.

Introduction

Existing models have been developed and improved to assist human work, but they still need people to operate them, and specializing them for a particular task also requires expert knowledge. The AI Scientist removes these drawbacks and can do the following.

1. Without help from experts, it settles on a research topic by itself and automates the entire research lifecycle: experiment design, hypothesis testing, review, and paper writing.
2. Through an automated peer review process, it checks the papers it writes, exchanges feedback, and improves them. The automated reviewer evaluates papers with accuracy close to that of a human.
3. It reuses the knowledge gained from the research process and its results to carry on with the next study. This resembles how the human scientific community works.

Several papers are posted as example results. Let's look at Figure 1 of one of them, Dualscale Diffusion: Adaptive Feature Balancing for Low-Dimensional Generative Models.

As the figure shows, the paper includes the results of experiments run with the method it proposes. Papers generated by The AI Scientist are written in LaTeX, and in the paper above you can see that the equations are also properly typeset.
On top of that, the whole process described above costs only about $15 per paper.

Overview of The AI Scientist

The AI Scientist carries out research in the following steps.

Idea Generation
Given a starting template that contains material on the desired research topic, it begins brainstorming. The template includes a LaTeX folder for writing the paper. To settle on a topic it looks through existing studies, and in this work Semantic Scholar was chosen as the search target so that ideas are checked against the literature.

Experimental Iteration
Using the idea from the Idea Generation step and the template, it designs and runs experiments. It then visualizes the results, writing a description of each plot and saving the images it needs.

Paper Write-up
It writes up the research as a paper in LaTeX and uses Semantic Scholar to cite relevant papers.

Automated Paper Reviewing
์ƒ์„ฑํ•œ ๋…ผ๋ฌธ์— ๋Œ€ํ•ด ์ธ๊ฐ„๊ณผ ๋น„์Šทํ•œ ์ˆ˜์ค€์—์„œ ํ‰๊ฐ€ํ•  ์ˆ˜ ์žˆ๋‹ค. ํ•ด๋‹น ๋…ผ๋ฌธ์„ ๊ฐœ์„ ํ•˜๊ธฐ ์œ„ํ•œ ๋ฆฌ๋ทฐ๋ฅผ ์ƒ์„ฑํ•  ์ˆ˜๋„ ์žˆ๊ณ , ์ดํ›„ ์—ฐ๊ตฌ์— ํ™œ์šฉํ•  ์ˆ˜ ์žˆ๋Š” ๋ฆฌ๋ทฐ๋ฅผ ์ƒ์„œ์•Œ ์ˆ˜๋„ ์žˆ๋‹ค. ์ดํ›„์— ํ™œ์šฉํ•  ์ˆ˜ ์žˆ๋Š” ๋ฆฌ๋ทฐ๋ฅผ ํ†ตํ•ด ์œ„ ๊ณผ์ •์ด ๋ฐ˜๋ณต์ ์œผ๋กœ ์ˆ˜ํ–‰๋˜๋ฉฐ, ๋์—†๋Š” ์—ฐ๊ตฌ๋ฅผ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ๊ฒŒ ๋œ๋‹ค.

Example Papers Generated by The AI Scientist

Below are the results of research The AI Scientist carried out on three topics, Diffusion Modeling, Language Modeling, and Grokking, based on the information supplied in each template.
The code is stored in The AI Scientist's GitHub repository. There you can check which template was given as input, as well as the code The AI Scientist wrote for the experiments, the experimental results, and the papers.

Diffusion Modeling
Dualscale Diffusion: Adaptive Feature Balancing for Low-Dimensional Generative Models pdf, code

Language Modeling
Stylefusion: Adaptive Multi-Style Generation in Character-Level Language Models pdf, code

Adaptive Learning Rates for Transformers via Q-Learning pdf, code

Grokking
Unlocking Grokking: A Comparative Study of Weight Initialization Strategies in Transformer Models pdf, code

๋…ผ๋ฌธ์„ ๋ณด๋ฉด ์•Œ ์ˆ˜ ์žˆ๋“ฏ์ด ํ˜•์‹์ด ์ž˜ ๊ฐ–์ถฐ์ ธ ์žˆ์„ ๋ฟ๋งŒ ์•„๋‹ˆ๋ผ LaTeX ๋ฌธ๋ฒ•์„ ์ค€์ˆ˜ํ•˜๋ฉฐ, citation๊นŒ์ง€ ๋ช…ํ™•ํžˆ ์จ๋†“์€ ๊ฒƒ์„ ์•Œ ์ˆ˜ ์žˆ๋‹ค.

Limitations and Challenges

  1. It cannot perform vision tasks, so it cannot read vision-related content supplied in a template. Multi-modal models could solve this.
  2. The AI Scientist is not perfect, so it can produce flawed ideas relative to the baseline and reason unsoundly.
  3. It can make serious errors when producing or evaluating results. For example, it sometimes struggles to compare two numbers, a well-known chronic problem of LLMs.

The authors say these problems can be resolved as AI advances, by using multi-modal models or by upgrading The AI Scientist.
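One way to sidestep the number-comparison problem, independent of model progress, is to compare reported metrics in code instead of asking the LLM to judge them. The helper below is a hypothetical sketch (the name `is_improvement` and its parameters are assumptions for illustration, not part of The AI Scientist). It parses the values as decimals so that a pair like 9.11 and 9.9 is ordered numerically.

```python
from decimal import Decimal


def is_improvement(baseline: str, candidate: str, lower_is_better: bool = True) -> bool:
    """Return True if the candidate metric strictly beats the baseline.

    Values are passed as strings (as they might appear in a results file)
    and parsed with Decimal, so the comparison is numeric, not textual.
    """
    b, c = Decimal(baseline), Decimal(candidate)
    return c < b if lower_is_better else c > b
```

For a loss-like metric, `is_improvement("9.9", "9.11")` is true because 9.11 is smaller than 9.9; with `lower_is_better=False` the same pair counts as a regression.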

The AI Scientist Bloopers

During its research runs, it once wrote code that launched itself and ended up in an endless loop, and in another case its experiments ran so long that they hit the timeout. Instead of making the code run faster, it simply edited its own code to extend the timeout. The post notes that this needs further discussion and fixing.
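The timeout blooper suggests enforcing the limit from outside the generated code, where that code cannot edit it. The following is a minimal sketch under that assumption; `run_with_timeout` is a hypothetical helper, not the project's actual runner, and it is not a sandbox: it stops only the direct child process, so a script that spawns copies of itself would still need real isolation.

```python
import subprocess
import sys


def run_with_timeout(script_source: str, timeout_s: float) -> str:
    """Run Python source in a child process with a limit the child cannot change.

    Returns "ok", "error" (non-zero exit code), or "timeout".
    """
    try:
        proc = subprocess.run(
            [sys.executable, "-c", script_source],
            capture_output=True,
            text=True,
            timeout=timeout_s,  # enforced by the parent, not by the script
        )
    except subprocess.TimeoutExpired:
        # subprocess.run kills the child before raising this exception.
        return "timeout"
    return "ok" if proc.returncode == 0 else "error"
```

Because the limit lives in the parent process, an experiment script that rewrites its own timeout constant has no effect on when it is stopped.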

Future Implications of The AI Scientist

As with any newly developed technology, new issues will have to be dealt with.

Ethical Considerations
If The AI Scientist is used to churn out papers indiscriminately and submit them to conferences or journals, a range of problems can follow. Reviewers' workload may increase, and as the volume grows, filtering becomes harder and the quality of scientific knowledge may suffer. As with image generation, issues such as copyright and loss of value may also arise.
If the Automated Reviewer is released and used online, review quality may drop and unintended bias may be introduced into the evaluation of papers. For this reason it is proposed that AI-generated reviews be marked as such.
Like any other technology, it can also be used unethically. It could conduct research on its own without humans noticing and develop a virus harmful to humans, or a computer virus.

Open Models
The project used state-of-the-art LLMs such as GPT-4o and Sonnet. Open models such as DeepSeek and Llama-3 were also tried, but Sonnet produced the best results for paper generation. The authors say the ultimate goal is a version of The AI Scientist that is not tied to any particular model provider.

The Role of a Scientist
๊ถ๊ทน์ ์œผ๋กœ The AI Scientist๊ฐ€ ์—ฐ๊ตฌ์˜ ๋ผ์ดํ”„์‚ฌ์ดํด์„ ์ „๋ถ€ ๋Œ€์ฒดํ• ์ง€๋ผ๋„, ์‹ค์ œ ๊ณผํ•™์ž๋“ค์˜ ์—ญํ• ์€ ๋‹ค๋ฅธ ์˜์—ญ์œผ๋กœ ์ด์ „๋  ๋ฟ ์‚ฌ๋ผ์ง€์ง€ ์•Š์„ ๊ฒƒ์ด๋‹ค.

Thoughts on the Review

Because the system takes the best-performing state-of-the-art LLMs available so far and sets them up to carry out the research lifecycle as a task, it seems more accurate to say that it provides a framework opening up a new direction for the AI field than that it develops a model of its own. Still, LLMs have been applied to so many tasks since they appeared that people often call their purpose vague, and this is clearly a step toward making that purpose concrete.
After talking it over with my teammates, we noticed the following issues and wondered how they will be solved.

1. How will it deal with hallucination, which is regarded as the biggest problem with current LLMs?
2. When it builds on existing research, can it tell inaccurate data apart from accurate data?
3. When it decides which knowledge it needs, can it rank competing pieces of knowledge? For example, Einstein's theory of relativity vs. Newton's classical mechanics.

Sources and References
AGI Situational Awareness
The AI Scientist official page, paper
The AI Scientist Code GitHub
