๐Ÿ“„ PaperBytes

Weekly AI Papers โ€” 2026-05-07

๐Ÿ“„ 3ํŽธ ๐Ÿ›๏ธ ๋น…ํ…Œํฌ 1ํŽธ ๐Ÿ”ฅ ํŠธ๋ Œ๋”ฉ 1ํŽธ
1
๐Ÿ”ฅ ํŠธ๋ Œ๋”ฉ 260+
Allen Institute for AI

๐Ÿค– GPT-5์™€ Gemini Robotics๋ฅผ ๊บพ์€ ์™„์ „ ์˜คํ”ˆ ๋กœ๋ด‡ ํ–‰๋™ ์ถ”๋ก  ๋ชจ๋ธ

MolmoAct2: Action Reasoning Models for Real-world Deployment

๐Ÿ›๏ธ ์†Œ์†: Allen Institute for AI (Ai2)

๐Ÿท๏ธ ํ•ต์‹ฌ ํ‚ค์›Œ๋“œ: Vision-Language-Action, Robot Deployment, Action Reasoning, Open Source

๐Ÿ’ญ ์ด๋Ÿฐ ์งˆ๋ฌธ์„ ํ•ด๋ณธ ์  ์žˆ๋‚˜์š”?

"VLA ๋กœ๋ด‡ ๋ชจ๋ธ์ด ๋น„์‹ผ ํ•˜๋“œ์›จ์–ด ์—†์ด, ์ง€๊ธˆ ๋‹น์žฅ ์‹ค์ œ ํ˜„์žฅ์— ๋ฐฐํฌ๋  ์ˆ˜ ์žˆ์„๊นŒ์š”?"

๊ธฐ์กด VLA ๋ชจ๋ธ๋“ค์€ ํด๋กœ์ฆˆ๋“œ์†Œ์Šค์ด๊ฑฐ๋‚˜, ๋น„์‹ผ ํ•˜๋“œ์›จ์–ด์— ๋ฌถ์—ฌ ์žˆ๊ฑฐ๋‚˜, ์ถ”๋ก  ์‹œ ์ง€์—ฐ์ด ๋„ˆ๋ฌด ํฌ๋‹ค๋Š” ๋ฌธ์ œ๊ฐ€ ์žˆ์—ˆ์Šต๋‹ˆ๋‹ค. MolmoAct2๋Š” ์ด ์„ธ ๊ฐ€์ง€๋ฅผ ๋™์‹œ์— ํ•ด๊ฒฐํ•œ ์™„์ „ ์˜คํ”ˆ์†Œ์Šค ํ–‰๋™ ์ถ”๋ก  ๋ชจ๋ธ๋กœ, ๊ฐ€์ค‘์น˜ยทํ•™์Šต ์ฝ”๋“œยท๋ฐ์ดํ„ฐ๋ฅผ ๋ชจ๋‘ ๊ณต๊ฐœํ–ˆ์Šต๋‹ˆ๋‹ค. ํ•ต์‹ฌ ํ˜์‹ ์ธ MolmoThink๋Š” ์žฅ๋ฉด ๋ณ€ํ™”๊ฐ€ ์ƒ๊ธด ์˜์—ญ๋งŒ ์žฌ์˜ˆ์ธกํ•ด ๊ธฐ์กด ์ถ”๋ก  ๋Œ€๋น„ ์ง€์—ฐ์„ ๋Œ€ํญ ๋‹จ์ถ•ํ•ฉ๋‹ˆ๋‹ค.

ํŠนํžˆ ์ฃผ๋ชฉํ•  ์ :

  • 7๊ฐœ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ยท์‹ค์„ธ๊ณ„ ๋ฒค์น˜๋งˆํฌ์—์„œ Pi-05 ์ƒํšŒ
  • MolmoER, 13๊ฐœ ๊ตฌํ˜„ ์ถ”๋ก  ๋ฒค์น˜๋งˆํฌ์—์„œ GPT-5ยทGemini Robotics ER-1.5 ๋Šฅ๊ฐ€
  • 720์‹œ๊ฐ„ ์–‘์† ์กฐ์ž‘ ๊ถค์  ๋ฐ์ดํ„ฐ์…‹(์—ญ๋Œ€ ์ตœ๋Œ€ ์˜คํ”ˆ ์–‘์† ๋ฐ์ดํ„ฐ์…‹) ๊ณต๊ฐœ
  • 3.3M ์ƒ˜ํ”Œ ๊ณต๊ฐ„ยท์ฒดํ™” ์ถ”๋ก  ํŠนํ™” ํ•™์Šต์œผ๋กœ 5๊ฐœ embodiment ์ง€์›

๐ŸŽฏ ์™œ ์ด๊ฒƒ์ด ๊ฒŒ์ž„ ์ฒด์ธ์ €์ธ๊ฐ€?

์ด์ „: ์‹ค์šฉ์  VLA = ํด๋กœ์ฆˆ๋“œ ์†Œ์Šค ๋…์  ๋ชจ๋ธ ์˜์กด โ†’ ์ดํ›„: ์™„์ „ ์˜คํ”ˆ ๋ชจ๋ธ์ด GPT-5๊ธ‰ ์„ฑ๋Šฅ, ์ €๊ฐ€ ํ•˜๋“œ์›จ์–ด์—์„œ ์‹ค์‹œ๊ฐ„ ๋ฐฐํฌ ๊ฐ€๋Šฅ

๋…ผ๋ฌธ ๋ณด๊ธฐ โ†’ Haoquan Fang, Jiafei Duan, Donovan Clay ์™ธ 2๋ช…
2
Shanghai Jiao Tong University

๐Ÿ”ฌ ์ž ๋“  ์‚ฌ์ด ๋…ผ๋ฌธ์„ ์“ฐ๋Š” AI โ€” ์ ๋Œ€์  ๋‹ค์ค‘ ์—์ด์ „ํŠธ ์ž์œจ ์—ฐ๊ตฌ ์‹œ์Šคํ…œ

ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration

๐Ÿ›๏ธ ์†Œ์†: Shanghai Jiao Tong University

๐Ÿท๏ธ ํ•ต์‹ฌ ํ‚ค์›Œ๋“œ: Autonomous Research, Multi-Agent, Adversarial Collaboration, LLM Harness

๐Ÿ’ญ ์ด๋Ÿฐ ์งˆ๋ฌธ์„ ํ•ด๋ณธ ์  ์žˆ๋‚˜์š”?

"LLM ์—์ด์ „ํŠธ๊ฐ€ ๋ฐค์ƒˆ ํ˜ผ์ž ์‹คํ—˜ํ•˜๊ณ , ๋…ผ๋ฌธ๊นŒ์ง€ ๊ฒ€์ฆํ•ด์„œ ์™„์„ฑํ•  ์ˆ˜ ์žˆ์„๊นŒ์š”?"

ARIS๋Š” Executor ๋ชจ๋ธ์ด ์—ฐ๊ตฌ๋ฅผ ์ง„ํ–‰ํ•˜๋ฉด, ๋‹ค๋ฅธ ๋ชจ๋ธ ํŒจ๋ฐ€๋ฆฌ์˜ Reviewer๊ฐ€ ์ ๋Œ€์ ์œผ๋กœ ์ค‘๊ฐ„ ์‚ฐ์ถœ๋ฌผ์„ ๋น„ํŒํ•˜๊ณ  ์ˆ˜์ •์„ ์š”์ฒญํ•˜๋Š” ๊ต์ฐจ ๋ชจ๋ธ ํ˜‘์—… ๊ตฌ์กฐ๋ฅผ ์ฑ„ํƒํ•ฉ๋‹ˆ๋‹ค. ์—ฐ๊ตฌ ๊ฒฐ๊ณผ ํ—ˆ์œ„ ์ฃผ์žฅ์„ ๋ง‰๊ธฐ ์œ„ํ•œ 3๋‹จ๊ณ„ ์ฆ๊ฑฐ ๊ฒ€์ฆ(๋ฌด๊ฒฐ์„ฑ ํ™•์ธ โ†’ ๊ฒฐ๊ณผ-์ฃผ์žฅ ๋งคํ•‘ โ†’ ์ฃผ์žฅ ๊ฐ์‚ฌ)๊ณผ 5๋‹จ๊ณ„ ๊ณผํ•™์  ํŽธ์ง‘ ํŒŒ์ดํ”„๋ผ์ธ์„ ๋‚ด์žฅํ•ฉ๋‹ˆ๋‹ค.

ํŠนํžˆ ์ฃผ๋ชฉํ•  ์ :

  • 65๊ฐœ ์ด์ƒ์˜ ์žฌ์‚ฌ์šฉ ๊ฐ€๋Šฅํ•œ Markdown ์ •์˜ ์Šคํ‚ฌ๋กœ ์—ฐ๊ตฌ ์›Œํฌํ”Œ๋กœ ์ž๋™ํ™”
  • ์‹คํ—˜ ์ฃผ์žฅ๊ณผ raw ์ฆ๊ฑฐ๋ฅผ ๊ต์ฐจ ๊ฒ€์ฆํ•˜๋Š” Assurance Layer๋กœ ํ—ˆ์œ„ ์„ฑ๊ณต ์ฐจ๋‹จ
  • GitHub 8,389 ์Šคํƒ€ โ€” ๊ณต๊ฐœ ์งํ›„ ํญ๋ฐœ์  ๊ด€์‹ฌ
  • MCP ๊ธฐ๋ฐ˜ ๋ชจ๋ธ ํ†ตํ•ฉ, ๋ฐ˜๋ณต ์žฌ์‚ฌ์šฉ์„ ์œ„ํ•œ ์˜์†์  ์—ฐ๊ตฌ ์œ„ํ‚ค ํฌํ•จ

๐ŸŽฏ ์™œ ์ด๊ฒƒ์ด ๊ฒŒ์ž„ ์ฒด์ธ์ €์ธ๊ฐ€?

์ด์ „: AI ์—ฐ๊ตฌ ์ž๋™ํ™” = ํ™˜๊ฐยท๋ฏธ๊ฒ€์ฆ ์ฃผ์žฅ ์œ„ํ—˜ ์ƒ์กด โ†’ ์ดํ›„: ์ ๋Œ€์  ๋ฆฌ๋ทฐ์–ด + 3๋‹จ๊ณ„ ์ฆ๊ฑฐ ๊ฐ์‚ฌ๋กœ ์ž์œจ ์—ฐ๊ตฌ์˜ ์‹ ๋ขฐ์„ฑ ๋ฌธ์ œ ์ •๋ฉด ๋ŒํŒŒ

๋…ผ๋ฌธ ๋ณด๊ธฐ โ†’ Ruofeng Yang, Yongcan Li, Shuai Li
3
๐Ÿ›๏ธ ๋น…ํ…Œํฌ
Tencent Hunyuan

๐Ÿ” ์ƒ์—…์šฉ ๋…์  ๋ชจ๋ธ๊ณผ ์–ด๊นจ๋ฅผ ๋‚˜๋ž€ํžˆ ํ•œ ์™„์ „ ์˜คํ”ˆ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๋”ฅ์„œ์น˜ ์—์ด์ „ํŠธ

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

๐Ÿ›๏ธ ์†Œ์†: Tencent Hunyuan (๋น…ํ…Œํฌ)

๐Ÿท๏ธ ํ•ต์‹ฌ ํ‚ค์›Œ๋“œ: Multimodal Search, Agentic RL, GRPO, Deep Search, Open Source

๐Ÿ’ญ ์ด๋Ÿฐ ์งˆ๋ฌธ์„ ํ•ด๋ณธ ์  ์žˆ๋‚˜์š”?

"๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๋”ฅ์„œ์น˜ ์—์ด์ „ํŠธ๋ฅผ ์˜คํ”ˆ์†Œ์Šค๋กœ ์ตœ์ „์„  ์ˆ˜์ค€๊นŒ์ง€ ๋Œ์–ด์˜ฌ๋ฆด ์ˆ˜ ์žˆ์„๊นŒ์š”?"

์ตœ์ฒจ๋‹จ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๊ฒ€์ƒ‰ ์—์ด์ „ํŠธ๋Š” ์žฌํ˜„์ด ์–ด๋ ค์› ๋Š”๋ฐ, ๊ณ ํ’ˆ์งˆ ํ•™์Šต ๋ฐ์ดํ„ฐยทํŒŒ์ดํ”„๋ผ์ธยท๋ ˆ์‹œํ”ผ๊ฐ€ ๊ณต๊ฐœ๋˜์ง€ ์•Š์•˜๊ธฐ ๋•Œ๋ฌธ์ž…๋‹ˆ๋‹ค. OpenSearch-VL์€ ํ…์ŠคํŠธ ๊ฒ€์ƒ‰, ์ด๋ฏธ์ง€ ๊ฒ€์ƒ‰, OCR, ํฌ๋กญ, ์ดˆํ•ด์ƒ๋„ ๋“ฑ ๋‹ค์–‘ํ•œ ๋„๊ตฌ๋ฅผ ํ†ตํ•ฉํ•˜๊ณ , ๋„๊ตฌ ์‹คํŒจ ์—ฐ์‡„๋ฅผ ์ฒ˜๋ฆฌํ•˜๋Š” Multi-turn Fatal-aware GRPO๋ฅผ ์ œ์•ˆํ•ด ์ด ๋ฌธ์ œ๋ฅผ ์ •๋ฉด ๋ŒํŒŒํ•ฉ๋‹ˆ๋‹ค.

ํŠนํžˆ ์ฃผ๋ชฉํ•  ์ :

  • 7๊ฐœ ๋ฒค์น˜๋งˆํฌ์—์„œ ํ‰๊ท  10์  ์ด์ƒ ํ–ฅ์ƒ
  • ์ผ๋ถ€ ํƒœ์Šคํฌ์—์„œ ๋…์  ์ƒ์—… ๋ชจ๋ธ๊ณผ ๋™๋“ฑํ•œ ์ˆ˜์ค€ ๋‹ฌ์„ฑ
  • SFT์šฉ SearchVL-SFT-36k, RL์šฉ SearchVL-RL-8k ๋ฐ์ดํ„ฐ์…‹ ์ „๋Ÿ‰ ๊ณต๊ฐœ
  • Wikipedia ๊ฒฝ๋กœ ์ƒ˜ํ”Œ๋ง + ํผ์ง€ ์—”ํ„ฐํ‹ฐ ์žฌ์ž‘์„ฑ์œผ๋กœ ๋‹จ์ถ•ํ‚คยท์›์Šคํ… ๋ถ•๊ดด ๋ฐฉ์ง€

๐ŸŽฏ ์™œ ์ด๊ฒƒ์ด ๊ฒŒ์ž„ ์ฒด์ธ์ €์ธ๊ฐ€?

์ด์ „: ์ตœ์ „์„  ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๊ฒ€์ƒ‰ ์—์ด์ „ํŠธ = ์žฌํ˜„ ๋ถˆ๊ฐ€ ๋ธ”๋ž™๋ฐ•์Šค โ†’ ์ดํ›„: ๋ฐ์ดํ„ฐยท์ฝ”๋“œยท๋ชจ๋ธ ์ „๋Ÿ‰ ๊ณต๊ฐœ, ๋ˆ„๊ตฌ๋‚˜ GPT๊ธ‰ ๊ฒ€์ƒ‰ ์—์ด์ „ํŠธ๋ฅผ ์ง์ ‘ ํ•™์Šต ๊ฐ€๋Šฅ

๋…ผ๋ฌธ ๋ณด๊ธฐ โ†’ Shuang Chen, Kaituo Feng, Hangting Chen ์™ธ 2๋ช…

โœ‰๏ธ

๋งค์ผ ๋ฐ›์•„๋ณด์„ธ์š”

AI ๋ฐ์ผ๋ฆฌ ๋‰ด์Šค ยท ๋…ผ๋ฌธ ยท GitHub ํŠธ๋ Œ๋“œ๋ฅผ ๋งค์ผ ํ•œ๊ตญ์–ด๋กœ ์ •๋ฆฌํ•ด ๋ณด๋‚ด๋“œ๋ฆฝ๋‹ˆ๋‹ค.

์ŠคํŒธ ์—†์Œ ยท ์–ธ์ œ๋“  ๊ตฌ๋…์ทจ์†Œ ๊ฐ€๋Šฅ