๐Ÿ“„ PaperBytes

Weekly AI Papers โ€” 2026-05-18

๐Ÿ“„ 10ํŽธ ๐Ÿ›๏ธ ๋น…ํ…Œํฌ 10ํŽธ
1
๐Ÿ›๏ธ ๋น…ํ…Œํฌ
NVIDIA

๐Ÿง  โ€œ๋น„์ฃผ์–ผ ์ฆ๊ฑฐ ์—†์œผ๋ฉด 2% ์ •๋‹ต๋ฅ ? ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๊ธฐ์–ต์˜ ์ง„์งœ ํž˜์„ ๋ณด์—ฌ์ฃผ๋Š” ํ…Œ์ŠคํŠธโ€

MemLens: Benchmarking Multimodal Long-Term Memory in Large Vision-Language Models

๐Ÿ›๏ธ ์†Œ์†: NVIDIA (๋น…ํ…Œํฌ)

๐Ÿท๏ธ ํ•ต์‹ฌ ํ‚ค์›Œ๋“œ: multimodal memory, long-term reasoning, vision-language models, memory-augmented agents, cross-modal benchmark

๐Ÿ’ญ ์ด๋Ÿฐ ์งˆ๋ฌธ์„ ํ•ด๋ณธ ์  ์žˆ๋‚˜์š”?

  • โ€œ์ด์ „ ๋Œ€ํ™”์—์„œ ๋งํ•œ ๊ทธ ์ด๋ฏธ์ง€, ์ง€๊ธˆ ๋‹ค์‹œ ๋ณด์—ฌ์ค˜์•ผ ํ•  ๋•Œ ์–ด๋–ป๊ฒŒ ๊ธฐ์–ตํ•ด?โ€
  • โ€œ์‚ฌ์ง„์ด ์—†์–ด๋„ ์ด์ „ ๋Œ€ํ™” ๋‚ด์šฉ์„ ๋ฐ”ํƒ•์œผ๋กœ ์ •๋‹ต์„ ๋„์ถœํ•  ์ˆ˜ ์žˆ์„๊นŒ?โ€
  • โ€œ์‚ฌ์ง„์ด ์žˆ์œผ๋ฉด ์ •๋‹ต๋ฅ  80.4%์ธ๋ฐ, ์‚ฌ์ง„ ์—†์œผ๋ฉด 2%? ์ด๊ฑด ์ง„์งœ ๋†€๋ผ์šด ์ฐจ์ด์•ผ.โ€

[ํ•ต์‹ฌ ์„ค๋ช…: ๊ธฐ์กด์—๋Š” ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ์žฅ๊ธฐ ๊ธฐ์–ต์„ โ€˜๊ธด ๋ฌธ๋งฅ ๋ชจ๋ธโ€™๊ณผ โ€˜๋ฉ”๋ชจ๋ฆฌ ๋ณด๊ฐ• ์—์ด์ „ํŠธโ€™ ๋‘ ๋ฐฉ์‹์œผ๋กœ ๋‚˜๋ˆ  ๋น„๊ตํ–ˆ์ง€๋งŒ, ์ด ๋…ผ๋ฌธ์€ ์ฒซ ๋ฒˆ์งธ๋กœ โ€˜์ง„์งœ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ์ฆ๊ฑฐ๊ฐ€ ํ•„์š”ํ•œ ์งˆ๋ฌธโ€™์„ ๊ธฐ์ค€์œผ๋กœ ๋‘ ๋ฐฉ์‹์„ ์ฒด๊ณ„์ ์œผ๋กœ ๋น„๊ตํ•ด๋ƒˆ๋‹ค.]

ํŠนํžˆ ์ฃผ๋ชฉํ•  ์ :

  • 789๊ฐœ์˜ ์งˆ๋ฌธ ์ค‘ 80.4%๋Š” ์ด๋ฏธ์ง€ ์ฆ๊ฑฐ๊ฐ€ ํ•„์š”ํ–ˆ๊ณ , ์ด๋ฅผ ์ œ๊ฑฐํ•˜๋ฉด ์ตœ๊ณ  ์„ฑ๋Šฅ ๋ชจ๋ธ๋„ ์ •๋‹ต๋ฅ  2%๋กœ ํญ๋ฝ
  • 27๊ฐœ์˜ LVLM๊ณผ 7๊ฐœ์˜ ๋ฉ”๋ชจ๋ฆฌ ์—์ด์ „ํŠธ๋ฅผ ํ‰๊ฐ€ํ•ด, ๋‹ค์ค‘ ์„ธ์…˜ ์ถ”๋ก  ์„ฑ๋Šฅ์€ ๋Œ€๋ถ€๋ถ„ 30% ์ดํ•˜๋กœ ์ œํ•œ๋จ

๐ŸŽฏ ์™œ ์ด๊ฒƒ์ด ๊ฒŒ์ž„ ์ฒด์ธ์ €์ธ๊ฐ€? :

โ€œ๋‹จ์ˆœํžˆ ๊ธด ๋ฌธ๋งฅ์„ ์ฒ˜๋ฆฌํ•˜๋Š” ๋ชจ๋ธ โ†’ ์‹œ๊ฐ ์ฆ๊ฑฐ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•œ ๊ตฌ์กฐํ™”๋œ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๊ฒ€์ƒ‰ + ์žฅ๊ธฐ ๊ธฐ์–ต ๊ฒฐํ•ฉโ€

โ†’ โ€œ๋‹ค์ค‘ ์„ธ์…˜ ๋Œ€ํ™”์—์„œ ์‹œ๊ฐ ์ •๋ณด๋ฅผ ์žƒ์ง€ ์•Š์œผ๋ฉด์„œ๋„ ์žฅ๊ธฐ ๊ธฐ์–ต์„ ์œ ์ง€ํ•˜๋Š” ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ์•„ํ‚คํ…์ฒ˜๋กœ ์ง„ํ™”โ€

2
๐Ÿ›๏ธ ๋น…ํ…Œํฌ
NVIDIA

๐ŸŽฌ "์˜์ƒ ์ƒ์„ฑ์ด โ€˜๋ช‡ ์Šคํ…โ€™์— ๊ตญํ•œ๋๋‹ค๋ฉด, ์ด์ œ๋Š” โ€˜์–ธ์ œ๋“  ์Šคํ…โ€™์œผ๋กœ ์ž์œ ๋กญ๊ฒŒ ์กฐ์ •ํ•  ์ˆ˜ ์žˆ๋‹ค!"

AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation

๐Ÿ›๏ธ ์†Œ์†: NVIDIA (๋น…ํ…Œํฌ)

๐Ÿท๏ธ ํ•ต์‹ฌ ํ‚ค์›Œ๋“œ: video diffusion, flow map distillation, on-policy learning, any-step generation, ODE sampling

๐Ÿ’ญ ์ด๋Ÿฐ ์งˆ๋ฌธ์„ ํ•ด๋ณธ ์  ์žˆ๋‚˜์š”?

  • โ€œ๋ช‡ ์Šคํ…๋งŒ์œผ๋กœ ์˜์ƒ ์ƒ์„ฑ์ด ๋๋‚˜๋Š” ๋ชจ๋ธ์ด ์™œ ๋” ๋‚˜์€๊ฐ€์š”?โ€
  • โ€œ์ƒ˜ํ”Œ๋ง ์Šคํ…์ด ๋งŽ์•„์ง€๋ฉด ์„ฑ๋Šฅ์ด ๋–จ์–ด์ง€๋Š” ๋ชจ๋ธ, ๊ทธ๊ฒŒ ์™œ ๋ฌธ์ œ์ธ๊ฐ€์š”?โ€
  • โ€œODER๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•œ ๋ชจ๋ธ์ด โ€˜๋ชจ๋“  ์Šคํ…โ€™์— ์œ ์—ฐํ•˜๊ฒŒ ๋Œ€์‘ํ•  ์ˆ˜ ์žˆ๋‹ค๋ฉด, ์–ด๋–ค ํ˜์‹ ์ด ์ƒ๊ธธ๊นŒ์š”?โ€

[ํ•ต์‹ฌ ์„ค๋ช…: ๊ธฐ์กด์—๋Š” ์ผ์ •ํ•œ ์ƒ˜ํ”Œ๋ง ์Šคํ…์—๋งŒ ์ตœ์ ํ™”๋œ ์ผ๊ด€์„ฑ ๊ธฐ๋ฐ˜ ๋””์Šคํ‹ธ๋ ˆ์ด์…˜์„ ์‚ฌ์šฉํ–ˆ์ง€๋งŒ, ์ด ๋…ผ๋ฌธ์€ ์ „์ฒด ODE ์ƒ˜ํ”Œ๋ง ๊ฒฝ๋กœ๋ฅผ ์ตœ์ ํ™”ํ•ด โ€˜์–ธ์ œ๋“  ์Šคํ…โ€™์œผ๋กœ ์œ ์—ฐํ•˜๊ฒŒ ํ™•์žฅํ•ฉ๋‹ˆ๋‹ค.]

ํŠนํžˆ ์ฃผ๋ชฉํ•  ์ :

  • 1.3B~14B ํŒŒ๋ผ๋ฏธํ„ฐ ๊ทœ๋ชจ์—์„œ **few-step regime์—์„œ ์ผ๊ด€์„ฑ ๊ธฐ๋ฐ˜ ๋ชจ๋ธ๊ณผ ์„ฑ๋Šฅ์„ ๋™์ผํ•˜๊ฑฐ๋‚˜ ์ดˆ๊ณผ** (14B ๋ชจ๋ธ์€ 16์Šคํ…์—์„œ 93.7%์˜ FID ์„ฑ๋Šฅ)
  • **์ƒ˜ํ”Œ๋ง ์Šคํ… ์ˆ˜๊ฐ€ ์ฆ๊ฐ€ํ• ์ˆ˜๋ก ์„ฑ๋Šฅ์ด ์„ ํ˜•์ ์œผ๋กœ ํ–ฅ์ƒ** (16์Šคํ…์—์„œ 93.7% โ†’ 32์Šคํ…์—์„œ 96.2% FID)

๐ŸŽฏ ์™œ ์ด๊ฒƒ์ด ๊ฒŒ์ž„ ์ฒด์ธ์ €์ธ๊ฐ€? :

**๊ธฐ์กด ๋ฐฉ์‹ โ†’ ์ผ๊ด€์„ฑ ๊ธฐ๋ฐ˜ ๋””์Šคํ‹ธ๋ ˆ์ด์…˜(๊ณ ์ • ์Šคํ…์— ์ตœ์ ํ™”)** โ†’ **์ƒˆ ๋ฐฉ์‹ โ†’ ํ๋ฆ„ ๋งต ๊ธฐ๋ฐ˜ ๋””์Šคํ‹ธ๋ ˆ์ด์…˜(ODER ์ „์ฒด ๊ฒฝ๋กœ ์ตœ์ ํ™”)**

3
๐Ÿ›๏ธ ๋น…ํ…Œํฌ
Google

๐Ÿš€ โ€œ3D ์ถ”์ ์— ๋””ํ“จ์ „ ๋ชจ๋ธ์„ ์“ฐ๋‹ค๋‹ˆโ€ฆ ์ด๊ฑฐ ์ง„์งœ ํ๋ฆ„์ด ๋’ค์ง‘์–ด์กŒ๋„ค!โ€

TrackCraft3R: Repurposing Video Diffusion Transformers for Dense 3D Tracking

๐Ÿ›๏ธ ์†Œ์†: Google (๋น…ํ…Œํฌ)

๐Ÿท๏ธ ํ•ต์‹ฌ ํ‚ค์›Œ๋“œ: video diffusion, 3D tracking, dense point tracking, temporal alignment, LoRA fine-tuning

๐Ÿ’ญ ์ด๋Ÿฐ ์งˆ๋ฌธ์„ ํ•ด๋ณธ ์  ์žˆ๋‚˜์š”?

  • โ€œ์™œ 3D ์ถ”์ ์€ ํ•ญ์ƒ โ€˜ํ”„๋ ˆ์ž„ ๋‹จ์œ„ ์ƒ์„ฑโ€™ ๋ชจ๋ธ์— ์˜์กดํ•ด์•ผ ํ•˜๋‚˜์š”?โ€
  • โ€œ์‹ค์ œ ์˜์ƒ์—์„œ ๋ฐฐ์šด ์›€์ง์ž„ prior๊ฐ€ 3D ์ถ”์ ์— ์™œ ์—†์—ˆ๋‚˜์š”?โ€
  • โ€œ์ด๊ฑฐ ํ›ˆ๋ จ ์‹œ๊ฐ„์ด ๊ธธ์–ด๋„ ์„ฑ๋Šฅ์ด ๋›ฐ์–ด๋‚˜๋ฉด, ์™œ ๊ธฐ์กด ๋ชจ๋ธ๋“ค์ด ์“ฐ์ด์ง€ ์•Š๋‚˜์š”?โ€

๊ธฐ์กด์—๋Š” 3D ์ถ”์  ๋ชจ๋ธ์ด ํ”„๋ ˆ์ž„ ๋‹จ์œ„๋กœ ์ƒ์„ฑํ•˜๋Š” ๋””ํ“จ์ „ ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•ด๋„, ๋ฌผ๋ฆฌ์  ํฌ์ธํŠธ๋ฅผ ์‹œ๊ฐ„์— ๋”ฐ๋ผ ์ผ๊ด€๋˜๊ฒŒ ์ถ”์ ํ•  ์ˆ˜ ์—†์—ˆ๊ณ , ๋Œ€๋ถ€๋ถ„ ํ•™์Šต ๋ฐ์ดํ„ฐ๊ฐ€ ํ•ฉ์„ฑ ์˜์ƒ์ด๋‚˜ ์ •์  ๋‹ค์ค‘ ์‹œ์  ์ด๋ฏธ์ง€์˜€๊ธฐ์— ์‹ค์ œ ์›€์ง์ž„์„ ์ž˜ ๋ชจ๋ฐฉํ•˜์ง€ ๋ชปํ–ˆ๋‹ค. ์ด ๋…ผ๋ฌธ์€ โ€˜๋น„๋””์˜ค ๋””ํ“จ์ „ ํŠธ๋žœ์Šคํฌ๋จธโ€™๋ฅผ ๋‹จ์ˆœํžˆ ์žฌ์‚ฌ์šฉํ•˜๋Š” ๊ฒŒ ์•„๋‹ˆ๋ผ, **์ฐธ์กฐ ํ”„๋ ˆ์ž„ ๊ธฐ์ค€์œผ๋กœ ๋ฌผ๋ฆฌ์  ํฌ์ธํŠธ๋ฅผ ์ถ”์ ํ•˜๋Š” ๋ฐฉ์‹์œผ๋กœ ์™„์ „ํžˆ ์žฌ์„ค๊ณ„**ํ•ด, ํ•œ ๋ฒˆ์˜ ์ „๋ฐฉํ–ฅ ํŒจ์Šค๋กœ ์ „์ฒด ์˜์ƒ์˜ 3D ํฌ์ธํŠธ ์ถ”์ ์„ ์™„์„ฑํ•œ๋‹ค.

ํŠนํžˆ ์ฃผ๋ชฉํ•  ์ :

  • **๊ธฐ์กด ์ตœ๊ฐ• ๋ชจ๋ธ ๋Œ€๋น„ 1.3๋ฐฐ ๋น ๋ฅธ ์†๋„**์™€ **4.6๋ฐฐ ์ ์€ ํ”ผํฌ ๋ฉ”๋ชจ๋ฆฌ ์‚ฌ์šฉ**
  • **์Šคํ”„๋ ˆ์Šค ๋ฐ ๋ฉ์Šค 3D ์ถ”์  ๋ฒค์น˜๋งˆํฌ์—์„œ SOTA ์„ฑ๋Šฅ ๋‹ฌ์„ฑ**

๐ŸŽฏ ์™œ ์ด๊ฒƒ์ด ๊ฒŒ์ž„ ์ฒด์ธ์ €์ธ๊ฐ€? :

**ํ”„๋ ˆ์ž„ ๊ธฐ์ค€ ์ƒ์„ฑ ๋ชจ๋ธ โ†’ ์ฐธ์กฐ ํ”„๋ ˆ์ž„ ๊ธฐ์ค€ ์ถ”์  ๋ชจ๋ธ**

4
๐Ÿ›๏ธ ๋น…ํ…Œํฌ
ByteDance Seed

๐Ÿง  โ€œ128K ๋ง‰์•„๋‘๋ฉด ๋? ์•„๋‹ˆ, 512K๊นŒ์ง€ ์“ฐ๋Š” ๋ฒ•์„ ์•Œ์•„์•ผ ํ•ด!โ€

Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context

๐Ÿ›๏ธ ์†Œ์†: ByteDance Seed (๋น…ํ…Œํฌ)

๐Ÿท๏ธ ํ•ต์‹ฌ ํ‚ค์›Œ๋“œ: long-context, vision-language model, continued pre-training, generalization, multimodal retrieval

๐Ÿ’ญ ์ด๋Ÿฐ ์งˆ๋ฌธ์„ ํ•ด๋ณธ ์  ์žˆ๋‚˜์š”?

  • โ€œ128K ๋ฌธ๋งฅ์ด ์ตœ๋Œ€ํ•œ์ด์•ผ, ๋” ๊ธธ๊ฒŒ ์“ฐ๋ ค๋ฉด ์–ด๋–ป๊ฒŒ ํ•ด?โ€
  • โ€œ๊ธด ๋ฌธ์„œ ์ดํ•ด์— OCR๋ณด๋‹ค VQA๊ฐ€ ๋” ์ข‹๋‹ค๋Š”๋ฐ, ์™œ?โ€
  • โ€œ๊ธด ๋ฌธ๋งฅ ๋ชจ๋ธ์„ ํ›ˆ๋ จํ•  ๋•Œ ์งง์€ ๋ฐ์ดํ„ฐ๋„ ํ•„์š”ํ• ๊นŒ?โ€

๊ธฐ์กด์—๋Š” ๊ธด ๋ฌธ๋งฅ ๋ชจ๋ธ์„ ํ›ˆ๋ จํ•  ๋•Œ ๊ธด ๋ฐ์ดํ„ฐ๋งŒ ์“ฐ๋Š” ๊ฒŒ ์ตœ์„ ์ด์—ˆ๋‹ค. ์ด ๋…ผ๋ฌธ์€ โ€œ์งง์€ ๋ฐ์ดํ„ฐ๋Š” ํ•„์š” ์—†์–ดโ€๋ผ๋Š” ๊ฒฐ๋ก ์„ ๋‚ด๋ฆฌ๋ฉฐ, 128K ๋ฌธ๋งฅ ๋ชจ๋ธ์„ 256K, 512K๊นŒ์ง€ ์ž์—ฐ์Šค๋Ÿฝ๊ฒŒ ํ™•์žฅํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ์ œ์‹œํ–ˆ๋‹ค.

ํŠนํžˆ ์ฃผ๋ชฉํ•  ์ :

  • 128K ๋ฌธ๋งฅ ํ›ˆ๋ จ ๋ชจ๋ธ์ด 256K, 512K ๋ฌธ๋งฅ์—์„œ๋„ ์„ฑ๋Šฅ ์œ ์ง€ โ†’ **128K ํ›ˆ๋ จ ๋ชจ๋ธ์ด 512K ๋ฌธ๋งฅ์—์„œ๋„ 7.1% VQA ์ ์ˆ˜ ํ–ฅ์ƒ**
  • 5B-token ์˜ˆ์‚ฐ์œผ๋กœ 7B ๋ชจ๋ธ์„ 128K ๋ฌธ๋งฅ์œผ๋กœ ํ™•์žฅ โ†’ **5B ํ† ํฐ๋งŒ์œผ๋กœ 128K ๋ฌธ๋งฅ ๋ชจ๋ธ ์ƒ์„ฑ**

๐ŸŽฏ ์™œ ์ด๊ฒƒ์ด ๊ฒŒ์ž„ ์ฒด์ธ์ €์ธ๊ฐ€? :

โ€œ๊ธด ๋ฌธ๋งฅ ๋ฐ์ดํ„ฐ๋งŒ ์“ฐ๋Š” ํ›ˆ๋ จ ๋ฐฉ์‹โ€ โ†’ โ€œ์งง์€ ๋ฐ์ดํ„ฐ ํ˜ผํ•ฉ ์—†์ด๋„ ๊ธด ๋ฌธ๋งฅ + ๋‹ค์–‘ํ•œ ํƒœ์Šคํฌ์—์„œ ์„ฑ๋Šฅ ์œ ์ง€โ€

๋…ผ๋ฌธ ๋ณด๊ธฐ โ†’ Zhaowei Wang, Lishu Luo, Haodong Duan ์™ธ 9๋ช…
5
๐Ÿ›๏ธ ๋น…ํ…Œํฌ
Microsoft Research

๐ŸŽฏ "LLM์ด ์Šค์Šค๋กœ ๋ฌธ์ œ๋ฅผ ํ’€ ์ˆ˜ ์žˆ๋‹ค๋ฉด, ์™œ ์šฐ๋ฆฌ ์ธ๊ฐ„์ด ๊ณ„์† ์ฝ”๋“œ๋ฅผ ์จ์•ผ ํ•ด?"

Orchard: An Open-Source Agentic Modeling Framework

๐Ÿ›๏ธ ์†Œ์†: Microsoft Research (๋น…ํ…Œํฌ)

๐Ÿท๏ธ ํ•ต์‹ฌ ํ‚ค์›Œ๋“œ: agentic modeling, open-source framework, scalable training, credit-assignment, RL with distilled trajectories

๐Ÿ’ญ ์ด๋Ÿฐ ์งˆ๋ฌธ์„ ํ•ด๋ณธ ์  ์žˆ๋‚˜์š”?

  • โ€œ์˜คํ”ˆ์†Œ์Šค๋กœ๋„ LLM์ด ์Šค์Šค๋กœ ๋ฌธ์ œ๋ฅผ ํ’€ ์ˆ˜ ์žˆ์„๊นŒ?โ€
  • โ€œ์™œ ๋ชจ๋“  ์ตœ๊ณ  ์„ฑ๋Šฅ ๋ชจ๋ธ์ด ํ”„๋ผ์ด๋น— ์ฝ”๋“œ์—๋งŒ ์˜์กดํ•˜๋Š” ๊ฑธ๊นŒ?โ€
  • โ€œ์‚ฌ์šฉ์ž ํ™˜๊ฒฝ์—์„œ AGENT๊ฐ€ ์‹ค์ œ๋กœ ์„ฑ๊ณตํ•  ์ˆ˜ ์žˆ์„๊นŒ?โ€

[ํ•ต์‹ฌ ์„ค๋ช…: ๊ธฐ์กด์—๋Š” ํ”„๋ผ์ด๋น— ์ฝ”๋“œ์™€ ๊ณ ๋น„์šฉ ์ธํ”„๋ผ์— ์˜์กดํ•ด AGENT๋ฅผ ํ›ˆ๋ จํ–ˆ์ง€๋งŒ, ์ด ๋…ผ๋ฌธ์€ ์˜คํ”ˆ์†Œ์Šค + ๊ฐ€๋ฒผ์šด ํ™˜๊ฒฝ ๋ ˆ์ด์–ด๋กœ **์Šค์ผ€์ผ๋ง ๊ฐ€๋Šฅํ•œ AGENT ํ›ˆ๋ จ์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ** ๋งŒ๋“ค์—ˆ์Šต๋‹ˆ๋‹ค.]

ํŠนํžˆ ์ฃผ๋ชฉํ•  ์ :

  • **Orchard-SWE**: Qwen3-30B-A3B-Thinking ๊ธฐ๋ฐ˜์œผ๋กœ SFT ํ›„ 64.3%, SFT+RL ํ›„ 67.5% ์„ฑ๊ณผ ๋‹ฌ์„ฑ โ†’ ์˜คํ”ˆ์†Œ์Šค ๋ชจ๋ธ ์ค‘ ์ตœ๊ณ  ๊ธฐ๋ก
  • **Orchard-GUI**: 0.4K distilled trajectories + 2.2K open-ended tasks๋กœ 74.1% (WebVoyager), 67.0% (Online-Mind2Web), 64.0% (DeepShop) ์„ฑ๊ณต๋ฅ  ๋‹ฌ์„ฑ โ†’ ํ”„๋ฆฌ๋ฏธ์—„ ์‹œ์Šคํ…œ๊ณผ ๊ฒฝ์Ÿ ์ˆ˜์ค€

๐ŸŽฏ ์™œ ์ด๊ฒƒ์ด ๊ฒŒ์ž„ ์ฒด์ธ์ €์ธ๊ฐ€? :

**โ€œํ”„๋ผ์ด๋น— ์ฝ”๋“œ + ๊ณ ๋น„์šฉ ์ธํ”„๋ผ๋กœ๋งŒ AGENT ํ›ˆ๋ จ์ด ๊ฐ€๋Šฅํ–ˆ๋‹คโ€** โ†’ **โ€œ์˜คํ”ˆ์†Œ์Šค ํ™˜๊ฒฝ ๋ ˆ์ด์–ด + ๊ฐ€๋ฒผ์šด ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ๋กœ๋„ AGENT ์„ฑ๋Šฅ์„ ๋Œ์–ด์˜ฌ๋ฆด ์ˆ˜ ์žˆ๋‹คโ€**

6
๐Ÿ›๏ธ ๋น…ํ…Œํฌ
NVIDIA

๐ŸŽฌ โ€œ1๋ถ„ ์˜์ƒ ์ƒ์„ฑ์— 36๋ฐฐ ๋น ๋ฅด๊ฒŒ? ์ด๊ฑด ๋ญ์ง€โ€ฆ ์™œ ์ด๊ฑธ ์•ˆ ์•Œ์•˜์„๊นŒ?โ€

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

๐Ÿ›๏ธ ์†Œ์†: NVIDIA (๋น…ํ…Œํฌ)

๐Ÿท๏ธ ํ•ต์‹ฌ ํ‚ค์›Œ๋“œ: world modeling, diffusion transformer, camera control, linear attention, video generation

๐Ÿ’ญ ์ด๋Ÿฐ ์งˆ๋ฌธ์„ ํ•ด๋ณธ ์  ์žˆ๋‚˜์š”?

  • โ€œ๋Œ€๊ทœ๋ชจ ์›”๋“œ ๋ชจ๋ธ๋ง์ด ์™œ ์ด๋ ‡๊ฒŒ ๋น„์‹ธ๊ณ  ๋А๋ฆฌ์ง€?โ€
  • โ€œํ•œ ๋ฒˆ์— 60์ดˆ ์˜์ƒ ์ƒ์„ฑํ•˜๋ ค๋ฉด GPU ๋ช‡ ๊ฐœ๊ฐ€ ํ•„์š”ํ•˜์ง€ ์•Š๋‚˜?โ€
  • โ€œ720p ์˜์ƒ๊นŒ์ง€ ๋†’์€ ํ’ˆ์งˆ๋กœ ๋งŒ๋“ค๋ฉด์„œ๋„, ๋‹จ์ผ GPU๋กœ ๋น ๋ฅด๊ฒŒ ์ฒ˜๋ฆฌํ•  ์ˆ˜ ์žˆ์„๊นŒ?โ€

[ํ•ต์‹ฌ ์„ค๋ช…: ๊ธฐ์กด์—๋Š” ๋Œ€๊ทœ๋ชจ ๋ชจ๋ธ๊ณผ ์ˆ˜์‹ญ ๊ฐœ์˜ GPU๊ฐ€ ํ•„์š”ํ–ˆ์ง€๋งŒ, ์ด ๋…ผ๋ฌธ์€ 2.6B ํŒŒ๋ผ๋ฏธํ„ฐ๋กœ 1๋ถ„ ์˜์ƒ ์ƒ์„ฑ์„ ๋‹จ์ผ GPU์—์„œ ๋น ๋ฅด๊ฒŒ ์™„์„ฑํ•ฉ๋‹ˆ๋‹ค.]

ํŠนํžˆ ์ฃผ๋ชฉํ•  ์ :

  • 15์ผ ๋™์•ˆ 64๊ฐœ์˜ H100 GPU๋กœ ํ›ˆ๋ จ ์™„๋ฃŒ, 60์ดˆ ์˜์ƒ ์ƒ์„ฑ์€ ๋‹จ์ผ GPU๋กœ ๊ฐ€๋Šฅ
  • 34์ดˆ ๋‚ด์— RTX 5090 + NVFP4 ์–‘์žํ™”๋กœ 720p 60์ดˆ ์˜์ƒ ๋…ธ์ด์ฆˆ ์ œ๊ฑฐ ๊ฐ€๋Šฅ

๐ŸŽฏ ์™œ ์ด๊ฒƒ์ด ๊ฒŒ์ž„ ์ฒด์ธ์ €์ธ๊ฐ€? :

โ€œ๋Œ€๊ทœ๋ชจ ์‚ฐ์—… ๊ธฐ์ค€ ๋ชจ๋ธ(์˜ˆ: LingBot-World)๊ณผ ๋™์ผํ•œ ์‹œ๊ฐ ํ’ˆ์งˆโ€ โ†’ โ€œ36๋ฐฐ ๋†’์€ ์ฒ˜๋ฆฌ ์†๋„๋กœ ์Šค์ผ€์ผ๋ง ๊ฐ€๋Šฅโ€

๋…ผ๋ฌธ ๋ณด๊ธฐ โ†’ Haoyi Zhu, Haozhe Liu, Yuyang Zhao ์™ธ 6๋ช…
7
๐Ÿ›๏ธ ๋น…ํ…Œํฌ
Microsoft

๐Ÿค– โ€œ์‚ฌ์šฉ์ž ํ–‰๋™ ๊ณต๊ฐ„์„ ์™„์ „ํžˆ ๋ฎ๋Š” ๊ฒŒ, AI๊ฐ€ ์ธ๊ฐ„์ฒ˜๋Ÿผ ์ปดํ“จํ„ฐ๋ฅผ ์“ฐ๋Š” ๋ฐ ํ•ต์‹ฌ์ด์•ผ?โ€

Covering Human Action Space for Computer Use: Data Synthesis and Benchmark

๐Ÿ›๏ธ ์†Œ์†: Microsoft (๋น…ํ…Œํฌ)

๐Ÿท๏ธ ํ•ต์‹ฌ ํ‚ค์›Œ๋“œ: computer-use agent, GUI interaction, data synthesis, benchmark, multimodal action

๐Ÿ’ญ ์ด๋Ÿฐ ์งˆ๋ฌธ์„ ํ•ด๋ณธ ์  ์žˆ๋‚˜์š”?

  • โ€œํด๋ฆญ๋งŒ์œผ๋กœ๋„ ๋ณต์žกํ•œ ์ž‘์—…์„ ์™„๋ฃŒํ•  ์ˆ˜ ์žˆ์„๊นŒ?โ€
  • โ€œAI๊ฐ€ ํ™”๋ฉด์—์„œ ๋“œ๋ž˜๊ทธ, ๊ทธ๋ฆผ ๊ทธ๋ฆฌ๊ธฐ, ํ‘œ ์กฐ์ž‘๊นŒ์ง€ ํ•  ์ˆ˜ ์žˆ์„๊นŒ?โ€
  • โ€œ์‹ค์ œ ์‚ฌ์šฉ์ž๋“ค์ด ํ•˜๋Š” ๋‹ค์–‘ํ•œ ํ–‰๋™์„ AI๊ฐ€ ํ•™์Šตํ•ด์•ผ ํ•˜๋Š”๋ฐ, ๋ฐ์ดํ„ฐ๊ฐ€ ๋ถ€์กฑํ•œ ๊ฑธ๊นŒ?โ€

[ํ•ต์‹ฌ ์„ค๋ช…: ๊ธฐ์กด์—๋Š” GUI ํด๋ฆญ ์ค‘์‹ฌ์˜ ๋ฒค์น˜๋งˆํฌ๋กœ ์ œํ•œ๋œ AI๊ฐ€ ์ธ๊ฐ„์˜ ๋ณต์žกํ•œ ํ–‰๋™ ๊ณต๊ฐ„์„ ๋ฎ์ง€ ๋ชปํ–ˆ์œผ๋‚˜, ์ด ๋…ผ๋ฌธ์€ 5๊ฐ€์ง€ ๋ชจ๋‹ฌ๋ฆฌํ‹ฐ(ํ™”๋ฉด, ํ…์ŠคํŠธ, ํ‘œ, ์บ”๋ฒ„์Šค, ์ž์—ฐ ์ด๋ฏธ์ง€)์™€ ๋‹ค์–‘ํ•œ ์•ก์…˜(ํด๋ฆญ, ๋“œ๋ž˜๊ทธ, ๊ทธ๋ฆผ ๊ทธ๋ฆฌ๊ธฐ ๋“ฑ)์„ ํฌํ•จํ•œ โ€˜CUActSpotโ€™์„ ํ†ตํ•ด AI๊ฐ€ ์ธ๊ฐ„์˜ ํ–‰๋™ ๊ณต๊ฐ„์„ ์™„์ „ํžˆ ์ปค๋ฒ„ํ•  ์ˆ˜ ์žˆ๊ฒŒ ํ–ˆ๋‹ค.]

ํŠนํžˆ ์ฃผ๋ชฉํ•  ์ :

  • **Phi-Ground-Any-4B ๋ชจ๋ธ์ด 32B ํŒŒ๋ผ๋ฏธํ„ฐ ์ดํ•˜ ์˜คํ”ˆ์†Œ์Šค ๋ชจ๋ธ๋ณด๋‹ค ์„ฑ๋Šฅ ์šฐ์œ„๋ฅผ ๋ณด์ž„**
  • **5๊ฐ€์ง€ ๋ชจ๋‹ฌ๋ฆฌํ‹ฐ์™€ ๋‹ค์–‘ํ•œ ์•ก์…˜(ํด๋ฆญ, ๋“œ๋ž˜๊ทธ, ๊ทธ๋ฆผ ๊ทธ๋ฆฌ๊ธฐ ๋“ฑ)์„ ํฌํ•จํ•œ ๋ฒค์น˜๋งˆํฌ CUActSpot์„ ์ œ์•ˆ**

๐ŸŽฏ ์™œ ์ด๊ฒƒ์ด ๊ฒŒ์ž„ ์ฒด์ธ์ €์ธ๊ฐ€? :

**ํด๋ฆญ ์ค‘์‹ฌ์˜ GUI ๋ฒค์น˜๋งˆํฌ โ†’ 5๋ชจ๋‹ฌ๋ฆฌํ‹ฐ + ๋‹ค์ค‘ ์•ก์…˜์„ ํ†ตํ•œ ์ธ๊ฐ„ ํ–‰๋™ ๊ณต๊ฐ„ ์™„์ „ ์ปค๋ฒ„**

8
๐Ÿ›๏ธ ๋น…ํ…Œํฌ
ARC Lab, Tencent PCG

๐Ÿ–ผ๏ธ โ€œ3D ๋ชจ๋ธ์ด ์‚ฌ์ง„์„ โ€˜์ •ํ™•ํžˆโ€™ ์žฌํ˜„ํ•˜๋Š” ๊ฒŒ ๋ถˆ๊ฐ€๋Šฅํ•˜๋‹ค? ์ด ๋…ผ๋ฌธ์ด ๊ทธ ๋ฒฝ์„ ๋ฌด๋„ˆ๋œจ๋ฆฝ๋‹ˆ๋‹ค.โ€

Pixal3D: Pixel-Aligned 3D Generation from Images

๐Ÿ›๏ธ ์†Œ์†: ARC Lab, Tencent PCG (๋น…ํ…Œํฌ)

๐Ÿท๏ธ ํ•ต์‹ฌ ํ‚ค์›Œ๋“œ: pixel-aligned, 3D generation, image-to-3D, back-projection, multi-view synthesis

๐Ÿ’ญ ์ด๋Ÿฐ ์งˆ๋ฌธ์„ ํ•ด๋ณธ ์  ์žˆ๋‚˜์š”?

  • โ€œ3D ๋ชจ๋ธ์ด ์‚ฌ์ง„๊ณผ ๋˜‘๊ฐ™์ด ๋ณด์ด๋ ค๋ฉด ์–ด๋–ป๊ฒŒ ํ•ด์•ผ ํ• ๊นŒ?โ€
  • โ€œ์™œ 3D ์ƒ์„ฑ ๋ชจ๋ธ์ด ์‚ฌ์ง„์˜ ํ”ฝ์…€์„ ์ •ํ™•ํžˆ ๋”ฐ๋ผ๊ฐ€์ง€ ๋ชปํ•˜๋Š” ๊ฑธ๊นŒ?โ€
  • โ€œ๋‹จ์ผ ์ด๋ฏธ์ง€๋กœ 3D ์žฅ๋ฉด์„ ๋งŒ๋“ค ๋•Œ, ์–ด๋–ค ๊ธฐ์ˆ ์ด โ€˜์‹ ๋ขฐํ•  ์ˆ˜ ์žˆ๋Š”โ€™ ๊ฒฐ๊ณผ๋ฅผ ๋ณด์žฅํ•  ์ˆ˜ ์žˆ์„๊นŒ?โ€

[ํ•ต์‹ฌ ์„ค๋ช…: ๊ธฐ์กด์—๋Š” 3D ๋ชจ๋ธ์ด ์บ๋…ผ๋ฆฌ์ปฌ ํฌ์ฆˆ์—์„œ ์ƒ์„ฑํ•˜๊ณ , ์ด๋ฏธ์ง€ ์ •๋ณด๋ฅผ ์–ดํ…์…˜์œผ๋กœ ์ฃผ์ž…ํ•ด ํ”ฝ์…€-3D ๋Œ€์‘์ด ๋ถˆ๋ช…ํ™•ํ–ˆ์ง€๋งŒ, ์ด ๋…ผ๋ฌธ์€ ํ”ฝ์…€ ์ •๋ ฌ๋œ 3D ์ƒ์„ฑ์„ ํ†ตํ•ด ์ด๋ฏธ์ง€ ๋ทฐ์™€ ์ผ์น˜ํ•˜๋Š” 3D ๊ณต๊ฐ„์—์„œ ์ง์ ‘ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค.]

ํŠนํžˆ ์ฃผ๋ชฉํ•  ์ :

  • **ํ”ฝ์…€ ์ •๋ ฌ 3D ์ƒ์„ฑ์œผ๋กœ ์ด๋ฏธ์ง€์˜ ํ”ฝ์…€ ์ˆ˜์ค€ ์‹ ๋ขฐ๋„๋ฅผ 87% ํ–ฅ์ƒ** โ€” ๊ธฐ์กด ๋ชจ๋ธ ๋Œ€๋น„ 2.3๋ฐฐ ๋†’์€ ํ”ฝ์…€ ์ •ํ™•๋„๋ฅผ ๋‹ฌ์„ฑ
  • **๋‹ค์ค‘ ๋ทฐ ํ•ฉ์„ฑ์—์„œ 3D ์žฅ๋ฉด ์ƒ์„ฑ ์„ฑ๊ณต๋ฅ  94%** โ€” 100๊ฐœ ์ด์ƒ์˜ ๋ทฐ์—์„œ 3D ๊ฐ์ฒด ๋ถ„๋ฆฌ์™€ ์žฅ๋ฉด ๋ณต์›์„ ๋™์‹œ์— ๋‹ฌ์„ฑ

๐ŸŽฏ ์™œ ์ด๊ฒƒ์ด ๊ฒŒ์ž„ ์ฒด์ธ์ €์ธ๊ฐ€? :

**๊ธฐ์กด ๋ฐฉ์‹: ์บ๋…ผ๋ฆฌ์ปฌ ๊ณต๊ฐ„์—์„œ 3D ์ƒ์„ฑ โ†’ ์ด๋ฏธ์ง€ ์ •๋ณด๋ฅผ ์–ดํ…์…˜์œผ๋กœ ์ฃผ์ž…**

**์ƒˆ ๋ฐฉ์‹: ํ”ฝ์…€ ์ •๋ ฌ๋œ 3D ๊ณต๊ฐ„์—์„œ ์ง์ ‘ ์ƒ์„ฑ โ†’ ํ”ฝ์…€-3D ๋Œ€์‘์„ ๋ช…ํ™•ํžˆ ํ•ด ์ •ํ™•๋„ ๊ทน๋Œ€ํ™”**

9
๐Ÿ›๏ธ ๋น…ํ…Œํฌ
IBM Research

๐Ÿš€ โ€œLLM์ด ๋„๊ตฌ๋ฅผ ์“ฐ๋Š” ๊ฒŒ ์•„๋‹ˆ๋ผ, ์„ธ์ƒ์„ ์˜ˆ์ธกํ•ด์„œ ์“ฐ๋Š” ๊ฑฐ์•ผ?โ€

MCP-Cosmos: World Model-Augmented Agents for Complex Task Execution in MCP Environments

๐Ÿ›๏ธ ์†Œ์†: IBM Research (๋น…ํ…Œํฌ)

๐Ÿท๏ธ ํ•ต์‹ฌ ํ‚ค์›Œ๋“œ: World Model, MCP, Agent, Task Planning, Execution Quality

๐Ÿ’ญ ์ด๋Ÿฐ ์งˆ๋ฌธ์„ ํ•ด๋ณธ ์  ์žˆ๋‚˜์š”?

  • โ€œ๋„๊ตฌ๋ฅผ ์“ฐ๋Š” ์ˆœ๊ฐ„๋งˆ๋‹ค ๊ณ„ํš์„ ๋‹ค์‹œ ์งœ์•ผ ํ•˜๋‚˜์š”?โ€
  • โ€œLLM์ด ํ™˜๊ฒฝ์„ ์ดํ•ดํ•˜๋Š” ๊ฒŒ ์•„๋‹ˆ๋ผ, ๊ทธ๋ƒฅ ๋ฐ˜์‘ํ•˜๋Š” ๊ฑฐ ์•„๋‹Œ๊ฐ€์š”?โ€
  • โ€œ๊ณ„ํš์€ ์ž˜ ์งœ๋Š”๋ฐ, ์‹คํ–‰์ด ์•ˆ ๋˜๋Š” ์ด์œ ๋Š” ๋ฌด์—‡์ผ๊นŒ?โ€

[ํ•ต์‹ฌ ์„ค๋ช…: ๊ธฐ์กด์—๋Š” LLM์ด ๋„๊ตฌ๋ฅผ ํ˜ธ์ถœํ•  ๋•Œ ํ™˜๊ฒฝ์„ ๋‹จ์ˆœํžˆ ๋ฐ˜์‘์ ์œผ๋กœ ์ฒ˜๋ฆฌํ–ˆ๊ณ , ์žฅ๊ธฐ์  ๊ณ„ํš์€ ๋ฌด์‹œ๋์Šต๋‹ˆ๋‹ค. ์ด ๋…ผ๋ฌธ์€ World Model์„ ๋„์ž…ํ•ด LLM์ด ์‹คํ–‰ ์ „์— โ€˜์ƒํƒœ ์ „์ด๋ฅผ ์‹œ๋ฎฌ๋ ˆ์ด์…˜โ€™ํ•˜๊ณ  โ€˜๊ณ„ํš์„ ๋ฏธ๋ฆฌ ์กฐ์ •โ€™ํ•  ์ˆ˜ ์žˆ๊ฒŒ ํ–ˆ์Šต๋‹ˆ๋‹ค.]

ํŠนํžˆ ์ฃผ๋ชฉํ•  ์ :

  • 20๊ฐœ ์ด์ƒ์˜ MCP-Bench ํƒœ์Šคํฌ์—์„œ ๋„๊ตฌ ์„ฑ๊ณต๋ฅ ์ด ํ‰๊ท  **27% ์ฆ๊ฐ€** (ReAct ๊ธฐ๋ฐ˜)
  • ๋„๊ตฌ ํŒŒ๋ผ๋ฏธํ„ฐ ์ •ํ™•๋„๊ฐ€ **34% ํ–ฅ์ƒ** (SPIRAL ๊ธฐ๋ฐ˜ + 3๊ฐœ World Model)

๐ŸŽฏ ์™œ ์ด๊ฒƒ์ด ๊ฒŒ์ž„ ์ฒด์ธ์ €์ธ๊ฐ€? :

โ€œ๋‹จ์ˆœ ๋ฐ˜์‘ํ˜• ์‹คํ–‰ โ†’ ์‹คํ–‰ ์ „ ์ƒํƒœ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ + ๊ณ„ํš ์กฐ์ •โ€

์ด์ œ LLM์€ ํ™˜๊ฒฝ์„ โ€˜์˜ˆ์ธกโ€™ํ•˜๊ณ , โ€˜์กฐ์ •โ€™ํ•˜๋ฉฐ, โ€˜์‹คํ–‰โ€™ํ•˜๋Š” 3๋‹จ๊ณ„๋ฅผ ํ•˜๋‚˜์˜ ํ”„๋กœ์„ธ์Šค๋กœ ํ†ตํ•ฉํ•ฉ๋‹ˆ๋‹ค.

๋…ผ๋ฌธ ๋ณด๊ธฐ โ†’ Giridhar Ganapavarapu, Dhaval Patel
10
๐Ÿ›๏ธ ๋น…ํ…Œํฌ
Tencent Hunyuan

๐Ÿ”ฅ "๋ชจ๋ธ ๊ธฐ๋ฐ˜์€ ๋น ๋ฅด๊ฒŒ ํ•™์Šตํ•˜์ง€๋งŒ, ์™œๅๅ(์™œๅๅ) ๋นˆ๋ฒˆํ•œ ์˜ค๋ฅ˜๋ฅผ ๋‚ด๋ฑ‰๋Š” ๊ฑธ๊นŒ?"

Debiased Model-based Representations for Sample-efficient Continuous Control

๐Ÿ›๏ธ ์†Œ์†: Tencent Hunyuan (๋น…ํ…Œํฌ)

๐Ÿท๏ธ ํ•ต์‹ฌ ํ‚ค์›Œ๋“œ: model-based representation, debiasing, Q-learning, experience replay, continuous control

๐Ÿ’ญ ์ด๋Ÿฐ ์งˆ๋ฌธ์„ ํ•ด๋ณธ ์  ์žˆ๋‚˜์š”?

  • โ€œ๋ชจ๋ธ ๊ธฐ๋ฐ˜ ํ•™์Šต์€ ์™œ ๋น ๋ฅด๊ฒŒ ํ•™์Šตํ•˜์ง€๋งŒ ์„ฑ๋Šฅ์ด ๋–จ์–ด์ง€๋Š” ๊ฑธ๊นŒ?โ€
  • โ€œ๋ฆฌํ”Œ๋ ˆ์ด ๋ฒ„ํผ์—์„œ ์ด๋ฅธ ์‹œ๊ธฐ์˜ ๊ฒฝํ—˜์„ ๊ณผ๋„ํ•˜๊ฒŒ ์‚ฌ์šฉํ•˜๋ฉด ์–ด๋–ค ๋ฌธ์ œ๊ฐ€ ์ƒ๊ธฐ๋‚˜์š”?โ€
  • โ€œ๋ชจ๋ธ ๊ธฐ๋ฐ˜๊ณผ ๋ชจ๋ธ ํ”„๋ฆฌ์˜ ์žฅ์ ์„ ๋™์‹œ์— ์žก๋Š” ๋ฐฉ๋ฒ•์€ ์ •๋ง๋กœ ๊ฐ€๋Šฅํ• ๊นŒ?โ€

[ํ•ต์‹ฌ ์„ค๋ช…: "๊ธฐ์กด์—๋Š” ๋ชจ๋ธ ๊ธฐ๋ฐ˜ ํ‘œํ˜„์ด ๋น ๋ฅธ ํ•™์Šต์„ ์œ„ํ•ด ๋ฆฌํ”Œ๋ ˆ์ด ๋ฒ„ํผ์˜ ์ด๋ฅธ ๊ฒฝํ—˜์„ ๊ณผ๋„ํ•˜๊ฒŒ ์‚ฌ์šฉํ•ด ์˜ค๋ฅ˜๋ฅผ ์ค„์ด๋ ค ํ–ˆ์ง€๋งŒ, ์ด ๋…ผ๋ฌธ์€ ์ด ์˜ค๋ฅ˜๋ฅผ โ€˜์ •๋ณด ํŽธํ–ฅโ€™์œผ๋กœ ๋ช…ํ™•ํžˆ ์ •์˜ํ•˜๊ณ , โ€˜ํ‘œํ˜„์˜ ์ƒํ˜ธ ์ •๋ณดโ€™์™€ โ€˜๋ณ€์ด ์ตœ์†Œํ™”โ€™๋ฅผ ๋™์‹œ์— ์ตœ์ ํ™”ํ•จ์œผ๋กœ์จ ํ•™์Šต์˜ ์ •ํ™•๋„์™€ ํšจ์œจ์„ฑ์„ ๋™์‹œ์— ๋Œ์–ด์˜ฌ๋ฆฝ๋‹ˆ๋‹ค."]

ํŠนํžˆ ์ฃผ๋ชฉํ•  ์ :

  • DR.Q๋Š” ๋‹จ์ผ ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ ์„ธํŠธ๋กœ ์—ฌ๋Ÿฌ ์—ฐ์† ์ œ์–ด ๋ฒค์น˜๋งˆํฌ์—์„œ ์ตœ์‹  ๊ฐ•๋ ฅํ•œ ๊ธฐ์ค€ ๋ชจ๋ธ๊ณผ ๋™๋“ฑํ•˜๊ฑฐ๋‚˜ ๊ทธ ์ด์ƒ์˜ ์„ฑ๋Šฅ์„ ๋ณด์ด๋ฉฐ, **์ตœ๋Œ€ 20% ์ด์ƒ์˜ ์„ฑ๋Šฅ ํ–ฅ์ƒ**์„ ๊ธฐ๋ก
  • **๋ฆฌํ”Œ๋ ˆ์ด ๋ฒ„ํผ์—์„œ ๊ฒฝํ—˜์˜ ์šฐ์„ ์ˆœ์œ„๋ฅผ ํฌ๋ฏธํ•˜๊ฒŒ ์กฐ์ •**ํ•ด ๊ณผ๋„ํ•œ ์ดˆ๊ธฐ ๊ฒฝํ—˜์— ์˜ํ•œ ํŽธํ–ฅ์„ ์ค„์ž„์œผ๋กœ์จ, ํ•™์Šต ๊ณผ์ •์˜ ์•ˆ์ •์„ฑ์„ 3๋ฐฐ ์ด์ƒ ํ–ฅ์ƒ

๐ŸŽฏ ์™œ ์ด๊ฒƒ์ด ๊ฒŒ์ž„ ์ฒด์ธ์ €์ธ๊ฐ€? :

"๋ชจ๋ธ ๊ธฐ๋ฐ˜ ํ‘œํ˜„์ด ๋ฆฌํ”Œ๋ ˆ์ด ๋ฒ„ํผ์˜ ์ด๋ฅธ ๊ฒฝํ—˜์— ๊ณผ๋„ํ•˜๊ฒŒ ์˜์กดํ•ด ํŽธํ–ฅ์„ ์ผ์œผํ‚ค๋Š” ๋ฐฉ์‹" โ†’ "ํ‘œํ˜„์˜ ์ƒํ˜ธ ์ •๋ณด์™€ ๋ณ€์ด๋ฅผ ๋™์‹œ์— ์ตœ์ ํ™”ํ•˜๋ฉฐ, ๊ฒฝํ—˜ ์šฐ์„ ์ˆœ์œ„๋ฅผ ํฌ๋ฏธํ•˜๊ฒŒ ์กฐ์ •ํ•ด ํŽธํ–ฅ์„ ์ œ๊ฑฐํ•˜๋Š” ์ƒˆ๋กœ์šด ํ•™์Šต ํŒŒ๋ผ๋ฏธํ„ฐ ์กฐ์ • ๋ฐฉ์‹"

โœ‰๏ธ

๋งค์ผ ๋ฐ›์•„๋ณด์„ธ์š”

AI ๋ฐ์ผ๋ฆฌ ๋‰ด์Šค ยท ๋…ผ๋ฌธ ยท GitHub ํŠธ๋ Œ๋“œ๋ฅผ ๋งค์ผ ํ•œ๊ตญ์–ด๋กœ ์ •๋ฆฌํ•ด ๋ณด๋‚ด๋“œ๋ฆฝ๋‹ˆ๋‹ค.

์ŠคํŒธ ์—†์Œ ยท ์–ธ์ œ๋“  ๊ตฌ๋…์ทจ์†Œ ๊ฐ€๋Šฅ