==> 1 ok
==> 2 ok
==> 3 ok
==> 4 ok
==> 5 ok
==> 6 ok
==> 7 ok
==> 8 ok
==> 9 ok
==> 10 ok
==> 11 ok
==> 12 ok
==> 13 ok
==> 14 ok
==> 15 ok
==> 16 ok
==> 17 ok
==> 18 ok
==> 19 ok
==> 20 ok
==> 21 ok
==> 22 ok
==> 23 ok
==> 24 ok
==> 25 ok
==> 26 ok
==> 27 ok
==> 28 ok
==> 29 ok
==> 30 ok
==> 31 ok
==> 32 ok
==> 33 ok
==> 34 ok
==> 35 ok
==> 36 ok
==> 37 ok
==> 38 ok
==> 39 ok
==> 40 ok
==> 41 ok
==> 42 ok
==> 43 ok
==> 44 ok
==> 45 ok
==> 46 ok
==> 47 ok
==> 48 ok
==> 49 ok
==> 50 ok
==> 1 ok
==> 2 ok
==> 3 ok
==> 4 ok
==> 5 ok
==> 6 ok
==> 7 ok
==> 8 ok
==> 9 ok
==> 10 ok
==> 11 ok
==> 12 ok
==> 13 ok
==> 14 ok
==> 15 ok
==> 16 ok
==> 17 ok
==> 18 ok
==> 19 ok
==> 20 ok
==> 21 ok
==> 22 ok
==> 23 ok
==> 24 ok
==> 25 ok
==> 26 ok
==> 27 ok
==> 28 ok
==> 29 ok
==> 30 ok
==> 31 ok
==> 32 ok
==> 33 ok
==> 34 ok
==> 35 ok
==> 36 ok
==> 37 ok
==> 38 ok
==> 39 ok
==> 40 ok
==> 41 ok
==> 42 ok
==> 43 ok
==> 44 ok
==> 45 ok
==> 46 ok
==> 47 ok
==> 48 ok
==> 49 ok
==> 50 ok
感谢本站网友 zhao_31 的线索投递!
本站 2 月 26 日消息,北京时间今日凌晨,微软在官网开源了多模态 AI Agent 基础模型 ——Magma。与传统 Agent 相比,Magma 具备跨数字、物理世界的多模态能力,能自动处理图像、视频、文本等不同类型数据,此外,Magma 还能内置了心理预测功能,增强了对未来视频帧中时空动态的理解能力,能够准确推测视频中人物或物体的意图和未来行为。
用户可以用 Magma 来自动下电商订单、查询天气;也可以自动操作实体机器人,或者在下真实象棋时获得帮助。
根据官方介绍,Magma 能够帮助 AI 驱动的助手或机器人理解周围环境并采取相应行动。例如,它可以帮助家用机器人学习如何整理以前从未见过的物品,或帮助虚拟助手为不熟悉的任务生成逐步的用户界面导航说明。
Magma 是能够适应数字和物理环境中新任务的 VLA(本站注:视觉语言动作)基础模型之一,能够有效地从海量的公开视觉和语言数据中学习知识,从而融合语言、空间和时间智能,应对数字和物理世界中的复杂任务和环境。
本站附开源链接:https://microsoft.github.io/Magma/