最近的人工智能模型，如 和 已经展示了令人印象深刻的生成文本和图像的能力。 ChatGPT Midjourney， 但是，也有专门用于理解这些输入的模型，例如用于图像的 Vision Transformers (ViT) 和用于文本的 Pathways 语言模型 (PaLM)。这些模型可以解释和理解图像和句子的含义。 结合文本和图像模型将产生一个可以理解各种形式的数据并且能够理解几乎所有内容的人工智能。 然而，乍一看，这种模型的能力似乎有限，因为它只能理解事物。但是，如果这个模型与可以在物理世界中移动的机器人系统集成在一起呢？这就是 PaLM-E 的用武之地。 什么是 Google 的 PaLM-E AI 模型？  Google 的最新出版物 PaLM-E 是一种 。 具体化的多模态语言模型 这意味着它是一个可以解释和理解各种类型数据的模型，包括分别来自 ViT 和 PaLM 模型的图像和文本，并通过机械手将这些信息转化为动作。 在视频中了解更多信息……   https://youtu.be/1RF06BL7VAc?embedable=true&transcript=true

Watch more on YouTube: https://www.youtube.com/c/WhatsAI

I explain Artificial Intelligence terms and news to non-experts.

2021 - HackerNoon Contributor of the Year - FACEBOOK

2022 - Best Data Science Newsletter

2022 - HackerNoon Contributor of the Year - Artificial Intelligence

2022 - HackerNoon Contributor of the Year - Computer Vision

2022 - HackerNoon Contributor of the Year - Data Science

2022 - HackerNoon Contributor of the Year - Google

2022 - HackerNoon Contributor of the Year - Innovation

2022 - HackerNoon Contributor of the Year - Machine Learning

2022 - HackerNoon Contributor of the Year - Natural Language Processing

2022 - Top Tech Youtuber

2021 - HackerNoon Contributor of the Year - DEEP-LEARNING

Nominated for 2022 - Best Data Science Newsletter

Nominated for 2022 - HackerNoon Contributor of the Year - Artificial Intelligence

Nominated for 2022 - Top Tech Youtuber

Nominated for 2022 - HackerNoon Contributor of the Year - Innovation

Nominated for 2022 - HackerNoon Contributor of the Year - Data Science

Nominated for 2022 - HackerNoon Contributor of the Year - Natural Language Processing

谷歌的 PaLM-E（AI 机器人）可以看到和理解语言

About Author

註釋

標籤

这篇文章刊登在

Related Stories

释放人工智能的力量。前沿技术的系统评价：摘要与介绍

加密货币增长：创建有效的用户角色

架构师指南：构建 AI/ML 数据湖参考架构

从论坛到信息流：社交媒体算法如何塑造数字互动

释放人工智能的力量。前沿技术的系统评价：摘要与介绍

加密货币增长：创建有效的用户角色

架构师指南：构建 AI/ML 数据湖参考架构

从论坛到信息流：社交媒体算法如何塑造数字互动

Light-Mode

Classic

Newspaper

Minty

Dark-Mode

Neon Noir

Minty

HN StartUps