Recent AI models such as ChatGPT and Midjourney have showcased impressive capabilities in generating text and images.
However, there are also models that specialize in understanding such inputs, like the Vision Transformer (ViT) for images and the Pathways Language Model (PaLM) for text. These models can interpret and extract the meaning of images and sentences.
Combining a text model with an image model would give us an AI that can make sense of several forms of data at once, understanding most of what we show or tell it.
Still, such a model might seem limited at first glance: it can only understand, not act. But what if it were connected to a robotic system that can move in the physical world? This is where PaLM-E comes in.
Google's latest publication, PaLM-E, is an embodied multimodal language model.
This means it can interpret several types of data at once, processing images with a ViT-style encoder and text with PaLM, and turn that combined understanding into actions carried out by a robotic arm.
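To make this more concrete, here is a minimal sketch of how image features from a vision encoder could be projected into a language model's token-embedding space and interleaved with text tokens, which is the core trick behind embodied multimodal models like PaLM-E. This is not PaLM-E's actual code: the dimensions, class names, and the simple linear projection are illustrative assumptions.

```python
import torch
import torch.nn as nn

# Illustrative sizes only; the real models are much larger.
VIT_DIM = 768   # dimension of the vision encoder's patch features (assumed)
LM_DIM = 4096   # dimension of the language model's token embeddings (assumed)

class MultimodalPrefix(nn.Module):
    """Projects ViT image features into the language model's embedding space
    and interleaves them with text token embeddings, so the language model
    can attend over images and words as one sequence."""

    def __init__(self):
        super().__init__()
        # A simple linear projection from vision space to language space.
        self.project = nn.Linear(VIT_DIM, LM_DIM)

    def forward(self, image_feats, text_embeds):
        # image_feats: (num_patches, VIT_DIM) from a ViT-style encoder
        # text_embeds: (num_tokens, LM_DIM) from the LM's embedding table
        image_tokens = self.project(image_feats)  # (num_patches, LM_DIM)
        # Prepend the image "tokens" to the text tokens, forming one
        # multimodal sequence the language model can read.
        return torch.cat([image_tokens, text_embeds], dim=0)

# Usage with random placeholder tensors standing in for real encoder outputs.
prefix = MultimodalPrefix()
fake_image_feats = torch.randn(196, VIT_DIM)  # e.g. 14x14 ViT patches
fake_text_embeds = torch.randn(12, LM_DIM)    # e.g. a 12-token instruction
sequence = prefix(fake_image_feats, fake_text_embeds)
print(sequence.shape)  # torch.Size([208, 4096])
```

Over such a mixed sequence, the language model decodes text, for example a step-by-step plan like "pick up the green block", and that text is interpreted as high-level decisions that a lower-level robot controller executes.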
Learn more in the video…