や などの最近の AI モデルでは、テキストや画像を生成する優れた機能が紹介されています。 ChatGPT Midjourney ただし、画像用の Vision Transformers (ViT) やテキスト用の Pathways 言語モデル (PaLM) など、これらの入力の理解に特化したモデルもあります。これらのモデルは、画像や文章の意味を解釈して理解することができます。 テキスト モデルと画像モデルの両方を組み合わせることで、さまざまな形式のデータを理解し、ほぼすべてを理解できる AI が実現します。 ただし、そのようなモデルの機能は、物事を理解することしかできないため、一見制限されているように見えるかもしれません。しかし、このモデルが物理世界を移動できるロボット システムと統合されている場合はどうなるでしょうか。ここで、PaLM-E の出番です。  Google の PaLM-E AI モデルとは?  Google の最新の出版物、PaLM-E は、 です。 具現化されたマルチモーダル言語モデル これは、ViT モデルと PaLM モデルのそれぞれから画像とテキストを含むさまざまなタイプのデータを解釈して理解し、この情報をロボットハンドを介してアクションに変換できるモデルであることを意味します。 詳しくは動画で…   https://youtu.be/1RF06BL7VAc?embedable=true&transcript=true

Watch more on YouTube: https://www.youtube.com/c/WhatsAI

I explain Artificial Intelligence terms and news to non-experts.

2021 - HackerNoon Contributor of the Year - FACEBOOK

2022 - Best Data Science Newsletter

2022 - HackerNoon Contributor of the Year - Artificial Intelligence

2022 - HackerNoon Contributor of the Year - Computer Vision

2022 - HackerNoon Contributor of the Year - Data Science

2022 - HackerNoon Contributor of the Year - Google

2022 - HackerNoon Contributor of the Year - Innovation

2022 - HackerNoon Contributor of the Year - Machine Learning

2022 - HackerNoon Contributor of the Year - Natural Language Processing

2022 - Top Tech Youtuber

2021 - HackerNoon Contributor of the Year - DEEP-LEARNING

Nominated for 2022 - Best Data Science Newsletter

Nominated for 2022 - HackerNoon Contributor of the Year - Artificial Intelligence

Nominated for 2022 - Top Tech Youtuber

Nominated for 2022 - HackerNoon Contributor of the Year - Innovation

Nominated for 2022 - HackerNoon Contributor of the Year - Data Science

Nominated for 2022 - HackerNoon Contributor of the Year - Natural Language Processing

Nominated for 2022 - HackerNoon Contributor of the Year - Computer Vision

Nominated for 2022 - HackerNoon Contributor of the Year - Google

Nominated for 2022 - HackerNoon Contributor of the Year - Machine Learning

このオーディオは、ストーリーの元の言語で制作されています。

長すぎる; 読むには

GoogleのPaLM-E（AIロボット）は言語を見て理解できる

GoogleのPaLM-E（AIロボット）は言語を見て理解できる

About Author

コメント

ラベル

この記事は

Related Stories

ユーザー中心の暗号通貨製品の作成: 顧客からのフィードバックの重要性

AI の力を解き放つ。最先端技術の体系的レビュー: 概要と序論

Telegram: クリプト島と本土を結ぶ橋

デジタルノマドの皆さん、タイの新しい DTV ビザについて知っておくべきこと

ユーザー中心の暗号通貨製品の作成: 顧客からのフィードバックの重要性

AI の力を解き放つ。最先端技術の体系的レビュー: 概要と序論

Telegram: クリプト島と本土を結ぶ橋

デジタルノマドの皆さん、タイの新しい DTV ビザについて知っておくべきこと

Light-Mode

Classic

Newspaper

Minty

Dark-Mode

Neon Noir

Minty

HN StartUps