
A Brief Intro to the GPT-3 Algorithm

by albertchristopher, June 19th, 2021

Too Long; Didn't Read

Generative Pre-trained Transformer 3 (GPT-3) adopts and scales up the GPT-2 model architecture, including its pre-normalization, modified initialization, and reversible tokenization, and it exhibits strong performance on many Natural Language Processing (NLP) tasks. It is a massive artificial neural network that uses deep learning to generate human-like text and is trained on huge text datasets containing hundreds of billions of words. OpenAI GPT-3 dynamically holds 175 billion weights in its memory and uses all of them to process every query.


Generative Pre-trained Transformer 3 (GPT-3) adopts and scales up the GPT-2 model architecture, including its pre-normalization, modified initialization, and reversible tokenization. It exhibits strong performance on many Natural Language Processing (NLP) tasks.

GPT-3 is an auto-regressive artificial intelligence algorithm developed by OpenAI, an AI research laboratory based in San Francisco, California.

It is a massive artificial neural network that uses deep learning to generate human-like text and is trained on huge text datasets containing hundreds of billions of words. It is the third-generation AI language prediction model in the GPT-n series and the successor to GPT-2.

In simple terms, OpenAI GPT-3 was fed examples of how billions of people write and was taught to pick up on writing patterns from that input. Given a few example inputs, the model generates intelligent text that follows the submitted pattern and structure. It is also the largest AI language model, producing billions of words a day.

How GPT-3 works

This artificial intelligence algorithm is a program that calculates which word, or even which character, is most likely to appear in a text given the words around it. This is called the conditional probability of words. As a generative neural network, it can output a numeric score or a yes-or-no answer, and it can also generate long sequences of original text.
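
To make the idea of conditional probability concrete, here is a minimal Python sketch that estimates how likely each word is to follow a given context word, using simple bigram counts over a toy corpus. The corpus and the counting approach are illustrative assumptions; GPT-3 computes these probabilities with a massive neural network, not a lookup table.

```python
# Toy illustration of the "conditional probability of words": estimate how
# likely each candidate word is to follow a given context word, using bigram
# counts from a tiny corpus. GPT-3 does this with a 175-billion-parameter
# neural network over a far larger vocabulary; counting is only a stand-in.
from collections import Counter, defaultdict

corpus = "the cat sat on the mat . the dog sat on the rug .".split()

# Count how often each word follows each preceding word.
bigram_counts = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    bigram_counts[prev][nxt] += 1

def next_word_probs(context_word):
    """Return P(next word | context_word) estimated from bigram counts."""
    counts = bigram_counts[context_word]
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()} if total else {}

print(next_word_probs("the"))  # {'cat': 0.25, 'mat': 0.25, 'dog': 0.25, 'rug': 0.25}
print(next_word_probs("sat"))  # {'on': 1.0}
```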

The total number of weights OpenAI GPT-3 dynamically holds in its memory and uses to process every query is 175 billion.

Examples

• noun + verb = subject + verb

• noun + verb + adjective = subject + verb + adjective

• verb + noun = subject + verb

• noun + verb + noun = subject + verb + noun

• noun + noun = subject + noun

• noun + verb + noun + noun = subject + verb + noun + noun
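
As a toy illustration of the pattern rules listed above, the Python sketch below tags each word in a sentence with a part-of-speech category and looks up the sentence structure it maps to. The tiny lexicon and rule table are made-up assumptions for this example; GPT-3 learns such patterns implicitly from data rather than from explicit hand-written rules.

```python
# Toy sketch of the pattern rules listed above: tag each word with a category,
# then look up the sentence structure that the tag sequence maps to.
LEXICON = {"dogs": "noun", "cats": "noun", "mice": "noun",
           "bark": "verb", "chase": "verb"}

RULES = {
    ("noun", "verb"): "subject + verb",
    ("noun", "verb", "noun"): "subject + verb + noun",
    ("noun", "noun"): "subject + noun",
    ("noun", "verb", "noun", "noun"): "subject + verb + noun + noun",
}

def classify(sentence):
    """Tag each word, then look up the matching production rule."""
    tags = tuple(LEXICON.get(word, "unknown") for word in sentence.lower().split())
    return RULES.get(tags, "no matching rule")

print(classify("Dogs bark"))        # subject + verb
print(classify("Cats chase mice"))  # subject + verb + noun
```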

GPT-3 and the stream of algorithmic content

Every month over 409 million people view more than 20 billion pages, and users publish around 70 million posts on WordPress, which is the dominant content management system online.

The main specialty of OpenAI GPT-3 is its capacity to respond intelligently to minimal input. Trained with billions of parameters, it can produce up to 50,000 characters of output without any supervision. This one-of-a-kind AI neural network generates text of such quality that it is quite tough for a human reader to tell whether the output was written by GPT-3 or by a person.
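
A few lines of code are enough to get a response from minimal input. The sketch below uses the openai Python client's Completion API as it existed around this article's publication; the engine name, prompt, and sampling parameters are illustrative choices, and you would substitute your own API key.

```python
# Minimal sketch of prompting GPT-3 with a small amount of input, using the
# openai Python client's Completion API (circa 2021). Engine name, prompt,
# and parameters are illustrative assumptions, not prescribed values.
import openai

openai.api_key = "YOUR_API_KEY"  # assumption: replace with your real key

response = openai.Completion.create(
    engine="davinci",   # the original GPT-3 base engine
    prompt="Write a one-sentence summary of what GPT-3 is:",
    max_tokens=50,      # cap the length of the generated text
    temperature=0.7,    # higher values give more varied output
)

print(response.choices[0].text.strip())
```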

Training GPT-3

Training the GPT-3 artificial intelligence algorithm involves two steps:

• Step 1: Create the vocabulary, the production rules, and the various categories. This is achieved by feeding the model inputs in the form of books. For each word, the model predicts the category to which the word belongs, and afterward a production rule is built.

• Step 2: Develop the vocabulary and production rules for each category. This is achieved by feeding the model inputs in the form of sentences. For every sentence, the model predicts the category to which each word belongs, and afterward a production rule is built, as illustrated in the sketch below.
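
The following toy Python sketch mirrors this two-step description: it builds a vocabulary from a few input sentences, then derives one production rule per sentence from each word's category. The hand-written category table is an assumption made for illustration only; this is not how OpenAI actually trains GPT-3.

```python
# Toy illustration of the two training steps described above. The category
# table is a made-up assumption, not OpenAI's actual training procedure.
CATEGORY = {"the": "determiner", "a": "determiner",
            "cat": "noun", "dog": "noun", "ball": "noun",
            "sleeps": "verb", "chases": "verb"}

sentences = ["the cat sleeps", "a dog chases the ball"]

# Step 1: create the vocabulary from the inputs.
vocabulary = {word for s in sentences for word in s.split()}

# Step 2: predict each word's category and build a production rule per sentence.
rules = {" + ".join(CATEGORY[w] for w in s.split()) for s in sentences}

print(sorted(vocabulary))
print(sorted(rules))
# ['determiner + noun + verb', 'determiner + noun + verb + determiner + noun']
```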

The model includes a few tricks that boost its capability to generate text. For example, it can guess the beginning of a word by understanding the word's context. It also predicts the next word based on the last words of a sentence, and it can even predict the length of a sentence.

Conclusion

There's a lot of hype around the GPT-3 AI algorithm right now. In the future, it may offer more than text, including pictures, videos, and much more. Many researchers have also predicted that GPT-3 will possess the capability to translate words to pictures and pictures to words.