paint-brush
So, How Do They Really Train AI Models?by@mayankvikash
384 reads
384 reads

So, How Do They Really Train AI Models?

by Mayank VikashDecember 26th, 2023
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

Generative AI models are a fascinating technology that humans still are not able to fully harness.

People Mentioned

Mention Thumbnail
Mention Thumbnail

Company Mentioned

Mention Thumbnail
featured image - So, How Do They Really Train AI Models?
Mayank Vikash HackerNoon profile picture

The year 2023 is coming to an end. This year, Generative AI, Artificial Intelligence, AI taking over the world, ChatGPT, Sam Altman and Elon Musk made millions of headlines. The OpenAI coup was the most interesting.


Artificial has impacted our way of living quite interestingly. People generally do not need to care about fake content or fake news as long as their source of information is just mainstream media - not disagreeing that they publish fake news mistakingly. The fear of fake content was at its peak this year. I do not know why it was so hyped. But the fear was real, the common man was genuinely scared of the negative impact of AI. The fear of fake content was so widespread that Merriam-Webster chose the word, "authentic" as their word of the year 2023.


Many people write about AI and its impact on the modern way of living. Generative AI models are a fascinating technology that humans still are not able to fully harness. It has the power to revolutionize the world. You must have read the AI models work on the principles of Machine Learning, that somehow some exceptionally genius professionals and corporations can teach machines by feeding them data. This sounds like some alien tech.


But the question I was asking myself is how these AI trainers train their models. What approach do they take? How do they do it? Are there any drawbacks to the widely used methods?


Do you feel curious? Then sit right there and let me take you on the journey of finding the answer to these questions. But before that, let me confirm one thing I am not an AI professional and not an expert in the field. I am just a curious student who loves to share what he learns every day. Training an AI model is a complex process and takes years of experience, hard work and numerous collaborators. I wish to delve deeper into the topic, but that topic is for another article.


When we train an AI model, we need to mainly keep three goals in mind:

  • Accuracy - It should provide the output of the query with absolute possible accuracy, with chances of low to zero errors.
  • Generalisation - When we train an AI model, we train it with certain pre-designed queries. The AI model should learn these patterns and should have the ability to apply this knowledge to unforeseen data.
  • Efficiency - The cost of operating an AI model is high. It takes best-performing servers and a lot of computing to answer even the smallest possible query. The AI model should be efficient to lessen the operating cost and provide the best answers to the user's query while also learning about the user's interests, minimizing mistakes, and logging in unexpected operations and functional issues.


While doing all this, we also need to remember that training an AI model is a tiring process that requires thousands of hours and lots of resources. We also need to remember the basic principle of doing it, that is we should be able to teach the model how to transform the raw data (training data) into actionable insights that enable the model to apply its knowledge - from the training data - across the multiple sectors and in various fields.


We can train an AI model in three ways by training it under supervision known as Supervised Learning, by letting it figure out itself called Unsupervised Learning, or by Learning from Interactions. First, we will discuss how Supervised Learning works.


Supervised Learning is when we give the model a list of inputs and train it on the expected outputs. It is similar to training a dog, when we give the dog a treat for performing the actions of sitting and standing on our orders, the dog learns the patterns and soon gets used to it. It knows that if hears the particular word, "sit" and sits down, then he will get his favourite treat. We do not give any treat to the AI model for its good work, but we can kind of give it our acceptance by agreeing on the output it gives to the query. Supervised Learning can guide the model and help it to better its abilities.


The model rates its performance by comparing how far it is from the expected answer. It calculates the difference and makes the report of necessary adjustments on the parts on which it needs to be improved and adjustments are to be made. It is an important part of supervised learning where the AI model learns from its mistakes and also learns to provide answers accurately and its decision-making skills are also improved.


Unsupervised Learning is called when an AI model is given a vast amount of data to play with like a toddler is given lots of building blocks to play with. The child puts them together and makes structures or shapes which helps develop its cognitive abilities. The same is the case with the AI models. It recognizes patterns in this large ocean of training data and learns to map them. Unsupervised learning uses techniques and tools that help the AI model to organise the mapped data and visualise it. It then learns to create possible inputs and suitable outputs from them. Organising the data helps to simplify the complexity and makes the task easier and more efficient for the AI model.


There is also another form of learning which is called Reinforcement Learning. In this type of training method, the AI learn from the interaction. It is similar to the learning of an intern in a job that he wishes to join. The trainer presents questions to the model and the model has to use its decision-making skills to generate outputs that are best suited to answer the user's query. The AI model is then given a reward or penalty by the trainer based on its output. The AI model constantly tries to level up its skill as it strives to get more rewards. It should be noted that the rewards and penalties here are not real and, therefore are virtual. It is just a virtual pointing system created by the trainers for the AI models. The models are programmed to get as many rewards as possible and to avoid penalties.


Reinforcement Learning or Learning from Interaction depends on several complex algorithms to guide the model to make the best decisions.


These were the most general and standard approach taken to train an AI model.


One might also ask the question of how the data to train the AI models are obtained.


Well, there are lots of ways of obtaining data. The best and most recommended one is to buy a data set from a reputable vendor, but most AI models are trained on all the data available online. The second option should be avoided in every case. According to recent reports, many countries including the United States, the United Kingdom and India are going to make it necessary to disclose the information for the AI companies as to from where they have acquired the training data. At the start of this year, many AI companies and services were taken to court on the accusation of stealing and violating the copyrighted works of artists to train their AI models. The most notable case was against Stability AI, Midjourney and Deviant Art.


It is quite an endeavour to train an AI model as it is no less difficult than fostering a child or a pet dog. It requires patience, hard work and resources. The field is new and there are lots of possibilities. Information provided by The Data Scientist.


Also published here.