Artificial intelligence has outdone itself. This is due to the original approach

Once an AI model has specific knowledge or skills, it can easily compete with a human. However, the learning stage itself is the Achilles heel of this type of algorithm. As a result, problems that take a few seconds to solve can be insurmountable for machines. And even if you manage to deal with them, the amount of time and resources turns out to be disproportionate to the actual scale of the difficulties.

Read also: Artificial intelligence advises the prime minister. A European country has hired a robot

As noted by the authors of the post that is now Available as a preliminary printEncouraging the AI to read the user manual before starting the imposed task can speed up the learning process. This is what reinforcement learning looks like: You set a goal and then reward the AI for taking the necessary actions to achieve it.

Wanting to further improve the whole process, scientists from Carnegie Mellon University decided to help the algorithms learn faster. To do this, they combined it with a language model that is able to read user manuals. The effects were not long in coming: the AI learned to play the video game much faster than in the case of the model developed by DeepMind.

Artificial intelligence has been trained as part of what is called reinforcement learning

First, however, the language model had to be trained to be able to extract and summarize key information found in the official game manual. This data was later used to ask questions about this game. The answer, of course, was given by a trained mechanic. These were then used to generate additional rewards and fed into the reinforcement learning algorithm.

Read also: DuckDuckGo is going like a storm. DuckAssist is meant to be your assistant – smart but secure

Finally, it’s time to test. To evaluate their approach, the researchers tested it in a game known as skateboarding. When they compared the results achieved by other tools with those that could have been “thrown away” thanks to the new approach, their hands began to clap. Suffice it to say that before, artificial intelligence had to complete 80 billion methods to achieve performance comparable to human performance. However, scientists managed to reduce this number to 13 million. So we’re talking about a 6,000 times better result. Other more complex products, such as the popular Minecraft, await next.

Echo Richards

Echo Richards embodies a personality that is a delightful contradiction: a humble musicaholic who never brags about her expansive knowledge of both classic and contemporary tunes. Infuriatingly modest, one would never know from a mere conversation how deeply entrenched she is in the world of music. This passion seamlessly translates into her problem-solving skills, with Echo often drawing inspiration from melodies and rhythms. A voracious reader, she dives deep into literature, using stories to influence her own hardcore writing. Her spirited advocacy for alcohol isn’t about mere indulgence, but about celebrating life’s poignant moments.

Artificial intelligence has outdone itself. This is due to the original approach

Up next

Box Office USA: “Scream VI” breaks series record. “Third Creed”, the second “hundred” of the year

Author

Echo Richards

Artificial intelligence has been trained as part of what is called reinforcement learning

Leave a Reply Cancel reply

Number of short-lived gamma-ray bursts resulting from neutron star mergers in dwarf galaxies | Urania

Drivers from this country cause most of the crashes in Tricity

Hiking on a snail. This is how tardigrades travel

Impersonating with the title of professor? Past mistakes come back to us after many years

Why “PlayNumberGame Slot” Is My New Obsession

Padel: The Fast-Growing Sport Shaping the Future of Recreation

How Blockchain is Changing Sports Marketing

Healthy Acrylic Nail Practices for Kids: What to Know

Artificial intelligence has outdone itself. This is due to the original approach

Up next

Author

Echo Richards

Artificial intelligence has been trained as part of what is called reinforcement learning

Leave a Reply Cancel reply

You May Also Like