OpenAI successfully trained a Minecraft bot using 70,000 hours of gameplay videos

Jimmy2x

Posts: 146   +12
Staff
Why it matters: Minecraft may not sound like an important tool that supports advanced AI research. After all, what could possibly be so important about teaching a machine to play a sandbox game released more than a decade ago? Based on OpenAI's recent efforts, a well-trained Minecraft bot is more relevant to AI advancement than most people might realize.

OpenAI has always focused on artificial intelligence (AI) and machine learning advances that benefit humanity. Recently, the company successfully trained a bot to play Minecraft using more than 70,000 hours of gameplay videos. The achievement is far more than just a bot playing a game. It marks a giant stride forward in advanced machine learning using observation and imitation.

OpenAI's bot is an excellent example of imitation learning (also called "supervised learning") in action. Unlike reinforcement learning, where a learning agent is rewarded after reaching a goal through trial and error, imitation learning trains neural networks to perform specific tasks by watching humans complete them. In this case, OpenAI leveraged available gameplay videos and tutorials to teach their bot to execute complex in-game sequences that would take the typical player approximately 24,000 individual actions to achieve.

Imitation learning requires video inputs to be labeled to provide the context of the action and observed outcome. Unfortunately, this approach can be highly labor intensive, resulting in limited available datasets. This shortage of available datasets ultimately limits the agent's ability to learn via observation.

Rather than muscling through an extensive manual data tagging exercise, OpenAI's research team used a specific approach, known as Video Pre-Training (VPT), to significantly expand the number of labeled videos available. Researchers initially captured 2,000 hours of annotated Minecraft gameplay and used it to train an agent to associate specific actions with specific on-screen outcomes. The resulting model was then used to automatically generate labels for 70,000 hours of previously unlabeled Minecraft content readily available online, providing the Minecraft bot with a much larger dataset to review and imitate.

The entire exercise proves the potential value of available video repositories, such as YouTube, as an AI training resource. Machine learning scientists could use available and properly labeled videos to train AI to conduct specific tasks, ranging from simple web navigation to aiding users with real-life physical needs.

Permalink to story.

 

Uncle Al

Posts: 9,363   +8,581
It is also a useful tool for those wanting to manipulate game sites and hack into users ..... wonder if they will take any of that into consideration? Doubtful, very doubtful
 

ferrellsl

Posts: 102   +107
This seems like such a useless endeavor when remembering all the hype surrounding AI at it's inception. How about AI doing something useful like finding a cure for cancer?
 

Lew Zealand

Posts: 2,277   +2,854
TechSpot Elite
This seems like such a useless endeavor when remembering all the hype surrounding AI at it's inception. How about AI doing something useful like finding a cure for cancer?

Training an AI to monkey-see-monkey-do an easily solvable problem (playing Minecraft) is very slightly different than training one to solve an unsolved, open-ended problem which itself is over 100 different kinds of problems.
 

Hexic

Posts: 1,284   +2,037
TechSpot Elite
This seems like such a useless endeavor when remembering all the hype surrounding AI at it's inception. How about AI doing something useful like finding a cure for cancer?

Because you have to start somewhere, and this progress is not instant, nor can you pick a target down the road that you can't even identify within the scope of the AI test.

We didn't go from the Industrial Revolution to modern technology in 30 years, and the same concept will apply (although most likely at a faster rate) to AI, and it's applications in the future.
 

ShadowDeath

Posts: 206   +204
It's nice to know that when our robot overlords come for us it won't be because of something like in the movies, it'll be because we made them watch 70k hours of Minecraft and then play it.
 
Last edited:

Philip BM

Posts: 6   +2
Correction: Imitation Learning is not "also called Supervised Learning"; it is just one approach that uses Supervised Learning, which means that the training data has been labelled (as opposed to Unsupervised Learning, where the data is unlabelled).
 

Hodor

Posts: 418   +295
Seems that online games are just a very powerful (but cheap) way of training various AIs.

The more we humans fight, the more info AI collects on how to outsmart and defeat us.