• TheTechOasis
  • Posts
  • Google's Adorable AI Robots Show Us The Future of Robotics... and Sports?

Google's Adorable AI Robots Show Us The Future of Robotics... and Sports?

šŸ TheTechOasis šŸ

šŸ¤– This Weekā€™s AI Insight šŸ¤–

For all the impressive features that ChatGPT, Stable Diffusion, or DINO have exposed to the world, they all have one common ā€œlimitationā€:

They are bounded to the digital world, with no possibility to impact the physical realmā€¦ at least not explicitly.

But now weā€™re one step closer to achieving one of the grand dreams of AI scientists:

ā

The development of embodied intelligence

That is, the creation of agents that can act in the physical world with the agility, dexterity, and understanding of animals and humans.

In the not-so-distant future, this could not only fully automate all our production lines, but also take up the form of robot assistants, robot soldiersā€¦ or even robot lovers.

ā€œRobot in the style of Hito Steyerlā€ Source: Author with Diffusion model

And the latest demonstration of potential comes with soccer (or football as we call it in Europe) robots that will make you smile, laugh, and also reflect on how society is not prepared, in the slightest, for whatā€™s coming.

The Fascinating Concept of ā€˜Rewardsā€™

The main characters of our story are sports robots trained using what we describe as Deep Reinforcement Learning, or Deep RL.

In a nutshell, this is the idea of using neural networks, that same concept behind ChatGPT, Dolly, or even Teslaā€™s Autopilot system, but applied to RL.

But what is RL?

While NLP models like ChatGPT work by processing natural language and generating a textual or image response, and CV models like SAM learn to process images and identify elements and infer patterns from them, RL teaches AI agents to interact with their environment.

In laymanā€™s terms, they learn the best way to execute the set of actions that will yield the maximum reward possible.

To further understand, you can think of training RL agents as training your dog puppy.

By rewarding him/her with treats for good behavior and ā€œpunishingā€ without them otherwise, the dog learns the pattern of actions (the policy) that will yield him/her the greatest amount of treats (rewards) possible.

But how do we teach robots to play sports?

Itā€™s all about winning

As you know, the aforementioned sport is not only about kicking a ball; the robot needs to learn to get up, move, shoot, and, in 1v1 situations, defend its goal.

Hence, the DeepMind team assembled a set of rewards that would allow the robots to learn to play it by:

  • Rewarding it for scoring, getting up, and defending

  • Penalizing it when receiving goals, or staying on the ground

To put this into practice, Google DeepMind leveraged a very powerful concept thatā€™s gaining a lot of traction recently in the world of AI, distillation.

Like Father, Like Son

The researchers first trained two robots:

  • A scoring teacher, a robot that learns to score goals effectively

  • A get-up teacher, a robot that learns to get up from different poses effectively

Next, they should this process of distillation, which is getting a student robot to learn from them.

To do so, they defined a training objective that first forced the robot to ā€œcloneā€ actions from their teachers.

But wait, why not just learn both things from the beginning?

By applying distillation, a smaller model can learn to act like larger models very efficiently, considerably reducing costs while achieving similar results.

Then, once the student equaled its teachersā€™ abilities to score and get up, they limited their influence and made it play against copies of itself in previous versions, something known as self-play.

By making the student play worse versions of itself in a 1v1 environment, it developed new emergent behaviors apart from getting up and scoring, like:

  • Defending its goal (by penalizing goal-receiving)

  • Running (it increases the chances of winning)

  • Understanding the opponentā€™s movements and reacting to them

  • Getting up quickly, as being on the ground is a surefire way to lose

And the results?

Well, the following videos blew my mind and will blow yours.

Resilient and Committed to Winning

I could bore you with numbers as to how superior are these robots to previous research on this topic (156% faster, 63% quicker to get up, or 24% better at scoring), but videos are better than anything I can explain to you:

But this isnā€™t about sports, and you know that.

From Sports to Love

The field of robotics is poised to impact the labor market as much as anything we can think of.

  • It has the power to fully automate factories, killing millions of jobs.

  • It has the power of creating machines that interact with the real world as well as you do, with all the amazing use cases and risks that entails.

Soon, Black Mirror could be seen as a predictor of our future more than a series portraying dystopian realities.

Needless to say, AI is converging quickly into making these futuristic science fiction stories lose the fiction part.

Even scarier, humans are naturally drawn to become romantically entrenched with machines if these anthropomorphize.

To me, human-machine relationships seem to be a matter of when, not if, and I donā€™t think we really want that.

šŸ‘¾Top AI news for the weekšŸ‘¾

šŸ¤Ø According to a Google employee, Microsoft wonā€™t win the AI raceā€¦ and neither will Google

ā˜£ļø The biggest risk with GenAI is not jobs, itā€™s hyper-personalized advertising

šŸ¶ Snoop Dogg shares his alarming views on AI in a hilarious way

šŸ’ø Now we know how much it cost to build ChatGPTā€¦ and it was huge

šŸ¤© The biggest breakthrough in programming in years is Mojo

šŸ§ While ChatGPT has surface-level conversations, Pi goes deep and gets to know you