Home > Article > Technology peripherals > CMU robot dog, standing upside down and going downstairs! Release is open source

CMU robot dog, standing upside down and going downstairs! Release is open source

王林forward: 2023-10-04 11:21:081076browse

There are really too many tricks for robot dogs -

But today is still amazing .

The latest results from CMU allow dogs to directly learn:

High jump twice as long as , long jump, handstand and even Handstand Down the Stairs Without further ado, just show the picture to experience it:

CMU robot dog, standing upside down and going downstairs! Release is open source △ This is the long jump

CMU robot dog, standing upside down and going downstairs! Release is open source △ This is a high jump

CMU robot dog, standing upside down and going downstairs! Release is open source △ Handstand Sa Huaner

CMU robot dog, standing upside down and going downstairs! Release is open source Needs a heavy jump The content written is: △Handstand down the stairs

I have to say, especially the "struggle" in the high jump part makes the dog particularly soulful.

In addition to these s operations, CMU also released several parkour videos,

completely autonomous.

. It’s such a refreshing feeling to step on the ridge, pass through the gap, and cross the slope

CMU robot dog, standing upside down and going downstairs! Release is open source Even if there are some "mistakes" in the middle, It will not affect its immediate progress

CMU robot dog, standing upside down and going downstairs! Release is open source Laughing Rat, and even arranged a

stress test, and the result is of course "Passed"~

CMU robot dog, standing upside down and going downstairs! Release is open source The most amazing thing is that, according to CMU, all of the above extreme operations are completed by a single neural network

After hearing this, Mr. LeCun had to give him a thumbs up.

How to refine such a soul? CMU robot dog, standing upside down and going downstairs! Release is open source

In this tweet, the author analyzes this dog’s technique in detail

First of all, compared with end-to-end walking based on vision, the extreme operation of high jump and long jump is 2 times the body length. Not at the same level at all

After all, any mistake may cause "fatal" consequences

In this regard, CMU adopts

sim2real

Achieve precise foot control and challenge to maximize mechanical advantage. Among them, Gym is used for simulator

Furthermore, handstand. Obviously, walking on two legs is much more difficult than walking on four

However, Carnegie Mellon University's robot dog uses the same basic method to accomplish both tasks at the same time, and is even able to Stairs while maintaining an inverted state

Third, for parkour operations

(the focus of this study)

, the robot dog must

# through precise "eye muscle" coordination ##Decide the way forward by yourself instead of following human instructions. For example, when passing two slopes in succession, it needs to jump up the slope at a very specific angle and then immediately change direction

In order to learn CMU uses the

CMU robot dog, standing upside down and going downstairs! Release is open source MTS

(Mixed Teacher Student) system to teach the robot dog these correct directions. The system will only adopt it if the predicted direction is close to the true valueSpecifically, the system is divided into

two stages

: In the first stage, RL is first used to learn a movement strategy. This process can access some privileged information. In addition to environment parameters and scan points (scandots) , CMU also provides appropriate settings for the robot dog. Some

sign points(waypoints) are provided to guide the general direction.

Then, regularized online adaptation (Regularized Online Adaptation, ROA) is used to train the evaluator to recover environmental information from the observation history.

In the second stage, the strategy is extracted from the scan points (scandots) . The system will independently decide how to move forward based on the strategy and depth information, thus outputting motor commands quickly.

The whole process is like "teachers teach, students learn by analogy"

In addition to this system, because parkour requires a variety of different movements to cross obstacles, a specific design is designed for each obstacle. Reward function is also a headache.

Here, the author chose to formulate a unified and simple inner product reward function for all tasks.

It can automatically generate various rewards and can fully adapt to various terrain shapes