Skip to comments.Teaching self-driving cars to predict pedestrian movement
Posted on 02/12/2019 10:27:32 AM PST by ETL
By zeroing in on humans' gait, body symmetry and foot placement, University of Michigan researchers are teaching self-driving cars to recognize and predict pedestrian movements with greater precision than current technologies.
Data collected by vehicles through cameras, LiDAR and GPS allow the researchers to capture video snippets of humans in motion and then recreate them in 3-D computer simulation. With that, they've created a "biomechanically inspired recurrent neural network" that catalogs human movements.
With it, they can predict poses and future locations for one or several pedestrians up to about 50 yards from the vehicle. That's at about the scale of a city intersection.
"Prior work in this area has typically only looked at still images. It wasn't really concerned with how people move in three dimensions," said Ram Vasudevan, U-M assistant professor of mechanical engineering. "But if these vehicles are going to operate and interact in the real world, we need to make sure our predictions of where a pedestrian is going doesn't coincide with where the vehicle is going next."
Equipping vehicles with the necessary predictive power requires the network to dive into the minutiae of human movement: the pace of a human's gait (periodicity), the mirror symmetry of limbs, and the way in which foot placement affects stability during walking.
Much of the machine learning used to bring autonomous technology to its current level has dealt with two dimensional imagesstill photos. A computer shown several million photos of a stop sign will eventually come to recognize stop signs in the real world and in real time.
But by utilizing video clips that run for several seconds, the U-M system can study the first half of the snippet to make its predictions, and then verify the accuracy with the second half.
"Now, we're training the system to recognize motion and making predictions of not just one single thingwhether it's a stop sign or notbut where that pedestrian's body will be at the next step and the next and the next," said Matthew Johnson-Roberson, associate professor in U-M's Department of Naval Architecture and Marine Engineering.
To explain the kind of extrapolations the neural network can make, Vasudevan describes a common sight.
"If a pedestrian is playing with their phone, you know they're distracted," Vasudevan said. "Their pose and where they're looking is telling you a lot about their level of attentiveness. It's also telling you a lot about what they're capable of doing next."
The results have shown that this new system improves upon a driverless vehicle's capacity to recognize what's most likely to happen next.
"The median translation error of our prediction was approximately 10 cm after one second and less than 80 cm after six seconds. All other comparison methods were up to 7 meters off," Johnson-Roberson said. "We're better at figuring out where a person is going to be."
To rein in the number of options for predicting the next movement, the researchers applied the the physical constraints of the human bodyour inability to fly or our fastest possible speed on foot.
To create the dataset used to train U-M's neural network, researchers parked a vehicle with Level 4 autonomous features at several Ann Arbor intersections. With the car's cameras and LiDAR facing the intersection, the vehicle could record multiple days of data at a time.
Researchers bolstered that real-world, "in the wild" data from traditional pose data sets captured in a lab. The result is a system that will raise the bar for what driverless vehicles are capable of.
"We are open to diverse applications and exciting interdisciplinary collaboration opportunities, and we hope to create and contribute to a safer, healthier, and more efficient living environment," said U-M research engineer Xiaoxiao Du.
I don’t see any mention of how the neural networks do at the problem of finding the people, which they have to do before they can start making decision based on analysis of gait.
how do they take into acount the 3:00 am drunks....
You know, just when I thought we stopped talking about these stupid machines, you bring ‘em back in.
They also need to factor in humans on their cell phones. Texting and Talking.
That raises the probability that a human will walk stupidly into oncoming traffic by a factor of 1000.
Russian Dash Cams on Youtube.
lol looks like a cop turns around and chases someone for ‘unsafe driving’ lol
Doesn’t U of M have a “Diversity” department budget of $79 million? No respect from me, fwiw.
Humans do random things.
If they are staring at their phones and walking in the street they are fair game.
Disclaimer: Opinions posted on Free Republic are those of the individual posters and do not necessarily represent the opinion of Free Republic or its management. All materials posted herein are protected by copyright law and the exemption for fair use of copyrighted works.