Learning to predict human behaviour in crowded scenes

نویسندگان

  • Alexandre Alahi
  • Vignesh Ramanathan
  • Kratarth Goel
  • Alexandre Robicquet
  • Amir Abbas Sadeghian
  • Li Fei-Fei
  • Silvio Savarese
چکیده

Humans are much more predictable in their transit patterns than we expect. In the presence of su cient observations, it has been shown that our mobility is highly predictable even at a city-scale level [1]. The location of a person at any given time can be predicted with an average accuracy of 93% supposing 3 km of uncertainty. How about at finer resolutions such as in shopping malls, in airports, or within train terminals for safety or resource optimization? What are the relevant cues to best predict human behavior within a margin of few centimeters? Recently, Kitani et al. [2] showed that scene semantics provide strong cues for forecasting pedestrians’ trajectories. Helbing et al. [3, 4] also showed that our mobility is influenced by our neighbors, either consciously, e.g., by relatives or friends, or even unconsciously, e.g., by following an individual to facilitate navigation. More broadly, when humans walk in a crowded public space such as a train terminal, mall, or city centers, they obey a large number of (unwritten) common sense rules and comply with social conventions. For instance, as they consider where to move next, they respect personal space and yield right-of-way. The ability to model these rules and use them to understand and predict human motion in complex real world environments is extremely valuable for a wide range of applications from the deployment of socially-aware robots [5] to the design of intelligent tracking systems [6] in smart environments. In this chapter, we present two families of methods to forecast human trajectories in crowded environments. The first one is based on the popular Social Forces model [3] where the causalities behind human navigation is hand-designed by a set of functions that have been carefully chosen based on our understanding of physics underlying social behaviour. The second method is a fully data-driven approach based on Recurrent Neural Networks [7] that does not impose any hand-designed functions or explicit mobility based constraints. The causality behind human mobility is an interplay between both observable and nonobservable cues (e.g., intentions). Humans have the innate ability to “read” one another. When they need to avoid each other, there is an implicit cooperation on where to move next. They have the ability to get along well with each other by preserving a personal distance. These capabilities are often referred to as Social Intelligence [8]. Any forecasting method needs to infer the same behaviors to develop socially-aware intelligent systems. This requires

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Online multiple people tracking-by-detection in crowded scenes

Multiple people detection and tracking is a challenging task in real-world crowded scenes. In this paper, we have presented an online multiple people tracking-by-detection approach with a single camera. We have detected objects with deformable part models and a visual background extractor. In the tracking phase we have used a combination of support vector machine (SVM) person-specific classifie...

متن کامل

Contextual anomaly detection in crowded surveillance scenes

This work addresses the problem of detecting human behavioural anomalies in crowded surveillance environments. We focus in particular on the problem of detecting subtle anomalies in a behaviourally heterogeneous surveillance scene. To reach this goal we implement a novel unsupervised context-aware process. We propose and evaluate a method of utilising social context and scene context to improve...

متن کامل

Spatio-Temporal Motion Pattern Modeling of Extremely Crowded Scenes

The abundance of video surveillance systems has created a dire need for computational methods that can assist or even replace human operators. Research in this field, however, has yet to tackle an important real-world scenario: extremely crowded scenes. The excessive amount of people and their activities in extremely crowded scenes present unique challenges to motion-based video analysis. In th...

متن کامل

ADABeV: Automatic Detection of Abnormal Behavior in Video-surveillance

Intelligent Video-Surveillance (IVS) systems are being more and more popular in security applications. The analysis and recognition of abnormal behaviours in a video sequence has gradually drawn the attention in the field of IVS, since it allows filtering out a large number of useless information, which guarantees the high efficiency in the security protection, and save a lot of human and mater...

متن کامل

Tracking Using Motion Patterns for Very Crowded Scenes

This paper proposes Motion Structure Tracker (MST) to solve the problem of tracking in very crowded structured scenes. It combines visual tracking, motion pattern learning and multi-target tracking. Tracking in crowded scenes is very challenging due to hundreds of similar objects, cluttered background, small object size, and occlusions. However, structured crowded scenes exhibit clear motion pa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017