• Deep generative appearance modeling in visual tracking
Client: Javna agencija za raziskovalno dejavnost RS (Slovenian Research Agency)
Project duration: 2019 - 2021
  • Description

Predicting object state in video streams is one of the fundamental challenges of computer vision, with numerous application domains. Knowing where an object is at a given point in time can help autonomous vehicles avoid obstacles, raise an alert when elderly people fall at home, analyze performance in professional sport, study the behaviour of animals, or help robots actively learn new concepts. These are just a few scenarios in which visual tracking methods can be used extensively. Yet numerous open challenges have to be solved to develop a general visual tracking method capable of handling the scenarios mentioned above.

Visual object tracking without prior information about the object is an ill-posed problem: for an arbitrary object, it cannot be solved by an on-line learning method alone. Humans, on the other hand, can solve complex tracking scenarios by relying on a massive amount of knowledge about the world accumulated through life-long learning. This knowledge contains information about object categories and their possible deformations and appearance variations, which is crucial for retaining a stable representation of the tracked object. In machine learning terms, this knowledge is contained in a generative model of the object's appearance. The challenge we will address in this project is the robust design, training, and application of such a generative model in a visual tracking scenario. We believe that a generative appearance model of the entire object is a crucial step towards grounding visual object tracking in the high-level concepts behind raw pixel values.

Research activity

Engineering sciences and technologies

Research organisations <https://www.sicris.si/public/jqm/prj.aspx?lang=eng&opdescr=search&opt=2&subopt=403&code1=cmn&code2=auto&psize=10&hits=1&page=1&count=&search_term=Globoko%20generativno%20modeliranje%20izgleda%20v%20vizualnem%20sledenju&id=18003&slng=&order_by=>

Researchers <https://www.sicris.si/public/jqm/prj.aspx?lang=eng&opdescr=search&opt=2&subopt=402&code1=cmn&code2=auto&psize=10&hits=1&page=1&count=&search_term=Globoko%20generativno%20modeliranje%20izgleda%20v%20vizualnem%20sledenju&id=18003&slng=&order_by=>

Project phases and their realization

Work packages: The work will be divided into four work packages; the first three address the project's scientific goals:

WP1: development of generative deep neural network models suitable for appearance modelling of many object classes,

WP2: application of developed models to visual tracking,

WP3: training and testing data acquisition and generation,

WP4: dissemination.

Project bibliographic references <https://www.vicos.si/Projects/Gaptrack>

Financed by

Slovenian Research Agency