Stable and efficient video segmentation via GAN predicting adjacent frame

Show simple item record

dc.contributor.author Ilnytskyi, Ivan
dc.date.accessioned 2019-02-18T15:58:50Z
dc.date.available 2019-02-18T15:58:50Z
dc.date.issued 2019
dc.identifier.citation Ilnytskyi, Ivan. Stable and efficient video segmentation via GAN predicting adjacent frame : Master Thesis : manuscript / Ivan Ilnytskyi ; Supervisor Pavel Akapian ; Ukrainian Catholic University, Department of Computer Sciences. – Lviv : [s.n.], 2019. – 24 p. : ill. uk
dc.identifier.uri http://er.ucu.edu.ua/handle/1/1328
dc.language.iso en uk
dc.subject GAN predicting adjacent frame uk
dc.subject neural networks uk
dc.subject semantic segmentation problem uk
dc.title Stable and efficient video segmentation via GAN predicting adjacent frame uk
dc.type Preprint uk
dc.status Публікується вперше uk
dc.description.abstracten Analyzing video streams represents a huge problem not only in terms of accuracy and speed, but also consistency of analysis between adjacent frames as videos are consistent due to real-world nature. Jittering effect of predictions is easily noticed by human vision in video semantic segmentation tasks. But it is not usually taken into account by design of algorithms as being suited for single image recognition and lack of easy solution via classical filters. This jittering leads to quite negative human assessment of algorithms while being good at accuracy. In addition it may lead to unstable or conflicting behavior of control systems that use computer vision. We propose the methods of efficient video semantic segmentation that take into account video consistency and can be implemented without annotated video dataset. Some methods require annotated photo only dataset, other methods additionally use generative adversarial network trained on relevant video dataset with no supervision. The solution is relevant for cases when the domain does not contain large annotated video datasets, but there are available annotated photo datasets and significantly large unlabeled videos. We show that using semantic segmentation mask of previous frame as a feature for current frame segmentation improves accuracy and consistency. We achieve best results using the network trained with features obtained from GAN and baseline segmentation network. uk


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search


Browse

My Account