Add systemstream=false to caps, otherwise the decoder may be picked for MPEG-PS files. Also parsed=true, as video toolbox expects entire frame in VTDecompressionSessionDecodeFrame. https://bugzilla.gnome.org/show_bug.cgi?id=770049