Use GstVideoInfo to store the parsed caps. Remove outsize from the caps parsing code: it is wrong because it does not use the stride given by the driver.
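
To illustrate why a size computed from the caps alone can be wrong, here is a minimal sketch in plain C. It is not the element's actual code. The helper names and the single-plane packed layout are assumptions made for the example, and the sketch does not use the GStreamer API.

```c
#include <stddef.h>

/* Hypothetical helper: frame size derived from the caps alone,
 * assuming rows are tightly packed with no padding. */
static size_t packed_frame_size(size_t width, size_t height,
                                size_t bytes_per_pixel)
{
    return width * height * bytes_per_pixel;
}

/* Hypothetical helper: frame size for a single-plane format when the
 * driver reports its own stride (bytes per row, including padding). */
static size_t strided_frame_size(size_t stride, size_t height)
{
    return stride * height;
}
```

For example, a 1366x768 frame at 4 bytes per pixel has 5464 bytes of pixel data per row. If a driver pads each row to a 64-byte boundary, the stride becomes 5504, so the caps-derived size (4196352 bytes) is smaller than what the driver actually needs (4227072 bytes).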