Fix some stride problems, fix the direct NV12 to NV21 conversion, and implement a direct conversion to YUV444 to save CPU.