This renders a test string on top of a red video stream and checks if the text is only rendered at the correct timestamps.