You can create scene (1) with your scene (2) as only input and add render delay filter(s) (one is 500ms max) to the scene (1) and use it normally and switch back to the scene (2) if you need to skip few seconds. You'd also probably have to delay the sound the same amount as video in scene (2).
1
1