Just watch. (You might need to rapidly click/tap the flag for it to switch at the right time. It's supposed to be fully lined up.)
IDK where the audio is from. I drew this. It's in animatic style because that's faster and easier to animate. (Plus, I can't do lip-syncing.)