How to match a lip-sync video with a separate audio track

Hi Members, I hope this is the correct forum for my question. I am a songwriter using only my phone to record video clips of me lip-syncing snippets of my lyrics (the audio from these videos will not be used). I need to make a YouTube video of my recorded song, and I will be lip-syncing only specific parts of it. The remaining parts of the video will be filled with still photos that help explain my storyline.

Since it took many attempts to capture a successful vocal track in the first place, I can't recreate that perfect take on the fly while filming live video. Many songwriters opt to lip-sync against their original vocal track(s), so is there a standard procedure for doing this successfully? In other words, how do I time/sync the lip movements in the video to the separate audio track?

I understand that during the video shoot I need to either wear headphones or turn up the original track on my monitors (depending on the look I'm aiming for in the video) so that I can hear the original track, as this helps with the timing. Thank you.