Press Space to Add a Phoneme, press D to remove one.
I had an idea for an audio file type which would just store the phoneme instead of the whole audio so it could heavily compress human voice and I made a demo.