In conversational mode user is able to combine 2 or more voices in the same audio generation.