Generate facial animation from recorded speech