
Visual Speech Recognition for Multiple Languages

@mpc001 released this 09 Sep 15:03
· 11 commits to master since this release

This is the repository for Visual Speech Recognition for Multiple Languages, the successor to End-to-End Audio-Visual Speech Recognition with Conformers. The repository is built mainly on ESPnet. We provide state-of-the-art algorithms for end-to-end visual speech recognition in the wild.