Scientists have used the LRS2 dataset, which contains about 50,000 individual sentences spoken by the BBC announcers, as well as the CMLR dataset, the most comprehensive set for teaching neural networks to read lips in Mandarin. The database of the latter contains about 100 thousand offers from CNTV.