VIPL AVSU

VIPL-AVSU-Group Public

Collection of works from VIPL-AVSU

CAS-VSR-S101 Public

CAS-VSR-S101: A large-scale Mandarin dataset from TV broadcasts for audio-visual speech research

learn-an-effective-lip-reading-model-without-pains Public

PyTorch code and model for "Learn an Effective Lip Reading Model without Pains" (https://arxiv.org/abs/2011.07557), which reaches state-of-the-art performance on the LRW-1000 dataset.

Python 165 38

CAS-VSR-MOV20 Public

CAS-VSR-MOV20: A challenging dataset for Chinese visual speech recognition, consisting of video clips from 20 movies.

3

LRW1000--CAS-VSR-W1k Public

DenseNet3D model from "LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild" (https://arxiv.org/abs/1810.06990).

Python 118 20

CAS-VSR-S68 Public

CAS-VSR-S68: A dataset for lip reading with unseen speakers, spanning 68 hours of news broadcasts.

7
