2024 Reading Group
2024.1.16 ASRU 2023 Paper List
- Cross-Modal Alignment With Optimal Transport For CTC-Based ASR
- WaveNeXt ConvNeXt based fast neural vocoder without iSTFT layer
- Transduce and Speak: Neural Transducer for Text-to-Speech with Semantic Token Prediction
- Whisper-Slu: Extending a Pretrained Speech-to-Text Transformer for Low Resource Spoken Language Understanding
2024.1.30 ASRU 2023 Paper List
- Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition
- CTC Blank Triggered Dynamic Layer-Skipping for Efficient CTC-based Speech Recognition
- MelHuBERT: A simplified HuBERT on Mel spectrograms