DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model With Self-Generated Cross-Modal Alignment
National Taiwan University · National Taipei University · +2 more institutions
Indexed incrossref
Abstract
No abstract available for this paper.
Citation impact
4
total citations
- FWCI
- 122.92
- Percentile
- 100%
- References
- 72
Too recent for citation history.
Authors
28- KCK C LuCorresponding
National Taiwan University, National Taipei University
- ZCZhehuai Chen
Nvidia (United States)
- SFSzu‐Wei Fu
Nvidia (United States)
- CHChao-Han Huck Yang
National Taiwan University, Nvidia (United States), National Taipei University
- SHSung-Feng Huang
Nvidia (United States)
Topics & keywords
Keywords
- Language model
- Audio signal processing
- Context model
- Speech processing
- Audio analyzer
- Natural language
No related works found for this paper.