TensorBoard
Safetensors
English
long_speech_qwen2audio
File size: 741 Bytes
08d3a65
 
 
 
 
 
 
 
 
 
 
 
e54daf7
08d3a65
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
---
license: apache-2.0
datasets:
- ICTNLP/LongSpeech-Eval
language:
- en
metrics:
- accuracy
base_model:
- Qwen/Qwen2-Audio-7B-Instruct
---

# The model for the paper '[FastLongSpeech: Enhancing Large Speech-Language Models for Efficient Long-Speech Processing](https://arxiv.org/abs/2507.14815)'

## Usage

Please refer to [Github Page](https://github.com/ictnlp/FastLongSpeech)

### Requirements

We suggest to run with Python 3.10.
Examples of usage:
```
git clone https://github.com/ictnlp/FastLongSpeech.git
cd transformers-main
pip install -e .
pip install deepspeed sentencepiece librosa 
```

## Evaluation Datasets
- https://huggingface.co/datasets/ICTNLP/LongSpeech-Eval
## Github Pages
- https://github.com/ictnlp/FastLongSpeech