Commit a332081

[doc] add new s0 aishell non-streaming result (#143) (#144)
Co-authored-by: Di Wu <1176705630@qq.com>
1 parent b8dad2b commit a332081

3 files changed: 23 additions & 23 deletions

examples/aishell/s0/README.md

Lines changed: 16 additions & 15 deletions
@@ -3,17 +3,17 @@
 ## Conformer Result
 
 * Feature info: using fbank feature, dither=0, cmvn, speed perturb
-* Training info: lr 0.002, batch size 16, 8 gpu, acc_grad 4, 200 epochs, dither 0.0
-* Decoding info: ctc_weight 0.6, average_num 30
-* Git hash: 132954a9ce27d0381ed3879c4f43cc158860167e
-* Model link: http://mobvoi-speech-public.ufile.ucloud.cn/public/wenet/aishell/20210116_conformer_exp.tar.gz
+* Training info: lr 0.002, batch size 18, 4 gpu, acc_grad 4, 240 epochs, dither 0.1
+* Decoding info: ctc_weight 0.5, average_num 20
+* Git hash: 919f07c4887ac500168ba84b39b535fd8e58918a
+* Model link: http://mobvoi-speech-public.ufile.ucloud.cn/public/wenet/aishell/20210204_conformer_exp.tar.gz
 
 | decoding mode          | CER  |
 |------------------------|------|
-| attention decoder      | 5.36 |
-| ctc greedy search      | 5.14 |
-| ctc prefix beam search | 5.14 |
-| attention rescoring    | 4.77 |
+| attention decoder      | 5.18 |
+| ctc greedy search      | 4.94 |
+| ctc prefix beam search | 4.94 |
+| attention rescoring    | 4.61 |
 
 ## Unified Conformer Result
 
@@ -33,16 +33,17 @@
 ## Transformer Result
 
 * Feature info: using fbank feature, dither=0, with cmvn, no speed perturb.
-* Training info: lr 0.002, batch size 16, 8 gpu, acc_grad 1, 120 epochs, dither 0.0
-* Git hash: fb8e0f8c12b5d547fc22e62365e1e114f059c609
-* Model link: http://mobvoi-speech-public.ufile.ucloud.cn/public/wenet/aishell/20210120_transformer_exp.tar.gz
+* Training info: lr 0.002, batch size 26, 4 gpu, acc_grad 4, 240 epochs, dither 0.1
+* Decoding info: ctc_weight 0.5, average_num 20
+* Git hash: 919f07c4887ac500168ba84b39b535fd8e58918a
+* Model link: http://mobvoi-speech-public.ufile.ucloud.cn/public/wenet/aishell/20210204_transformer_exp.tar.gz
 
 | decoding mode          | CER  |
 |------------------------|------|
-| attention decoder      | 5.76 |
-| ctc greedy search      | 6.21 |
-| ctc prefix beam search | 6.21 |
-| attention rescoring    | 5.47 |
+| attention decoder      | 5.69 |
+| ctc greedy search      | 5.92 |
+| ctc prefix beam search | 5.91 |
+| attention rescoring    | 5.30 |
 
 ## Unified Transformer Result
 
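To put the README numbers above in perspective, the following sketch computes the relative CER reduction for each decoding mode. The CER values are copied from the old and new tables in the diff. The helper names (`relative_reduction`, `summarize`) are illustrative and are not part of the repository.

```python
# CER values (%) copied from the README diff: (old, new) per decoding mode.
CER = {
    "conformer": {
        "attention decoder": (5.36, 5.18),
        "ctc greedy search": (5.14, 4.94),
        "ctc prefix beam search": (5.14, 4.94),
        "attention rescoring": (4.77, 4.61),
    },
    "transformer": {
        "attention decoder": (5.76, 5.69),
        "ctc greedy search": (6.21, 5.92),
        "ctc prefix beam search": (6.21, 5.91),
        "attention rescoring": (5.47, 5.30),
    },
}


def relative_reduction(old: float, new: float) -> float:
    """Relative CER reduction in percent; positive means the new result is better."""
    return (old - new) / old * 100.0


def summarize(cer=CER):
    """Return (model, mode, old, new, relative reduction) for every table row."""
    rows = []
    for model, modes in cer.items():
        for mode, (old, new) in modes.items():
            rows.append((model, mode, old, new, relative_reduction(old, new)))
    return rows


if __name__ == "__main__":
    for model, mode, old, new, rel in summarize():
        print(f"{model:12s} {mode:24s} {old:.2f} -> {new:.2f}  ({rel:.1f}% relative)")
```

Every row improves, though the two runs differ in several settings at once (epochs, dither, batch size, GPU count, averaging), so the gain cannot be attributed to any single change from this diff alone.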
examples/aishell/s0/conf/train_conformer.yaml

Lines changed: 3 additions & 3 deletions
@@ -41,10 +41,10 @@ raw_wav: true
 collate_conf:
     # waveform level config
     wav_distortion_conf:
-        wav_dither: 0.0
+        wav_dither: 0.1
         wav_distortion_rate: 0.0
         distortion_methods: []
-    speed_perturb: false
+    speed_perturb: true
     feature_extraction_conf:
         feature_type: 'fbank'
         mel_bins: 80
@@ -74,7 +74,7 @@ dataset_conf:
 
 grad_clip: 5
 accum_grad: 4
-max_epoch: 200
+max_epoch: 240
 log_interval: 100
 
 optim: adam
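The `wav_dither` change from 0.0 to 0.1 can be illustrated with a minimal sketch. It assumes that the value is used as a Kaldi-style dither constant, that is, Gaussian noise scaled by `dither` is added to each waveform sample before feature extraction. That reading is an assumption, not something this diff confirms, and `add_dither` is a hypothetical helper, not a function from the repository.

```python
import random


def add_dither(samples, dither, rng=None):
    """Return a copy of `samples` with Gaussian noise of scale `dither` added.

    Assumption: this mimics Kaldi-style dithering; dither == 0.0 disables it.
    """
    if dither == 0.0:
        return list(samples)
    rng = rng or random.Random()
    # Each sample gets an independent N(0, 1) draw scaled by the dither constant.
    return [s + rng.gauss(0.0, 1.0) * dither for s in samples]


if __name__ == "__main__":
    wave = [0.0, 100.0, -100.0, 50.0]
    print(add_dither(wave, 0.0))                    # unchanged copy
    print(add_dither(wave, 0.1, random.Random(0)))  # slightly perturbed
```

With a constant of 0.1 the perturbation is tiny relative to typical sample amplitudes, so it mainly avoids exact zeros (and the resulting log-of-zero energies) rather than acting as strong augmentation.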

examples/aishell/s0/conf/train_transformer.yaml

Lines changed: 4 additions & 5 deletions
@@ -36,18 +36,17 @@ raw_wav: true
 collate_conf:
     # waveform level config
     wav_distortion_conf:
-        wav_dither: 0.0
+        wav_dither: 0.1
         wav_distortion_rate: 0.0
         distortion_methods: []
-    speed_perturb: false
+    speed_perturb: true
     feature_extraction_conf:
         feature_type: 'fbank'
         mel_bins: 80
         frame_shift: 10
         frame_length: 25
         using_pitch: false
     # spec level config
-    # spec_swap: false
     feature_dither: 0.0 # add dither [-feature_dither,feature_dither] on fbank feature
     spec_aug: true
     spec_aug_conf:
@@ -64,12 +63,12 @@ dataset_conf:
     max_length: 40960
     min_length: 0
     batch_type: 'static' # static or dynamic
-    batch_size: 16
+    batch_size: 26
     sort: true
 
 grad_clip: 5
 accum_grad: 1
-max_epoch: 120
+max_epoch: 240
 log_interval: 100
 
 optim: adam
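The batch size, GPU count, and gradient accumulation settings interact, so a rough way to compare the old and new runs is the number of utterances per optimizer step. The sketch below assumes static batching with plain data parallelism (each GPU processes `batch_size` utterances per forward pass); `effective_batch_size` is an illustrative helper, not repository code. Note that the README reports `acc_grad 4` for the new Transformer run while the YAML context line in this diff still shows `accum_grad: 1`, so both variants are computed.

```python
def effective_batch_size(batch_size: int, num_gpus: int, accum_grad: int) -> int:
    """Utterances contributing to one optimizer step (assumes static batches)."""
    return batch_size * num_gpus * accum_grad


# Conformer: values from the README "Training info" lines.
conformer_old = effective_batch_size(16, 8, 4)
conformer_new = effective_batch_size(18, 4, 4)

# Transformer: the old README line, then the new run under both accumulation values.
transformer_old = effective_batch_size(16, 8, 1)
transformer_new_readme = effective_batch_size(26, 4, 4)  # README: acc_grad 4
transformer_new_yaml = effective_batch_size(26, 4, 1)    # YAML: accum_grad 1

if __name__ == "__main__":
    print("conformer:", conformer_old, "->", conformer_new)
    print("transformer (README acc_grad):", transformer_old, "->", transformer_new_readme)
    print("transformer (YAML accum_grad):", transformer_old, "->", transformer_new_yaml)
```

Which accumulation value was actually used for the reported Transformer result cannot be determined from this diff alone.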
