Skip to content

Commit 900a607

Browse files
Wei-Ning Hsufacebook-github-bot
Wei-Ning Hsu
authored andcommitted
add timit w2vu recipe (#1991)
Summary: ## What does this PR do? Add TIMIT data preparation scripts for wav2vec-U Pull Request resolved: fairinternal/fairseq-py#1991 Reviewed By: alexeib Differential Revision: D29284481 Pulled By: wnhsu fbshipit-source-id: dccd75159a9de4f3cd95f9e4a90ce4bdf9264f2b
1 parent e47a4c8 commit 900a607

File tree

10 files changed

+14373
-0
lines changed

10 files changed

+14373
-0
lines changed

examples/wav2vec/unsupervised/README.md

+10
Original file line numberDiff line numberDiff line change
@@ -48,6 +48,16 @@ The fifth argument is which phonemizer to use. Supported values are [espeak](htt
4848

4949
Pre-trained fasttext LID models can be downloaded [here](https://fasttext.cc/docs/en/language-identification.html).
5050

51+
### Prepare TIMIT data
52+
TIMIT transcripts include silence. Therefore VAD is not used for audio preprocessing, and we do not wrap transcripts with silences or insert random silence in between words.
53+
54+
To prepare TIMIT data for both the matched an unmatched setup:
55+
```shell
56+
bash scripts/prepare_timit.sh /dir/to/timit/raw/data /output/dir /path/to/wav2vec2/model.pt
57+
```
58+
59+
Note that we assume the TIMIT distribution with capitalized directories and filenames are used (e.g., `TRAIN/DR1/FCJF0/SA1.PHN`).
60+
5161
## Generative adversarial training (GAN)
5262

5363
We then use a GAN model to build a first unsupervised ASR model. The data preparation above of both speech features and text data is a necessary procedure that enables the generator to match speech to text in an unsupervised way.
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,192 @@
1+
FDHC0_SI1559
2+
FDHC0_SI2189
3+
FDHC0_SI929
4+
FDHC0_SX119
5+
FDHC0_SX209
6+
FDHC0_SX29
7+
FDHC0_SX299
8+
FDHC0_SX389
9+
FELC0_SI1386
10+
FELC0_SI2016
11+
FELC0_SI756
12+
FELC0_SX126
13+
FELC0_SX216
14+
FELC0_SX306
15+
FELC0_SX36
16+
FELC0_SX396
17+
FJLM0_SI1043
18+
FJLM0_SI1673
19+
FJLM0_SI2303
20+
FJLM0_SX143
21+
FJLM0_SX233
22+
FJLM0_SX323
23+
FJLM0_SX413
24+
FJLM0_SX53
25+
FMGD0_SI1564
26+
FMGD0_SI2194
27+
FMGD0_SI934
28+
FMGD0_SX124
29+
FMGD0_SX214
30+
FMGD0_SX304
31+
FMGD0_SX34
32+
FMGD0_SX394
33+
FMLD0_SI2185
34+
FMLD0_SI822
35+
FMLD0_SI925
36+
FMLD0_SX115
37+
FMLD0_SX205
38+
FMLD0_SX25
39+
FMLD0_SX295
40+
FMLD0_SX385
41+
FNLP0_SI1308
42+
FNLP0_SI1938
43+
FNLP0_SI678
44+
FNLP0_SX138
45+
FNLP0_SX228
46+
FNLP0_SX318
47+
FNLP0_SX408
48+
FNLP0_SX48
49+
FPAS0_SI1272
50+
FPAS0_SI2204
51+
FPAS0_SI944
52+
FPAS0_SX134
53+
FPAS0_SX224
54+
FPAS0_SX314
55+
FPAS0_SX404
56+
FPAS0_SX44
57+
FPKT0_SI1538
58+
FPKT0_SI2168
59+
FPKT0_SI908
60+
FPKT0_SX188
61+
FPKT0_SX278
62+
FPKT0_SX368
63+
FPKT0_SX8
64+
FPKT0_SX98
65+
MBPM0_SI1577
66+
MBPM0_SI1584
67+
MBPM0_SI947
68+
MBPM0_SX137
69+
MBPM0_SX227
70+
MBPM0_SX317
71+
MBPM0_SX407
72+
MBPM0_SX47
73+
MCMJ0_SI1094
74+
MCMJ0_SI464
75+
MCMJ0_SI602
76+
MCMJ0_SX104
77+
MCMJ0_SX14
78+
MCMJ0_SX194
79+
MCMJ0_SX284
80+
MCMJ0_SX374
81+
MDAB0_SI1039
82+
MDAB0_SI1669
83+
MDAB0_SI2299
84+
MDAB0_SX139
85+
MDAB0_SX229
86+
MDAB0_SX319
87+
MDAB0_SX409
88+
MDAB0_SX49
89+
MGRT0_SI1450
90+
MGRT0_SI2080
91+
MGRT0_SI820
92+
MGRT0_SX10
93+
MGRT0_SX100
94+
MGRT0_SX190
95+
MGRT0_SX280
96+
MGRT0_SX370
97+
MJDH0_SI1354
98+
MJDH0_SI1984
99+
MJDH0_SI724
100+
MJDH0_SX184
101+
MJDH0_SX274
102+
MJDH0_SX364
103+
MJDH0_SX4
104+
MJDH0_SX94
105+
MJLN0_SI1449
106+
MJLN0_SI2079
107+
MJLN0_SI819
108+
MJLN0_SX189
109+
MJLN0_SX279
110+
MJLN0_SX369
111+
MJLN0_SX9
112+
MJLN0_SX99
113+
MJMP0_SI1535
114+
MJMP0_SI1791
115+
MJMP0_SI905
116+
MJMP0_SX185
117+
MJMP0_SX275
118+
MJMP0_SX365
119+
MJMP0_SX5
120+
MJMP0_SX95
121+
MKLT0_SI1213
122+
MKLT0_SI1843
123+
MKLT0_SI583
124+
MKLT0_SX133
125+
MKLT0_SX223
126+
MKLT0_SX313
127+
MKLT0_SX403
128+
MKLT0_SX43
129+
MLLL0_SI1363
130+
MLLL0_SI1993
131+
MLLL0_SI733
132+
MLLL0_SX103
133+
MLLL0_SX13
134+
MLLL0_SX193
135+
MLLL0_SX283
136+
MLLL0_SX373
137+
MLNT0_SI1574
138+
MLNT0_SI1902
139+
MLNT0_SI642
140+
MLNT0_SX102
141+
MLNT0_SX12
142+
MLNT0_SX192
143+
MLNT0_SX282
144+
MLNT0_SX372
145+
MNJM0_SI1580
146+
MNJM0_SI2210
147+
MNJM0_SI950
148+
MNJM0_SX140
149+
MNJM0_SX230
150+
MNJM0_SX320
151+
MNJM0_SX410
152+
MNJM0_SX50
153+
MPAM0_SI1189
154+
MPAM0_SI1819
155+
MPAM0_SI1961
156+
MPAM0_SX109
157+
MPAM0_SX19
158+
MPAM0_SX199
159+
MPAM0_SX289
160+
MPAM0_SX379
161+
MTAS1_SI1473
162+
MTAS1_SI2098
163+
MTAS1_SI838
164+
MTAS1_SX118
165+
MTAS1_SX208
166+
MTAS1_SX28
167+
MTAS1_SX298
168+
MTAS1_SX388
169+
MTLS0_SI1370
170+
MTLS0_SI2000
171+
MTLS0_SI740
172+
MTLS0_SX110
173+
MTLS0_SX20
174+
MTLS0_SX200
175+
MTLS0_SX290
176+
MTLS0_SX380
177+
MWBT0_SI1553
178+
MWBT0_SI2183
179+
MWBT0_SI923
180+
MWBT0_SX113
181+
MWBT0_SX203
182+
MWBT0_SX23
183+
MWBT0_SX293
184+
MWBT0_SX383
185+
MWEW0_SI1361
186+
MWEW0_SI1991
187+
MWEW0_SI731
188+
MWEW0_SX101
189+
MWEW0_SX11
190+
MWEW0_SX191
191+
MWEW0_SX281
192+
MWEW0_SX371

0 commit comments

Comments
 (0)