whisper-tiny.en

Commit Graph

Author	SHA1	Message	Date
Sanchit Gandhi	23d5e63e34	Revert: Add "<\|startoftranscript\|>" to forced decoder ids (#6 ) - Revert: Add "<\|startoftranscript\|>" to forced decoder ids (db578b003807f8d58a7cd90e56b5012bf26dac42)	2022-12-07 14:14:53 +00:00
Sanchit Gandhi	e5ed3f5164	Update config.json	2022-12-05 13:33:59 +00:00
Sanchit Gandhi	eb2a3e30c3	Force <\|startoftranscript\|> Updates the `forced_decoder_ids` to force the `<\|startoftranscript\|>` token at position 1. This is to match the official Whisper implementation, which always predicts `<\|startoftranscript\|>` at position 1: ```python #!pip install git+https://github.com/openai/whisper.git import whisper from datasets import load_dataset import torch device = "cuda" if torch.cuda.is_available() else "cpu" model = whisper.load_model("tiny.en").to(device) tokenizer = whisper.tokenizer.get_tokenizer(False, task="transcribe", language="en") tokenizer = tokenizer.tokenizer librispeech = load_dataset("hf-internal-testing/librispeech_asr_dummy", "clean", split="validation") def to_pad_to_mel(array): """Static function which: 1. Pads/trims a list of audio arrays to a max length of 30s 2. Computes log-mel filter coefficients from padded/trimmed audio sequences Inputs: array: list of audio arrays Returns: input_ids: torch.tensor of log-mel filter bank coefficients """ padded_input = whisper.pad_or_trim(np.asarray(array, dtype=np.float32)) input_ids = whisper.log_mel_spectrogram(padded_input) return input_ids audio_array = librispeech[0]["audio"]["array"] log_mel = to_pad_to_mel(audio_array).unsqueeze(0) tokens = model.generate(log_mel.to(device))[0] transcript = tokenizer.decode(tokens, skip_special_tokens=False) print(transcript) ``` Print Output: ``` <\|startoftranscript\|><\|notimestamps\|> Mr. Quilter is the apostle of the middle classes, and we are glad to ```	2022-12-05 12:28:32 +00:00
Arthur Zucker	b8d7ffd4a8	Allow single quotes "'" and hyphens "-" (#4 ) - Allow single quotes "'" and hyphens "-" (54f836ce498930b225ff2a6db8c2a40bddece9f8) Co-authored-by: Sanchit Gandhi <sanchit-gandhi@users.noreply.huggingface.co>	2022-11-28 08:32:13 +00:00
Arthur Zucker	407a3a8d4f	Update `max_length` param	2022-10-17 11:09:07 +00:00
Arthur Zucker	215213def0	Upload config	2022-10-05 16:27:20 +00:00
Arthur Zucker	f11df2a199	Upload config	2022-10-05 11:33:24 +00:00
Arthur Zucker	950343312d	Update config.json	2022-10-05 10:10:39 +00:00
Arthur Zucker	47d9e19fb2	Upload config	2022-10-04 22:26:55 +00:00
Arthur Zucker	d13d0e4bec	Upload config	2022-10-04 14:58:21 +00:00
Arthur Zucker	c83d640540	Upload WhisperForConditionalGeneration	2022-09-27 08:10:50 +00:00
Arthur Zucker	2ff81621b1	Upload config	2022-09-27 07:12:01 +00:00
Arthur Zucker	09fa5b9e62	Upload model	2022-09-26 06:57:54 +00:00

13 Commits