Commit Graph

41 Commits

Author SHA1 Message Date
sanchit-gandhi 1f7ce4a468 Add Flax weights 2023-02-23 16:31:39 +01:00
Sanchit Gandhi 74d3cae0a5 Update README.md 2023-02-06 17:06:03 +00:00
Sanchit Gandhi dcbf5967d0 Update README.md (#11)
- Update README.md (7e03d0f9bd5a90404b5cb0fb3843d8b3952c4b9c)
2023-01-27 16:05:42 +00:00
Arthur Zucker 57ea0193de Changes for fast tokenizer (#10)
- Add tokenizer.json (124eda19723059d73b3e24d22089ff0266a5c981)


Co-authored-by: Jonatan Kłosko <jonatanklosko@users.noreply.huggingface.co>
2023-01-26 13:50:51 +00:00
Arthur Zucker d964250a15 Update generation_config.json 2023-01-24 14:59:14 +00:00
Arthur Zucker 7dd9e1ca6e Update generation_config.json 2023-01-24 14:57:20 +00:00
Arthur Zucker cad221b554 Update generation_config.json 2023-01-24 14:50:13 +00:00
Arthur Zucker 56db7eed1f Upload config 2023-01-24 14:37:28 +00:00
Sanchit Gandhi 23d5e63e34 Revert: Add "<|startoftranscript|>" to forced decoder ids (#6)
- Revert: Add "<|startoftranscript|>" to forced decoder ids (db578b003807f8d58a7cd90e56b5012bf26dac42)
2022-12-07 14:14:53 +00:00
Sanchit Gandhi e5ed3f5164 Update config.json 2022-12-05 13:33:59 +00:00
Sanchit Gandhi eb2a3e30c3 Force <|startoftranscript|>
Updates the `forced_decoder_ids` to force the `<|startoftranscript|>` token at position 1. This is to match the official Whisper implementation, which always predicts `<|startoftranscript|>` at position 1:
```python
#!pip install git+https://github.com/openai/whisper.git

import whisper
from datasets import load_dataset

import torch

device = "cuda" if torch.cuda.is_available() else "cpu"

model = whisper.load_model("tiny.en").to(device)

tokenizer = whisper.tokenizer.get_tokenizer(False, task="transcribe", language="en")
tokenizer = tokenizer.tokenizer

librispeech = load_dataset("hf-internal-testing/librispeech_asr_dummy", "clean", split="validation")

def to_pad_to_mel(array):
    """Static function which:
        1. Pads/trims a list of audio arrays to a max length of 30s
        2. Computes log-mel filter coefficients from padded/trimmed audio sequences
        Inputs:
            array: list of audio arrays
        Returns:
            input_ids: torch.tensor of log-mel filter bank coefficients
    """
    padded_input = whisper.pad_or_trim(np.asarray(array, dtype=np.float32))
    input_ids = whisper.log_mel_spectrogram(padded_input)
    return input_ids

audio_array = librispeech[0]["audio"]["array"]
log_mel = to_pad_to_mel(audio_array).unsqueeze(0)

tokens = model.generate(log_mel.to(device))[0]
transcript = tokenizer.decode(tokens, skip_special_tokens=False)
print(transcript)
```
**Print Output:**
```
<|startoftranscript|><|notimestamps|> Mr. Quilter is the apostle of the middle classes, and we are glad to
```
2022-12-05 12:28:32 +00:00
Arthur Zucker f3b5e97e6e Update README.md (#3)
- Update README.md (933dd954d44445fa823390fd4ceeeed6049bcc35)


Co-authored-by: Luigi Liu <Luigi@users.noreply.huggingface.co>
2022-11-28 08:32:41 +00:00
Arthur Zucker b8d7ffd4a8 Allow single quotes "'" and hyphens "-" (#4)
- Allow single quotes "'" and hyphens "-" (54f836ce498930b225ff2a6db8c2a40bddece9f8)


Co-authored-by: Sanchit Gandhi <sanchit-gandhi@users.noreply.huggingface.co>
2022-11-28 08:32:13 +00:00
Arthur Zucker 407a3a8d4f Update `max_length` param 2022-10-17 11:09:07 +00:00
Arthur Zucker ea8f5d0f78 Update `return_mask` param 2022-10-17 11:08:52 +00:00
Arthur Zucker 2ac54a8a67 Update README.md 2022-10-10 09:36:23 +00:00
Arthur Zucker 9e20f927b5 Create README.md 2022-10-10 09:36:05 +00:00
Amy Roberts 1f9fe7b01b Add TF weights (#1)
- Add TF weights (7bed60b379d9f0c588b77137f2af66d0f7b09ed8)
2022-10-10 09:35:28 +00:00
Arthur Zucker 215213def0 Upload config 2022-10-05 16:27:20 +00:00
Arthur Zucker c703e332b3 Upload tokenizer 2022-10-05 16:27:18 +00:00
Arthur Zucker f11df2a199 Upload config 2022-10-05 11:33:24 +00:00
Arthur Zucker 2b05ff791b Upload tokenizer 2022-10-05 11:33:23 +00:00
Arthur Zucker 3b1e4bbdd2 Upload tokenizer 2022-10-05 10:11:49 +00:00
Arthur Zucker 950343312d Update config.json 2022-10-05 10:10:39 +00:00
Arthur Zucker 18d92b3976 Upload tokenizer 2022-10-05 09:12:56 +00:00
Arthur Zucker c6a40e32d6 Upload tokenizer 2022-10-05 09:08:51 +00:00
Arthur Zucker 7a93d853ae Upload processor 2022-10-04 22:30:53 +00:00
Arthur Zucker 7dabb1da76 Upload config 2022-10-04 22:30:51 +00:00
Arthur Zucker 8710e8b8b9 Upload processor 2022-10-04 22:26:57 +00:00
Arthur Zucker 47d9e19fb2 Upload config 2022-10-04 22:26:55 +00:00
Arthur Zucker 33560bff06 Upload processor 2022-10-04 21:54:52 +00:00
Arthur Zucker b961dda4c9 Create new file 2022-10-04 21:31:37 +00:00
Arthur Zucker 10c973633b Upload processor 2022-10-04 14:58:23 +00:00
Arthur Zucker d13d0e4bec Upload config 2022-10-04 14:58:21 +00:00
Arthur Zucker 80bf224040 Upload tokenizer 2022-09-27 08:10:54 +00:00
Arthur Zucker c83d640540 Upload WhisperForConditionalGeneration 2022-09-27 08:10:50 +00:00
Arthur Zucker f74ab995ce Upload processor 2022-09-27 07:12:05 +00:00
Arthur Zucker 2ff81621b1 Upload config 2022-09-27 07:12:01 +00:00
Arthur Zucker 40083c0971 Upload processor 2022-09-26 06:57:57 +00:00
Arthur Zucker 09fa5b9e62 Upload model 2022-09-26 06:57:54 +00:00
Arthur Zucker 36ced721c6 initial commit 2022-09-26 06:57:49 +00:00