How to use openai/whisper-base with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("automatic-speech-recognition", model="openai/whisper-base")

# Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq

processor = AutoProcessor.from_pretrained("openai/whisper-base")
model = AutoModelForSpeechSeq2Seq.from_pretrained("openai/whisper-base")
This is what I'm doing in a Node/Express app. I don't know whether it's possible to get timestamps back with the transcription, and that is a must for my project. Please help. I can't switch to a Python backend :(
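For reference, here is a minimal sketch of how timestamps come back from the Python pipeline shown above, assuming the `return_timestamps` argument of the Transformers speech-recognition pipeline. The helper names `format_chunks` and `transcribe_with_timestamps` are made up for illustration. It does not solve the Node side by itself, but Transformers.js appears to accept a similarly named `return_timestamps` option and return the same `chunks` shape; treat that as an assumption to check against its documentation.

```python
from typing import Any, Optional


def _label(seconds: Optional[float]) -> str:
    # A timestamp can be None, e.g. when the audio is cut off mid-segment.
    return f"{seconds:.2f}" if seconds is not None else "?"


def format_chunks(result: dict[str, Any]) -> list[str]:
    """Turn a pipeline result with "chunks" into "[start - end] text" lines."""
    lines = []
    for chunk in result.get("chunks", []):
        start, end = chunk["timestamp"]
        lines.append(f"[{_label(start)} - {_label(end)}] {chunk['text'].strip()}")
    return lines


def transcribe_with_timestamps(audio_path: str) -> dict[str, Any]:
    # Imported lazily so format_chunks works without transformers installed.
    from transformers import pipeline

    pipe = pipeline("automatic-speech-recognition", model="openai/whisper-base")
    # return_timestamps=True asks for segment-level timestamps;
    # return_timestamps="word" asks for word-level ones.
    return pipe(audio_path, return_timestamps=True)
```

Calling `format_chunks(transcribe_with_timestamps("sample.wav"))` (the file name is a placeholder) should give one line per segment. The same `chunks` structure could be returned as JSON from an Express route if the Node library produces it.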