Native R 'torch' Implementation of 'OpenAI' 'Whisper'


[Up] [Top]

Documentation for package ‘whisper’ version 0.1.0

Help Pages

apply_bpe Apply BPE Merges
audio_duration Get Audio Duration
audio_to_mel Convert Audio to Mel Spectrogram
byte_to_token Convert Byte to BPE Token
clean_text Clean Transcribed Text
compute_stft Compute STFT Magnitude
copy_if_exists Copy Weight if Exists
create_decoder Create Decoder from Config
create_encoder Create Encoder from Config
create_mel_filterbank_fallback Create Mel Filterbank (Fallback)
decode_bpe_bytes Decode BPE Bytes Back to Text
decode_timestamp Decode Timestamp Token
download_tokenizer_files Download Tokenizer Files from HuggingFace
download_whisper_model Download Model from HuggingFace
ensure_tokenizer_files Ensure Tokenizer Files are Downloaded
extract_segments Extract Segments with Timestamps
get_initial_tokens Get Initial Decoder Tokens
get_model_path Get Model Cache Path
get_weights_path Get Path to Model Weights
greedy_decode Greedy Decoding
hz_to_mel Convert Hz to Mel Scale
is_timestamp_token Check if Token is Timestamp
list_downloaded_models List Downloaded Models
list_whisper_models List Available Models
load_added_tokens Load Added Tokens from HuggingFace
load_audio Load and Preprocess Audio
load_decoder_weights Load Decoder Weights
load_encoder_weights Load Encoder Weights
load_mel_filterbank Load Pre-computed Mel Filterbank
load_whisper_model Load Whisper Model
load_whisper_weights Load Weights from Safetensors
mel_to_hz Convert Mel Scale to Hz
model_exists Check if Model is Downloaded
pad_or_trim Pad or Trim Audio to Fixed Length
parse_device Parse Device Argument
parse_dtype Parse Dtype Argument
split_audio Split Long Audio into Chunks
tokenizer_decode Decode Token IDs to Text
tokenizer_encode Encode Text to Token IDs
transcribe Whisper Transcription
transcribe_chunk Transcribe Single Chunk
transcribe_long Transcribe Long Audio
whisper_attention Whisper Encoder
whisper_config Whisper Model Configurations
whisper_decoder Text Decoder
whisper_decoder_layer Whisper Decoder
whisper_device Device and Dtype Management
whisper_dtype Get Default Dtype
whisper_encoder Audio Encoder
whisper_encoder_layer Encoder Layer
whisper_lang_token Get Language Token ID
whisper_model Whisper Model
WHISPER_SAMPLE_RATE Audio Preprocessing for Whisper
whisper_special_tokens Special Token IDs
whisper_tokenizer Whisper BPE Tokenizer