Text Transcription using Transformers to your Mother Tongue.
- model-centric approaches:
- tune hyperparameter - set
warmup_steps=100
andmax_steps=1000
. We found that the model starts overfitting after 1000 epochs. - change model - use
openai/whisper-medium
instead ofopenai/whisper-small
. We found the eval loss stop decreasing after 1000 epochs which means the performance ceiling of small model has been reached.
- data-centric approach: Increase sampling rate to 24000 to get more details.
App URL: https://huggingface.co/spaces/esnagy/hungarian_speech_transcriber