Files
scribe/README.md
T
2023-07-07 12:57:31 +02:00

1.2 KiB

AutoTranscript: Fully Automated Transcription using AI

AutoTranscript is a PyTorch based interface for. To enable fully auomated Transcription using AI models containing speaker diarization models:

  • whisper: an a general-purpose speech recognition model
  • payannote-audio an open-source toolkit for speaker diarization

Therefore AutoTranscript can be used as a Commandline Interface a Webserver or as a Python API.

Setup:

For this Project, Python 3.9 were PyTorch version 1.11.0

The following command will pull and install the latest commit from this repository, along with its Python dependencies.

pip install https://github.com/JSchmie/autotranscript.git

Example Python usage

from autotranscript import AutoTranscribe

model = AutoTranscribe()

text = model.transcribe("audio.wav")

print(f"Transcription: \n{text}")

Command-line usage

If you not want to control the optimization using python, you also can use the Command-line:

autotranscript audio.wav

Run the following to view all available options:

autotranscript -h

License

Citation