f0d7e25ecdff5723901eb228da86922a772e7367
AutoTranscript: Fully Automated Transcription using AI
AutoTranscript is a PyTorch based interface speech-to-text tool to generate fully automated transcriptions. AutoTranscript uses AI models containing speaker diarization models:
- whisper: A general-purpose speech recognition model.
- payannote-audio: An open-source toolkit for speaker diarization-.
AutoTranscript can be used as a command-line interface, a webserver, or as a Python API.
Install AutoTranscript :
The following command will pull and install the latest commit from this repository, along with its Python dependencies.
pip install https://github.com/JSchmie/autotranscript.git
- Python version: Python 3.9
- PyTorch version: Python 1.11.0
Usage examples
Python usage
from autotranscript import AutoTranscribe
model = AutoTranscribe()
text = model.transcribe("audio.wav")
print(f"Transcription: \n{text}")
Command-line usage
If you do not want to control the optimization using Python, you also can use the command-line:
autotranscript audio.wav
Run the following to view all available options:
autotranscript -h
Contact
License
Cite AutoTranscript :
Languages
Python
90.4%
HTML
6%
CSS
2.3%
Dockerfile
1%
Makefile
0.3%