Deep Neural Audio
    1.
    ​KALDI speech recognition toolkit with many SOTA models.
    4.
    ​speech recognition with DL - how to convert sounds to vectors, feeding into an RNN.

Tools

​Gecko - (github.com/gong-io/gecko) youtube, is an open-source tool for the annotation of the linguistic content of conversations. It can be used for segmentation, diarization, and transcription. With Gecko, you can create and perfect audio-based datasets, compare the results of multiple models simultaneously, and highlight differences between transcriptions.
Last modified 2mo ago
Copy link
Contents
Tools