Automatic Speech Recognition (ASR)

Pitfalls I Ran Into with Speech Processing


ASR uses speech signal processing and pattern recognition to let machines automatically recognize and understand human speech.
Meeting & Minutes Master | TW Deep Master
Call for Partner or POC (Proof of Concept) Contact: TonTon ( at ) TWMAN.ORG
Chinese speaker recognition | Chinese speech enhancement (denoising) | Chinese speaker separation

Kaldi-ASR Install Guide & Errors

https://zhuanlan.zhihu.com/p/148524930

https://zhuanlan.zhihu.com/p/44483840

  1. Download the Kaldi source code
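
For step 1, a minimal sketch of fetching the source. It assumes the official repository on GitHub (github.com/kaldi-asr/kaldi) and a checkout directory named kaldi, which matches the cd kaldi/... commands used in the later steps. The function name fetch_kaldi and the variables KALDI_REPO and KALDI_DIR are introduced here only for illustration.

```shell
# Assumption: the official repository lives at github.com/kaldi-asr/kaldi.
KALDI_REPO="${KALDI_REPO:-https://github.com/kaldi-asr/kaldi.git}"
# Assumption: the checkout directory is called "kaldi", as in the later steps.
KALDI_DIR="${KALDI_DIR:-kaldi}"

fetch_kaldi() {
  # Skip the clone when a checkout already exists, so re-running is harmless.
  if [ -d "$KALDI_DIR/.git" ]; then
    echo "already cloned: $KALDI_DIR"
    return 0
  fi
  # --depth 1 fetches only the latest commit, which keeps the download small.
  git clone --depth 1 "$KALDI_REPO" "$KALDI_DIR"
}
```

Calling fetch_kaldi from the directory where the checkout should live either clones the repository or reports that it is already there.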

  2. Install the required packages

    • cd kaldi/tools/extras

    • Run the dependency check; it will likely report several missing packages
      ./check_dependencies.sh
      tools/extras/check_dependencies.sh: sox is not installed.
      tools/extras/check_dependencies.sh: gfortran is not installed.
      tools/extras/check_dependencies.sh: subversion is not installed

    • Install the missing dependencies
      sudo apt-get install zlib1g-dev automake autoconf sox gfortran subversion

    • Installing intel-mkl often fails with errors like the following
      sudo ./install_mkl.sh -sp debian intel-mkl-64bit-2020.0-088
      Err:9 http://ppa.launchpad.net/jonathonf/python-3.6/ubuntu xenial Release 403 Forbidden [IP:91.189.95.85 80]
      Reading package lists... Done
      W: GPG error: http://dl.google.com/linux/chrome/deb stable InRelease:
      The following signatures couldn't be verified because the public key is not available: NO_PUBKEY 78BD65473CB3BD13
      E: The repository 'http://dl.google.com/linux/chrome/deb stable InRelease' is not signed.
      N: Updating from such a repository can't be done securely, and is therefore disabled by default.
      N: See apt-secure(8) manpage for repository creation and user configuration details.
      E: The repository 'http://ppa.launchpad.net/jonathonf/python-3.6/ubuntu xenial Release' does not have a Release file.
      N: Updating from such a repository can't be done securely, and is therefore disabled by default.
      N: See apt-secure(8) manpage for repository creation and user configuration details.

    • With a desktop: open Software & Updates, switch to the Other Software tab, find the entries for the failing repositories, and uncheck them (but what if I only have a terminal?)

    • With only a terminal: delete the offending source lists under /etc/apt/sources.list.d directly, then install intel-mkl again
      sudo rm /etc/apt/sources.list.d/jonathonf-ubuntu-python-3_6-xenial.list
      sudo rm /etc/apt/sources.list.d/google-chrome.list
      sudo apt update
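
The two file names removed above are specific to this machine. As a hedged sketch for the terminal-only case, the read-only helper below lists which source files mention a failing repository, so the right ones can be identified before anything is deleted. The function name find_source_lists is hypothetical, and the sketch assumes third-party sources are stored as *.list files.

```shell
# List the *.list files in a sources directory that mention any given string.
# Usage: find_source_lists <dir> <string>...
# Assumption: each third-party repository has its own *.list file.
find_source_lists() {
  dir="$1"
  shift
  for pattern in "$@"; do
    # -l prints only file names, -F treats the pattern as a fixed string.
    grep -lF -- "$pattern" "$dir"/*.list 2>/dev/null
  done | sort -u
}

# Example (read-only), using the hosts from the errors above:
# find_source_lists /etc/apt/sources.list.d ppa.launchpad.net/jonathonf dl.google.com
```

Since apt only reads files in that directory whose names end in .list (or .sources), renaming a file instead of deleting it is a reversible alternative.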

  3. Install third-party tools and the language-model toolkit

    • cd kaldi/tools
      sudo apt-get install libtool
      sudo make openfst
      sudo make cub

    • sudo extras/install_irstlm.sh
      sudo extras/install_openblas.sh
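
Before moving on to the build, it can help to confirm that step 3 actually produced its outputs. The sketch below is only a rough check: the helper name missing_tools is hypothetical, and the directory names in the commented example (openfst, cub, irstlm, OpenBLAS under kaldi/tools) are assumptions about where these tools end up, which may differ between Kaldi versions.

```shell
# Print each expected entry that does not exist under the tools directory.
# Usage: missing_tools <tools_dir> <name>...
missing_tools() {
  tools_dir="$1"
  shift
  for name in "$@"; do
    # -e is true for files, directories, and symlinks that resolve.
    [ -e "$tools_dir/$name" ] || echo "$name"
  done
}

# Example (directory names are assumptions, adjust to what your tree contains):
# missing_tools kaldi/tools openfst cub irstlm OpenBLAS
```

An empty result suggests every listed tool is in place; any printed name points at the install command worth re-running.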

  4. Build, install, and test

    • cd kaldi/src
      ./configure --shared
      make depend -j 8
      make -j 8

    • cd ../egs/yesno/s5/
      ./run.sh
      %WER 0.00 [ 0 / 232, 0 ins, 0 del, 0 sub ] exp/mono0a/decode_test_yesno/wer_10_0.0
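
The last line of the yesno run reports the word error rate. As a rough guide to reading it: the bracket holds total errors over reference words, followed by the insertion, deletion, and substitution counts, and WER = 100 × (ins + del + sub) / words. The sketch below recomputes the percentage from those counts; the helper name wer_from_line is hypothetical, and it assumes only the field layout of the line shown above.

```shell
# Recompute WER (%) from "%WER" summary lines read on stdin.
# Layout: %WER <wer> [ <errors> / <words>, <ins> ins, <del> del, <sub> sub ] <path>
wer_from_line() {
  awk '$1 == "%WER" {
    words = $6 + 0              # "232," converts to 232
    errs  = $7 + $9 + $11       # insertions + deletions + substitutions
    wer   = (words > 0) ? 100 * errs / words : 0
    printf "%.2f\n", wer
  }'
}

# Example:
# echo "%WER 0.00 [ 0 / 232, 0 ins, 0 del, 0 sub ] exp/mono0a/decode_test_yesno/wer_10_0.0" | wer_from_line
```

For the run above, 0 errors over 232 words means every test utterance was recognized correctly, which is the expected result for a working install.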