Logo Deepak Baby
  • Home
  • About
  • Experience
  • Education
  • Publications
  • More
    Recent Posts
  • Posts
  • Dark Theme
    Light Theme Dark Theme System Theme
Logo Inverted Logo
  • Tags
  • Ai
  • Basic
  • Content Organization
  • DDP
  • Deep Learning
  • Distributed Training
  • FSDP
  • Kaldi
  • Keras
  • Linux
  • Malayalam
  • Markdown
  • Matlab
  • Mkl
  • Multi-Lingual
  • Pipeline Parallelism
  • Presentation
  • Python
  • PyTorch
  • UGent
  • Vae
  • Variational Autoencoder
Hero Image
Training kaldi models with custom features

Kaldi Speech Recognition Toolkit is a freely available toolkit that offers several tools for conducting research on automatic speech recognition (ASR). It lets us train an ASR system from scratch all the way from the feature extraction (MFCC,FBANK, ivector, FMLLR,…), GMM and DNN acoustic model training, to the decoding using advanced language models, and produce state-of-the-art results. While kaldi offers so much flexibilty at every stage, sometimes we also need to play with features that are not offered by the kaldi repository. Kaldi makes use of ark format to store the features. If we want to perform experiments with customized features, they must be converted to the ark format first. The goal of this post is to explain how we can extract and store the custom features in the ark format using matlab and python.

  • kaldi
  • matlab
  • python
Wednesday, March 6, 2019 | 10 minutes Read
Navigation
  • About
  • Experience
  • Education
  • Publications
  • Recent Posts
Contact me:
  • deepakbabycet@gmail.com
  • deepakbaby
  • Deepak Baby
  • +32 483 611 040

Liability Notice: This theme is under MIT license. So, you can use it for non-commercial, commercial, or private uses. You can modify or distribute the theme without requiring any permission from the theme author. However, the theme author does not provide any warranty or takes any liability for any issue with the theme.


Toha Theme Logo Toha
© 2025 Copyright.
Powered by Hugo Logo