A curated list of speech and natural language processing resources

Source: Internet
Author: User
Tags fsm

A curated list of speech and natural language processing resources

At Josh.ai, we ' re often asked for developer resources relating to natural language processing, machine learning, and artif Icial Intelligence. Paul Dixon, a researcher living in Kyoto Japan, put together a curated list of excellent speech and natural language proce Ssing tools. Below is the list current as of Oct 1, 2015. Check out the GitHub repo for more here.

Finite state toolkits and Regular Expressions
  • AT/t FSM Library the LIBRARYTM FSM is a set of general-purpose software tools available for Unix, for Buildin G, combining, optimizing, and searching weighted finite-state acceptors and transducers.
  • Carmel finite-state Toolkit, EM and Bayesian (Gibbs sampling) training for FST and Context-free derivation forests/
  • Categorial semiring categorial semiring as described in Sproat et al. 2014
  • Dk.brics.automaton Java Toolkit for FSAs and regular expression.
  • Fare Fare is a finite state and regular expression libary for the. NET framework written in C #. AM is a JavaScript library for working with automata and formal grammars for regular and Context-free languages
  • Foma finite-state compiler and C Library
  • FSA Toolkit used in RWTH ASR engine
  • fsm2.0 Thomas hanneforths FSM 2.0 library written C + + have a few nice operations such as Three-way composition
  • Fstrain A Toolkit for training finite-state models
  • JOPENFST Java Port of the C + + OPENFST library; Originally forked from the CMU Sphinx project
  • Kleene programming language High level finite state programming language built on top of OPENFST.
  • MIT FST Toolkit wfst Toolkit no maintained anymore but feature a few commands not found in other toolkits
  • Moms-for-stochasticlanguages spectral and other training algorithms for Wfsas.
  • N Shortest path for PDT N Shortest path for PDT
  • Noam "Noam is a JavaScript library for working with automata and formal grammars for regular and Context-free languages". Also has pretty cool examples using viz.js
  • Openfst Openfst is a library for constructing, combining, optimizing, and searching weighted finite-state transducers (FST s).
  • Openfst-utils Nice set of utilities for OPENFST includes implementation of Categorial semirings.openfst-utils.
  • Openlat Toolkit for manipulating word lattice built on top of OPENFST. Includes support for reading and writing HTK compatible lattices.
  • Pyfst Python interface to OPENFST
  • Sfst-stuttgart finite state transducer Tools sfst are a toolbox for the implementation of morphological analysers and OT Her tools which is based on finite state transducer technology. "
  • Treba "Treba is a basic command-line tool for training, decoding, and calculating with weighted (probabilistic) finite sta Te automata (PFSA) and Hidden Markov Models (HMMs). "

Many of the tools in the machine translation sections also implement interesting graph and semiring operations.

Language Modeling Toolkits
  • Bayesian recurrent neural Network for Language Modeling the is a C + + implementation for Bayesian recurrent neural netwo RK for Language Modeling (BRNNLM)
  • Berkeley LM
  • BIGFATLM provides Hadoop training of Kneser-ney language models, written in Java.
  • CSLM "Continuous Space Language Model Toolkit. CSLM Toolkit is Open-source software which implements the so-called continuous space language model.
  • Dalm Double Array language model.
  • KENLM Kenneth Heafield ' s Language Model toolkit, uses a very fast and low memory representation.
  • LWLM LWLM is a exact, full Bayesian implementation of the latent Words Language Model (Deschacht and Moens, 2009).
  • Maximum Entropy Modeling Le Zhang has a comprehensive set of links related MaxEnt.
  • Maximum Entropy Language models:srilm extension "This patch adds the functionality to train and apply Maximum entropy (Ma Xent) language models to the Srilm Toolkit. Currently, only N-gram features is supported "
  • MITLM My personal favourite LM toolkit, super fast and seems to get slightly higher accuracy.
  • MSRLM "This scalable Language-model tool was used to build language models from large amounts of data. It supports modified absolute discounting and Kneser-ney smoothing. "
  • OPENGRM Language Modelling Toolkit for use with OPENFST.
  • CPYP C + + library for modeling with Pitman-yor processes
  • RANDLM Bloom filter based random language models
  • RNNLM Recurrent Neural Network language Model toolkit.
  • REFR re-ranking Framework from the Johns-hopkins Workshop on Confusion language modelling.
  • RWTHLM A Toolkit for Training Neural Network language models (Feedforward, recurrent, and long short-term memory neural NE Tworks). The software was written by Martin Sundermeyer.
  • Srilm Very Popular Toolkit, source code avaliable but only non-free for commerical use.
Speech recognition
  • AALTOASR Aalto Automatic Speech recognition tools
  • Barista Barista is a open-source framework for concurrent speech processing.
  • Bavieca New Open source Toolkit featuring static and dynamic decoders.
  • Kaldi-nnet-dur-model Neural network phone duration model on top of the Kaldi speech recognition Framework, (Interspeech PA Per
  • CMU Sphinx Open Source Toolkit for Speech recognition Project by Carnegie Mellon University
  • HTK "The Hidden Markov Model Toolkit (HTK) is a portable Toolkit for building and manipulating Hidden Markov models."
  • Juicer Juicer is a Weighted finite state transducer (WFST) based decoder for Automatic Speech recognition (ASR).
  • Julius "Julius is a high-performance, two-pass large vocabulary continuous speech recognition (LVCSR) decoder software for Speech-related researchers and developers. "
  • Kaldi Modern open Source Toolkit leads by Dan Povey featuring many state-of-the-art techniques.
  • OPENDCD an Open Source wfst based Speech recognition Decoder.
  • Phonetisaurus Josef Novak's super fast WFST based Phoneticizer, site also have some really nice tutorials slides.
  • Sail Align Sailalign is a Open-source software toolkit for robust long speech-text alignment implementing an adaptive, it Erative speech recognition and text alignment scheme that allows for the processing of very long (and possibly noisy) Audi O and is robust to transcription errors. It is mainly written as a Perl library but its functionality also depends ...
  • Scarf:a segmental CRF Toolkit for Speech recognition "SCARF are A Toolkit for doing Speech recognition with segmental cond itional random fields. "
  • Trainc David Rybach and Michael Riley ' s tool for direct construction of context-dependency transducers (Interspeech best P Aper).
  • RASR RWTH asr-the RWTH Aachen University Speech recognition System
Signal processing
    • An Interactive Source separation Editor "Isse are an open-source, freely available, Cross-platform Audio Editing Tool Allows a user to perform source separation by painting on time-frequency visualisations the sound. "
    • Bob Bob is a free signal-processing and machine learning Toolbox originally developed by the Biometrics group at Idiap Res Earch Institute, in Switzerland.
    • Matlab Audio Processing Examples
    • Sacc-subband autocorrelation classification Pitch Tracker "SACC is a (compiled) Matlab script that performs Noise-robus T pitch tracking by classifying the autocorrelations of a set of subbands using an MLP neural network. "
Text-to-Speech
    • HTS hmm-based speech synthesis
    • Rusphonetizer Grammar rules and dictionaries for the phonetic transcription of Russian sentences
Speech Data
    • Cmudict cmudict (The Carnegie Mellon pronouncing Dictionary) is a free pronouncing Dictionary of 中文版.
    • Librispeech ASR Corpus Librispeech is a corpus of approximately hours of 16kHz read 中文版 speech, prepared by Vassi L Panayotov with the assistance of Daniel Povey. The data is derived from read audiobooks from the LibriVox project, and have been carefully segmented and aligned.
    • Ted-lium Corpus The Ted-lium Corpus was made from audio talks and their transcriptions available on the TED website.
Machine translation
  • Berkeley aligner "... a word alignment software package, implements recent innovations in unsupervised word alignment."
  • CDec "Decoder, Aligner, and model optimizer for statistical machine translation and other structured prediction models BAS Ed on (mostly) Context-free formalisms "
  • Jane "Jane is RWTH's Open source statistical machine translation Toolkit. Jane supports State-of-the-art techniques for phrase-based and hierarchical phrase-based machine translation. "
  • Joshua hierarchical and syntax based machine translation decoder written in Java.
  • Moses standard open source Machine translation toolkit.
  • Alignment-with-openfst
  • Zmert Nice Java Mert implementation by Omar F. Zaidan
Machine learning
    • Biddata Bidmat is a matrix library intended-to-support large-scale exploratory data analysis. Its sister library Bidmach implements, the machine learning layer.
    • Libfm:factorization Machine Library
    • SOFIA-ML Fast Incremental Learning algorithms for classification, regression, ranking from Google.
    • Spearmint Spearmint is a package to perform Bayesian optimization according to the algorithms outlined in the Paper:pract ical Bayesian optimization of machine learning Algorithms Jasper Snoek, Hugo Larochelle and Ryan p. Adams advances in Neur Al Information Processing Systems, 2012
Deep learning
  • Benchmarks-comparison of different convolution network implementations.
  • Cafee-really Active Deep Learning Toolkit with support for CuDNN and lots of other backends.
  • Cudnn-deep Neural network from Nvidia with paper here. Torch 7 have support for CuDnn Andhere is some Python wrappers.
  • Currennt-munich open-source CUDA Recurrent neural Network Toolkit described in Thispaper
  • Gensim-python Topic Modeling Toolkit with WORD2VEC implementation. Extremly easy-to-use and to install.
  • Glove Global vectors for word representation.
  • Groundhog neural network based machine translation toolkit.
  • KALDI LSTM C + + implementation of LSTM (Long short term Memory), in KALDI ' s NNET1 framework. Used for automatic speech recognition, possibly language modeling etc.
  • Oxlm:oxford Neural Language Modelling Toolkit Neural network Toolkit for machine translation described in the paper here
  • Neural probabilistic Language Model Toolkit "NPLM is a Toolkit for training and using Feedforward neural Language models ( Bengio, 2003). It's fast even for large vocabularies (100k or more): A model can being trained on a billion words of data in about a week, And can be queried in about 40μs, which are usable inside a decoder for machine translation. "
  • Rnnlm2wfst Tool to convert Rnnlms to Wfsts
  • Viennacl
Natural Language Processing
    • Bllip reranking Parser "Bllip parser is a statistical natural language parser including a generative constituent parser (f irst-stage) and discriminative maximum entropy Reranker (second-stage). "
    • OPENNLP the Apache OPENNLP Library is a machine learning based toolkit for the processing of natural language text.
    • SEAL Set Expander for any language described in this paper
    • Stanford CORENLP "Stanford CORENLP provides a set of natural language analysis tools written in Java"
Applications
    • Cloud ASR using Pykaldi "CLOUDASR is a software platform and a public ASR webservice."
Other Tools
    • Graphviz.sty really handy tool adding dot languge directly to a LATEX document, useful for tweaking the small colorized WF ST figure in papers and presentations.
Blogs
    • Between one and Zero by William Hartmann
    • Cmusphinx CMU Sphinx Related Blog
    • Language Log
    • Lingpipe Blog Natural Language processing and Text Analytics
    • Natural Language processing Blog by Hal DAUMÉIII
    • Spoken Language Processing "Some thoughts on spoken Language processing, with tangents on Natural Language processing, MAC Hine learning, and Signal processing thrown in for good measure. "
Books
    • Deep learning:methods and Applications by Li Deng and Dong Yu
    • Foundations of Data Science Draft by John Hopcroft and Ravindran Kannan
    • Introduction to Matrix Methods and Applications (working Title) S. Boyd and L. Vandenberghe

A curated list of speech and natural language processing resources

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.