Julius

Provided by:
Kyoto University - Kawahara Lab


Julius is a high-performance, two-pass large vocabulary continuous speech recognition (LVCSR) decoder software for speech-related researchers and developers. Based on word N-gram and context-dependent HMM, it can perform almost real-time decoding on most current PCs in 60k word dictation task. Major search techniques are fully incorporated such as tree lexicon, N-gram factoring, cross-word context dependency handling, enveloped beam search, Gaussian pruning, Gaussian selection, etc. Besides search efficiency, it is also modularized carefully to be independent from model structures, and various HMM types are supported such as shared-state triphones and tied-mixture models, with any number of mixtures, states, or phones. Standard formats are adopted to cope with other free modeling toolkit such as HTK, CMU-Cam SLM toolkit, etc.

 

The main platform is Linux and other Unix workstations, and also works on Windows. Most recent version is developed on Linux and Windows (cygwin / mingw), and also has Microsoft SAPI version. Julius is distributed with open license together with source codes.

Since Julius is a language-independent decoding program, you can make a recognizer of a language given an appropriate language model and acoustic model for the target language. The recognition accuracy largely depends on the models.

Julius adopts acoustic models in HTK ascii format, pronunciation dictionary in almost HTK format, and word 3-gram language models in ARPA standard format (forward 2-gram and reverse 3-gram trained from same corpus).

Japanese and English language/acoustic models are available for free from the Julius homepage.


Get Started with the service

: Free of charge

Support

Helpdesk: julius-info@lists.sourceforge.jp

Access the service

Request - Source And Executable Code : http://julius.osdn.jp/

Terms of Use: Large Vocabulary Continuous Speech Recognition Engine Julius

Tool