edu.cmu.sphinx.tools.batch
Class BatchNISTRecognizer

java.lang.Object
  extended by edu.cmu.sphinx.tools.batch.BatchModeRecognizer
      extended by edu.cmu.sphinx.tools.batch.BatchNISTRecognizer
All Implemented Interfaces:
Configurable
Direct Known Subclasses:
BatchForcedAlignerRecognizer

public class BatchNISTRecognizer
extends BatchModeRecognizer

Copyright 1999-2002 Carnegie Mellon University. Portions Copyright 2002 Sun Microsystems, Inc. Portions Copyright 2002 Mitsubishi Electric Research Laboratories. All Rights Reserved. Use is subject to license terms.

See the file "license.terms" for information on usage and redistribution of this file, and for a DISCLAIMER OF ALL WARRANTIES.

User: Peter Wolf Date: Nov 10, 2005 Time: 2:42:06 PM Copyright 2005, Peter Wolf

Runs a NIST corpus as used by the GALE project. The inputs are a CTL file and a REF file. The output is a CTM file.

A CTL file contains a list of utterances to decode. The format of each line is

<utterance file> <start offset> <end offset> <utterance name>

The <utterance file> is a base name to which the property "dataDirectory" is prepended and ".raw" is appended. The utterance file should be raw PCM that agrees with the "bitsPerSample", "channelCount", "samplesPerSecond", and "framesPerSecond" properties.

The <start offset> and <end offset> are specified in frames, where

bytesPerFrame = (bitsPerSample/8)*channelCount*samplesPerSecond/framesPerSecond

The <utterance name> should be a unique string, for example "<utterance file>_<start offset>_<end offset>".
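The CTL conventions described above can be sketched in code. This is a minimal illustration, not the class's actual implementation: the class name CtlLineSketch, its members, and the whitespace-separated field order (file base, start frame, end frame, utterance name) are assumptions made for the example. The audio parameters used in main are the documented property defaults (16 bits, 1 channel, 16000 samples per second, 100 frames per second).

```java
/** Hypothetical helper illustrating the CTL conventions; not part of Sphinx-4. */
public class CtlLineSketch {
    public final String fileBase;  // base name of the utterance file
    public final int startFrame;   // start offset, in frames
    public final int endFrame;     // end offset, in frames
    public final String name;      // unique utterance name

    public CtlLineSketch(String line) {
        // Assumed layout: four whitespace-separated fields.
        String[] f = line.trim().split("\\s+");
        if (f.length != 4) {
            throw new IllegalArgumentException("Expected 4 fields: " + line);
        }
        fileBase = f[0];
        startFrame = Integer.parseInt(f[1]);
        endFrame = Integer.parseInt(f[2]);
        name = f[3];
    }

    /** bytesPerFrame = (bitsPerSample/8)*channelCount*samplesPerSecond/framesPerSecond */
    public static int bytesPerFrame(int bitsPerSample, int channelCount,
                                    int samplesPerSecond, int framesPerSecond) {
        return (bitsPerSample / 8) * channelCount * samplesPerSecond / framesPerSecond;
    }

    /** Prepends dataDirectory and appends ".raw" to the base name. */
    public String audioPath(String dataDirectory) {
        return dataDirectory + fileBase + ".raw";
    }

    public static void main(String[] args) {
        CtlLineSketch u = new CtlLineSketch("utt01 100 250 utt01_100_250");
        int bpf = bytesPerFrame(16, 1, 16000, 100);
        // Frame offsets become byte offsets into the raw PCM file.
        long startByte = (long) u.startFrame * bpf;
        long endByte = (long) u.endFrame * bpf;
        System.out.println(u.audioPath("data/"));
        System.out.println(bpf + " bytes/frame, bytes " + startByte + " to " + endByte);
    }
}
```

With the default audio properties the formula gives 320 bytes per frame, so frame 100 starts at byte 32000 of the raw file.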

A REF file contains the correct transcripts of the utterances specified in the CTL file. Each line should be of the form

<transcript> (<utterance name>)
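As an illustration of the REF line shape, the sketch below splits a line into its transcript and its utterance name, assuming the name is the final parenthesized token on the line. The class RefLineSketch and its parse method are hypothetical names introduced for this example and are not part of the Sphinx-4 API.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical helper for REF lines; not part of Sphinx-4. */
public class RefLineSketch {
    // Transcript text, followed by the utterance name in a trailing pair of parentheses.
    private static final Pattern REF_LINE =
            Pattern.compile("^(.*?)\\s*\\(([^()]*)\\)\\s*$");

    /** Returns a two-element array: {transcript, utteranceName}. */
    public static String[] parse(String line) {
        Matcher m = REF_LINE.matcher(line);
        if (!m.matches()) {
            throw new IllegalArgumentException("Not a REF line: " + line);
        }
        return new String[] { m.group(1).trim(), m.group(2).trim() };
    }

    public static void main(String[] args) {
        String[] ref = parse("hello world (utt01_100_250)");
        System.out.println(ref[1] + " -> " + ref[0]);
    }
}
```

Keying the transcripts by utterance name is what lets each REF line be matched to the CTL entry with the same name, which is why the name should be unique.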

The output is a "processed" CTM file, which the NIST tools use to compute performance on the corpus. The format is not documented because it is currently a hack to get the Dry Run going, and it needs more thought. If you want to use this tool, talk to Peter Wolf or Arthur Chan.


Nested Class Summary
 class BatchNISTRecognizer.CTLUtterance
           
 
Field Summary
static java.lang.String PROP_BITS_PER_SAMPLE
          The sphinx properties that specify the format of the PCM audio in the data file
static java.lang.String PROP_CHANNEL_COUNT
           
static java.lang.String PROP_CTL_FILE
          The sphinx property that specifies the CTL file listing the utterances to decode
static java.lang.String PROP_CTM_FILE
          The sphinx property that specifies the file to which the output CTM is written
static java.lang.String PROP_DATA_DIR
          The sphinx property that specifies the directory containing the corpus utterance audio files
static java.lang.String PROP_FRAMES_PER_SECOND
           
static java.lang.String PROP_REF_FILE
          The sphinx property that specifies the file containing the transcripts of the corpus
static java.lang.String PROP_SAMPLES_PER_SECOND
           
 
Fields inherited from class edu.cmu.sphinx.tools.batch.BatchModeRecognizer
PROP_COUNT, PROP_COUNT_DEFAULT, PROP_INPUT_DATA_PROCESSORS, PROP_RECOGNIZER, PROP_SKIP, PROP_SKIP_DEFAULT, PROP_TOTAL_BATCHES, PROP_TOTAL_BATCHES_DEFAULT, PROP_USE_POOLED_BATCH_MANAGER, PROP_USE_POOLED_BATCH_MANAGER_DEFAULT, PROP_WHICH_BATCH, PROP_WHICH_BATCH_DEFAULT
 
Constructor Summary
BatchNISTRecognizer()
           
 
Method Summary
 void decode()
           
static byte[] hex2Binary(java.lang.String spelling)
           
static void main(java.lang.String[] argv)
           
 void newProperties(PropertySheet ps)
          This method is called when this configurable component needs to be reconfigured.
 
Methods inherited from class edu.cmu.sphinx.tools.batch.BatchModeRecognizer
decode, recognize, setBatchFile, shell, start, stop
 
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

PROP_DATA_DIR

@S4String(defaultValue="")
public static final java.lang.String PROP_DATA_DIR
The sphinx property that specifies the directory containing the corpus utterance audio files

See Also:
Constant Field Values

PROP_CTL_FILE

@S4String(defaultValue="")
public static final java.lang.String PROP_CTL_FILE
The sphinx property that specifies the CTL file listing the utterances to decode

See Also:
Constant Field Values

PROP_REF_FILE

@S4String(defaultValue="")
public static final java.lang.String PROP_REF_FILE
The sphinx property that specifies the file containing the transcripts of the corpus

See Also:
Constant Field Values

PROP_CTM_FILE

@S4String(defaultValue="")
public static final java.lang.String PROP_CTM_FILE
The sphinx property that specifies the file to which the output CTM is written

See Also:
Constant Field Values

PROP_BITS_PER_SAMPLE

@S4Integer(defaultValue=16)
public static final java.lang.String PROP_BITS_PER_SAMPLE
The sphinx properties that specify the format of the PCM audio in the data file

See Also:
Constant Field Values

PROP_CHANNEL_COUNT

@S4Integer(defaultValue=1)
public static final java.lang.String PROP_CHANNEL_COUNT
See Also:
Constant Field Values

PROP_SAMPLES_PER_SECOND

@S4Integer(defaultValue=16000)
public static final java.lang.String PROP_SAMPLES_PER_SECOND
See Also:
Constant Field Values

PROP_FRAMES_PER_SECOND

@S4Integer(defaultValue=100)
public static final java.lang.String PROP_FRAMES_PER_SECOND
See Also:
Constant Field Values

Constructor Detail

BatchNISTRecognizer

public BatchNISTRecognizer()

Method Detail

newProperties

public void newProperties(PropertySheet ps)
                   throws PropertyException
Description copied from interface: Configurable
This method is called when this configurable component needs to be reconfigured.

Specified by:
newProperties in interface Configurable
Overrides:
newProperties in class BatchModeRecognizer
Parameters:
ps - a property sheet holding the new data
Throws:
PropertyException - if there is a problem with the properties.

decode

public void decode()

hex2Binary

public static byte[] hex2Binary(java.lang.String spelling)

main

public static void main(java.lang.String[] argv)