ANSI/INCITS 456-2010
1 Scope
This standard specifies a concept and data format for representation of the human voice at the raw-datalevel with optional inclusion of nonstandardized extended data. It does not address handling of data thathas been processed to the feature or voice model levels.
The data format is generic in that it may be applied to and used in a wide range of application areas whereautomated and human-to-human SIV is performed. No application-specific requirements, equipment, orfeatures are addressed in this standard. Through its XML orientation, this standard does, however,reflect recognition of the overwhelming dominance of the VoiceXML standard in speech processing andassociated XML-based standards.
This standard contains definitions of relevant terms, a description of the basic speaker-recognitionSession, a data format for containing the data, and conformance information.
SIV applications and engines utilize adaptation to automatically update the information in a referencemodel. Despite its value, representing adaptation lies outside of the bounds of this standard because itoperates on voice model data and this standard focuses on raw data transmission.