Those requirements can be met with a little clever engineering.
Our first feature vector
A key step in parameterising speech is to move from the time domain to a domain in which distances are more meaningful, so that we can perform pattern matching.
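To see why distances in the time domain can mislead, here is a minimal sketch. It assumes the alternative domain is the magnitude spectrum, which is an illustrative choice and not something stated above. The helper names (`magnitude_spectrum`, `euclidean`, `sine`) and the frame length of 64 samples are hypothetical, chosen only for this example. It compares a sine wave with a phase-shifted copy of itself and with a sine wave of a different frequency.

```python
import cmath
import math

N = 64  # frame length in samples (arbitrary choice for this sketch)


def sine(cycles, phase=0.0):
    """One frame of a sine wave with a whole number of cycles per frame."""
    return [math.sin(2 * math.pi * cycles * t / N + phase) for t in range(N)]


def magnitude_spectrum(frame):
    """Magnitude of a naive DFT of a real frame, for bins 0..N/2."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


a = sine(4)                  # reference frame
b = sine(4, phase=math.pi)   # same frequency, shifted by half a period
c = sine(5)                  # different frequency

# Distances measured directly on the waveform samples.
time_same = euclidean(a, b)
time_diff = euclidean(a, c)

# Distances measured on the magnitude spectra.
spec_same = euclidean(magnitude_spectrum(a), magnitude_spectrum(b))
spec_diff = euclidean(magnitude_spectrum(a), magnitude_spectrum(c))

print("time domain:     same freq", time_same, " different freq", time_diff)
print("spectral domain: same freq", spec_same, " different freq", spec_diff)
```

In the time domain, the phase-shifted copy ends up further from the reference than the frame at a different frequency does. In the magnitude spectrum, the phase shift makes essentially no difference, while the change in frequency gives a large distance, which is the kind of behaviour pattern matching needs.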