no different than voice recognitionIt would be different in the sense that you would first need to associate a particular sound with a particular key on a keyboard. Then you'd have to piece it all together to make sense of the number and/or letter combinations. It would be like breaking a code.
ie, A = x sound pattern, B = y sound pattern, etc, etc.