How would voice recognition beat it? Do you really think they’d write software to understand and then ‘press’ the right button? I sorta doubt it.
That kind of software already exists. All it has to do is parse the greeting and generate a tone.
Modern cell phones already have this capability. You can buy systems from Amazon and Google that listen to free-form speech and parse it into questions or product orders.
If software can do this, software can easily parse your message and generate a tone:
https://www.youtube.com/watch?v=ufBLI6bB9sg
https://www.youtube.com/watch?v=hPXS7rC1PWo
https://www.youtube.com/watch?v=9I20frbeawg
And software can do that. ALL of the videos above show real-world capabilities of current commercial products. You can buy them right now.
And you think a little voice greeting directing the caller to press zero is going to stop this kind of software?
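To show how little is involved once the greeting has been turned into text, here is a rough Python sketch. It assumes a speech-to-text step has already produced a transcript (that part is not shown), and the names find_key and dtmf_samples are made up for illustration. It is not how any of the products in the videos work, just the "parse the greeting, generate a tone" idea using the standard DTMF frequency pairs.

```python
import math
import re

# Standard DTMF (low, high) frequency pairs in Hz for each keypad key.
DTMF_FREQS = {
    "1": (697, 1209), "2": (697, 1336), "3": (697, 1477),
    "4": (770, 1209), "5": (770, 1336), "6": (770, 1477),
    "7": (852, 1209), "8": (852, 1336), "9": (852, 1477),
    "*": (941, 1209), "0": (941, 1336), "#": (941, 1477),
}

# Spoken key names a transcript might contain (illustrative, not exhaustive).
SPOKEN_KEYS = {
    "zero": "0", "one": "1", "two": "2", "three": "3", "four": "4",
    "five": "5", "six": "6", "seven": "7", "eight": "8", "nine": "9",
    "star": "*", "pound": "#", "hash": "#",
}


def find_key(transcript):
    """Return the key a greeting asks the caller to press, or None."""
    match = re.search(r"\bpress\s+(?:the\s+)?([0-9*#]|[a-z]+)", transcript.lower())
    if not match:
        return None
    token = match.group(1)
    # Accept a literal key ("0") or a spoken name ("zero").
    return token if token in DTMF_FREQS else SPOKEN_KEYS.get(token)


def dtmf_samples(key, duration=0.2, rate=8000):
    """Return float samples in [-1, 1] for the two-tone DTMF signal of a key."""
    low, high = DTMF_FREQS[key]
    count = round(duration * rate)
    return [
        0.5 * (math.sin(2 * math.pi * low * i / rate)
               + math.sin(2 * math.pi * high * i / rate))
        for i in range(count)
    ]


if __name__ == "__main__":
    greeting = "Hi, if you are a real person, please press zero to reach me."
    key = find_key(greeting)
    if key is not None:
        tone = dtmf_samples(key)
        print(f"Would send key {key!r} as {len(tone)} audio samples")
```

A real greeting could be worded to dodge a simple pattern like this one, so treat the regex as a placeholder for whatever language parsing the caller's software actually uses.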