The voice was a warning from the police to evacuate the area before the bomb detonated. It was probably synthetic, so it was definitely binary.
The police themselves say the warning came from the RV.
Sounded female, middle-aged, white, with a touch of the local accent. But none of that means anything, because, as you say, it was probably synthesized.