The audio clips would have to be adjusted for their position to account for the speed of sound and various echos, but it seems quite possible to get an accurate composite to evaluate.
Some of those 4chan/reddit peeps could do it if TPTB choose not to do it transparently.
Yes, getting a good composite should be no problem.