I don’t consider sharing 20 percent of protein coding to be very ‘similar’ to one another at all.
Can you explain the difference between DNA, which is something like 96% identical (depending on how you measure it), versus "protein coding" which is far less so?
Is it possible that these differences have something to do with the rate at which random mutations accumulate in DNA versus "protein coding"?