What version are you using? Plus there’s a ‘temperature’ variable (you don’t typically see) that tries to create a level of variation in responses. But yes, the prompt matters a lot, as I asked about all of them together it may have done a broader comparison.
I’m using 4