I really like this because it reminds me of an infographic come to life. I see it more as an infographic because I don't really believe everything is in perfect scale if it were reality. If it were reality then our point of view would be at the center but the hippo and rhino would be the closest animal because they are further out from the other animals.
If for instance you are nose to nose with the horse, facing the horse, it would be right in front of you but all the other animals will be on your right, in a half a pyramid or phalanx shape, so if that were true the buffalo, for instance, would be closer to the person viewing the picture than to the person shown in the picture nose to nose with the horse. Here it would seem all the animals are set to scale as if they were in a direct straight line from the person in the picture nose to nose, they would be nose to nose with all of them almost if all the animals were directly in front of them, but they would have to veer away.
You > horse
animal
animal
animal
^
my view
The code above is what I mean if it were real, the one below seems to be the infographic version where they are all in a straight line making their scale the same, but if it were real the animals to the back of the line would be closer to my view and appear bigger than the animals in the front of the line which would be farther away.
You> horse animal animal animal animal
^
my view