The longer explanation is this. Let's say your new endpoint has the IP 10.20.30.40, and your firewalls own 10.20.30.1 (VIP), .2 (active member), and .3 (standby member).
When you tell the new endpoint to ping 10.20.30.1, it doesn't know what MAC address to use for the Ethernet frame. It sends an ARP request to ask who owns that IP. Everything on the network gets that ARP request. The active firewall member replies that it owns the address, the new endpoint caches this reply, and the new endpoint sends the ping request.
The active receives the Ethernet frame and extracts the packet. It sees that the destination IP in the packet is itself, so it hands the packet to its network stack to handle. The network stack sees it's a ping request and sends a ping response, but it doesn't know the destination MAC to use in the Ethernet frame, so it sends an ARP request asking who owns 10.20.30.40. The new endpoint replies, and the reply gets cached in the firewall's ARP table. The firewall then sends the reply.
Without ARP working in both directions, you could not get a response to a ping. If the server owner can describe the symptom they're seeing, we may be able to recommend further troubleshooting steps. Their reported symptom does not match your test results, though.