I band aided that by upping the timeout on the Remote IO to 500ms and then put additional timers on the IO points that could fault/shutdown the main motors.
Increasing the timeout makes retry problems more obvious, not less. The timeout should be as short as it can be, and should reflect the worst case response time of the target plus network infrastructure delays. In the case of Ethernet I/O, the DMIOs, EBCs, or EDRVs should have a very short response time, on the order of a few milliseconds. If you are on a wired network, the infrastructure delays should be sub 1ms. If you are having packet loss, the answer is increased retries, not increased timeouts.
I'm assuming a network broadcast from another device on our PLCnet is to blame. We use subnetting to separate a few networks and prevent routing out from the PLCnet but no VLANS or routers yet although we are quickly heading that way.
Broadcast traffic can be a problem, yes. When we were doing the original testing of the Ethernet I/O Master, we kept seeing retries that we didn't think we should see. we were doing the testing on the company network, but knowing that others would too, we wanted to test that way. After much head scratching we finally identified a switch that was doing some kind of promiscuous ARP that was bursting out 20-30 ARP requests with almost zero delay. It was filling the packet FIFO on the PLC's MAC, causing packets to be lost. Once we stopped the switch from doing that, the packet loss stopped.
1. Why didn't the BX-P-ECOMEX work better than the onboard Ethernet for ISOLATED RemoteIO.
Maybe I missed it, but I didn't see any details about how many devices are connected. I saw references to HMI and DMIO, but not the total number of units. Another consideration is PLC scan time. The ECOMEX doesn't use an interrupt, so it will start losing packets after the MAC's FIFO fills up. The FIFO is pretty deep...if memory serves, I'm thinking it is 30-100 packets depending on how big the packets are. If there is a a lot of activity and a high scan time, you can get past that. The only way to know exactly what is going on is to snoop the PLC's port with a port mirroring switch. I'm happy to look over any Wireshark traces you can pull from that.
2. How would I use the BX-P-ECOMEX to address/route to the Ethernet/IP devices I created for the VFD's I control with my EIPMSG instructions.
The IP stack knows what address you are targeting. If you specify an address on the ECOMEX's subnet, it'll route to it. If it's on the internal port's subnet it'll route to it. If it isn't on either subnet, it'll route to the first gateway configured.
If cases where broadcasts are required (which doesn't allow for routing), we allow you to pick the port.
3. What would cause the retry counts on the DMIO is it some kind of broadcast traffic?
Excess traffic or noise. You mentioned VFDs. They are absolute comm killers.
Last thought...if I wanted to to keep the I/O responsive in a less than perfect environment, I would reduce the timeout to the lowest stable setting (per my description above), possibly increase retries beyond the default 4, keep the scan interval low or 0, and make sure the PLC's scan time is as low as possible. Your retries may be high, but the I/O throughput will be higher. Timeouts significantly higher than normal response times don't improve packet delivery, they just delay the retry.
We're also happy work more directly with you. Contact support at hosteng.com.